Patrick Liu's picture

8

Patrick Liu

patrick2015

·

AI & ML interests

Focus on ML use cases like AML, Anti-fraud, etc. Now shifting to LLM based solutions.

Recent Activity

upvoted a paper 1 day ago

CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?

upvoted a paper 1 day ago

Advancing Creative Physical Intelligence in Large Multimodal Models

upvoted a paper 1 day ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

View all activity

Organizations

None yet

upvoted 6 papers 1 day ago

CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?

Paper • 2510.24505 • Published Oct 28, 2025 • 5

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published about 1 month ago • 21

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 23

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Paper • 2601.11004 • Published Jan 16 • 31

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

Paper • 2511.02734 • Published Nov 4, 2025 • 23

GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces

Paper • 2604.04017 • Published Apr 5 • 8

upvoted a collection 1 day ago

awesome-agentic-benchmarks

3 items • Updated 1 day ago • 2

upvoted a paper 1 day ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 4 days ago • 85