Barv's picture

5

Barv

dogp8999

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation

upvoted a paper 12 days ago

FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark for Evaluating LLMs

upvoted a paper 2 months ago

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

View all activity

Organizations

None yet

upvoted a paper 11 days ago

DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation

Paper • 2510.09116 • Published 15 days ago • 94

upvoted a paper 12 days ago

FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark for Evaluating LLMs

Paper • 2510.08886 • Published 16 days ago • 19

upvoted a paper 2 months ago

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

Paper • 2508.13491 • Published Aug 19 • 58

upvoted 2 papers 4 months ago

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

Paper • 2506.14028 • Published Jun 16 • 93

FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial Information

Paper • 2505.20650 • Published May 27 • 17