LongCat

community

Recent Activity

swt submitted a paper 4 days ago

AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

Fengjiao updated a dataset 5 days ago

meituan-longcat/LARYBench

Fengjiao submitted a paper 11 days ago

LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment

Papers

AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment

meituan-longcat's datasets (21)

meituan-longcat/LARYBench

Updated 5 days ago • 6.7k • 15

meituan-longcat/General365_Public

Viewer • Updated 12 days ago • 720 • 118 • 6

meituan-longcat/AMO-Bench

Viewer • Updated Feb 5 • 50 • 3.24k • 30

meituan-longcat/VitaBench

Updated Jan 27 • 680 • 26

meituan-longcat/CEdit-Bench

Viewer • Updated Dec 5, 2025 • 1.46k • 53 • 3

meituan-longcat/UNO-Bench

Viewer • Updated Dec 4, 2025 • 3.73k • 801 • 22

meituan-longcat/R-HORIZON-Websearch

Viewer • Updated Oct 21, 2025 • 505 • 111

meituan-longcat/R-HORIZON-AMC23

Viewer • Updated Oct 21, 2025 • 200 • 1.94k

meituan-longcat/R-HORIZON-Math500

Viewer • Updated Oct 21, 2025 • 2.5k • 23

meituan-longcat/R-HORIZON-AIME25

Viewer • Updated Oct 21, 2025 • 150 • 27

meituan-longcat/R-HORIZON-AIME24

Viewer • Updated Oct 21, 2025 • 150 • 33

meituan-longcat/R-HORIZON-training-data

Updated Oct 21, 2025 • 21 • 2

meituan-longcat/Meeseeks

Updated Sep 10, 2025 • 128 • 1

meituan-longcat/OIBench

Viewer • Updated Jul 15, 2025 • 275 • 45 • 1

meituan-longcat/CoreCodeBench-Source_Copy

Updated Jul 7, 2025 • 25

meituan-longcat/ViC-Bench

Preview • Updated May 30, 2025 • 301 • 2

meituan-longcat/Q-Eval-100K

Updated May 27, 2025 • 8.13k • 10

meituan-longcat/Audio-Turing-Test-Corpus

Preview • Updated May 16, 2025 • 36 • 1

meituan-longcat/Audio-Turing-Test-Audios

Viewer • Updated May 16, 2025 • 104 • 24 • 1

meituan-longcat/CoreCodeBench-Multi

Viewer • Updated May 15, 2025 • 420 • 43 • 1

meituan-longcat/CoreCodeBench-Single

Viewer • Updated May 15, 2025 • 3.48k • 80