Paper - a FalconLlamalpaca Collection

FalconLlamalpaca 's Collections

Chain of thought

Olympic Coder Datasets

Agentic

Speech & Vision LLMs

Paper

Spaces

LLMs

Paper

updated 3 days ago

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published Oct 17, 2024 • 61
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published Mar 7 • 46
Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 96
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 53
facebook/blt

Updated Apr 30 • 53 • 73
Running

86

86

Large Reasoning Models Leaderboard

🐳

A leaderboard to rank large reasoning models
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement

Paper • 2410.13842 • Published Oct 17, 2024 • 6
SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 94
ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models

Paper • 2508.18773 • Published Aug 26 • 15
NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings

Paper • 2509.04011 • Published Sep 4 • 28
Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published Sep 4 • 73
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1 • 56
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9 • 98
Behavioral Fingerprinting of Large Language Models

Paper • 2509.04504 • Published Sep 2 • 5
Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12 • 26
Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion

Paper • 2501.17887 • Published Jan 27 • 1
Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld's Episode Theory

Paper • 2509.14662 • Published Sep 18 • 13
Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published 26 days ago • 73
Variational Reasoning for Language Models

Paper • 2509.22637 • Published 24 days ago • 68
Fine-tuning Done Right in Model Editing

Paper • 2509.22072 • Published 24 days ago • 27
Sequential Diffusion Language Models

Paper • 2509.24007 • Published 22 days ago • 41
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Paper • 2509.26628 • Published 20 days ago • 12
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published 20 days ago • 46
MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation

Paper • 2509.26391 • Published 20 days ago • 18
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published 22 days ago • 114
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published 21 days ago • 132
GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published 19 days ago • 86
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Paper • 2509.26625 • Published 20 days ago • 42
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published 24 days ago • 29
RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published 24 days ago • 37
Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 7 days ago • 30
RAG-Anything: All-in-One RAG Framework

Paper • 2510.12323 • Published 6 days ago • 22