Demystifying Reinforcement Learning in Agentic Reasoning • Paper • 2510.11701 • Published 5 days ago • 27
DocReward: A Document Reward Model for Structuring and Stylizing • Paper • 2510.11391 • Published 5 days ago • 26
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs • Paper • 2510.11696 • Published 5 days ago • 154
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation • Paper • 2510.09116 • Published 8 days ago • 92
FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution • Paper • 2510.12747 • Published 4 days ago • 31
Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks • Paper • 2510.12635 • Published 4 days ago • 14
LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens • Paper • 2510.11919 • Published 5 days ago • 4
Diffusion Transformers with Representation Autoencoders • Paper • 2510.11690 • Published 5 days ago • 138
InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue • Paper • 2510.13747 • Published 3 days ago • 28
Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs • Paper • 2510.13795 • Published 3 days ago • 42
The Art of Scaling Reinforcement Learning Compute for LLMs • Paper • 2510.13786 • Published 3 days ago • 21
When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA • Paper • 2510.04849 • Published 12 days ago • 88
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • Paper • 2510.14979 • Published 1 day ago • 53
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar • Paper • 2510.14972 • Published 1 day ago • 27
Attention Is All You Need for KV Cache in Diffusion LLMs • Paper • 2510.14973 • Published 1 day ago • 25
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model • Paper • 2510.14528 • Published 2 days ago • 28
Large Language Models Do NOT Really Know What They Don't Know • Paper • 2510.09033 • Published 8 days ago • 14
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures • Paper • 2510.14616 • Published 2 days ago • 9
LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild • Paper • 2510.14240 • Published 2 days ago • 9
MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems • Paper • 2510.14252 • Published 2 days ago • 2
RAGCap-Bench: Benchmarking Capabilities of LLMs in Agentic Retrieval Augmented Generation Systems • Paper • 2510.13910 • Published 3 days ago • 1