Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning Paper • 2511.14460 • Published 2 days ago • 13
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark Paper • 2511.13853 • Published 3 days ago • 33
MetaCluster: Enabling Deep Compression of Kolmogorov-Arnold Network Paper • 2510.19105 • Published 29 days ago • 1
Kontext CAM Angles Collection Multiple Adapters for Heterogeneous Angles • 6 items • Updated 11 days ago • 2
MetaCLIP2 Image Classification Experiments Collection Domain-Specific Downstream Tasks • 5 items • Updated 5 days ago • 2
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis Paper • 2501.04561 • Published Jan 8 • 17
From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities Paper • 2410.02155 • Published Oct 3, 2024 • 4
ClaimVer: Explainable Claim-Level Verification and Evidence Attribution of Text Through Knowledge Graphs Paper • 2403.09724 • Published Mar 12, 2024 • 2
CODA: Repurposing Continuous VAEs for Discrete Tokenization Paper • 2503.17760 • Published Mar 22 • 4
Cognition Collection Perception and abstraction. Each modality is tokenized and embedded into vectors for the model to comprehend. • 200 items • Updated Apr 15 • 6
Representation & Optimization Collection Understanding representation sheds light on optimization • 98 items • Updated 17 days ago • 5
BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design Paper • 2508.21184 • Published Aug 28 • 2