DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning Paper • 2602.11089 • Published 6 days ago • 18
Low Rank Sparse Attention Collection Open source weights of Lorsa modules introduced in "Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition". • 3 items • Updated 7 days ago • 3
Game-RL Collection [ICLR 2026] Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning • 8 items • Updated 7 days ago • 4
FutureOmni Collection First Omni-modal Future Forecasting Benchmark • 2 items • Updated 7 days ago • 2
ABC-Bench Collection Evaluating Agentic Backend Coding Capabilities in Real-World Development Scenarios • 4 items • Updated 7 days ago • 3
MOSS Transcribe Diarize Collection A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription. • 2 items • Updated 7 days ago • 5
OpenMed/OpenMed-NER-GenomicDetect-PubMed-109M Token Classification • 0.1B • Updated Aug 5, 2025 • 154k • • 3
OpenMed/OpenMed-NER-DiseaseDetect-BioMed-335M Token Classification • 0.3B • Updated Aug 5, 2025 • 182k • • 6
Italian PII & De-Identification Collection 33 models for Italian PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 35 items • Updated about 9 hours ago • 2
German PII & De-Identification Collection 33 models for German PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 35 items • Updated about 9 hours ago • 2
French PII & De-Identification Collection 33 models for French PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 35 items • Updated about 9 hours ago • 3
Multilingual PII & De-Identification Collection Multilingual models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 155 items • Updated about 9 hours ago • 19