The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models Paper • 2601.10387 • Published 8 days ago • 10
LucaOne Collection Generalized biological foundation model with unified nucleic acid and protein language (Nature Machine Intelligence), https://github.com/LucaOne/LucaOne • 6 items • Updated 23 days ago • 2
Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 19 days ago • 36
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published about 1 month ago • 17
Are We on the Right Way to Assessing LLM-as-a-Judge? Paper • 2512.16041 • Published Dec 17, 2025 • 33
Hierarchical Dataset Selection for High-Quality Data Sharing Paper • 2512.10952 • Published Dec 11, 2025 • 2
Causal Judge Evaluation: Calibrated Surrogate Metrics for LLM Systems Paper • 2512.11150 • Published Dec 11, 2025 • 6
Skywork-Reward-V2 Collection Scaling preference data curation to the extreme • 9 items • Updated Jul 4, 2025 • 26
Reward Models 10-2025 Collection A collection of great reward models for research and production • 7 items • Updated 3 days ago • 12
Olmo 3 Pre-training Collection All artifacts related to Olmo 3 pre-training • 10 items • Updated Dec 23, 2025 • 32
Mitigating Label Length Bias in Large Language Models Paper • 2511.14385 • Published Nov 18, 2025 • 8
Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5, 2025 • 58
Nemotron RAG Collection Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex do • 11 items • Updated 3 days ago • 63
OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation Paper • 2511.13655 • Published Nov 17, 2025 • 10
Article The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs Nov 15, 2025 • 13