SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper ⢠2602.02361 ⢠Published Feb 2 ⢠60
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper ⢠2601.04720 ⢠Published Jan 8 ⢠56
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper ⢠2512.01374 ⢠Published Dec 1, 2025 ⢠105
RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing Paper ⢠2507.20352 ⢠Published Jul 27, 2025
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper ⢠2506.01939 ⢠Published Jun 2, 2025 ⢠188
Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability Paper ⢠2505.24147 ⢠Published May 30, 2025
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models Paper ⢠2506.05176 ⢠Published Jun 5, 2025 ⢠79