RLFR: Extending Reinforcement Learning for LLMs with Flow Environment Paper • 2510.10201 • Published 8 days ago • 35
HuggingFaceTB/SmolVLM2-256M-Video-Instruct Image-Text-to-Text • 0.3B • Updated Apr 8 • 456k • 80
MME-SCI: A Comprehensive and Challenging Science Benchmark for Multimodal Large Language Models Paper • 2508.13938 • Published Aug 19 • 1
microsoft/Phi-3.5-vision-instruct Image-Text-to-Text • 4B • Updated Sep 26, 2024 • 567k • 706
Internal Consistency and Self-Feedback in Large Language Models: A Survey Paper • 2407.14507 • Published Jul 19, 2024 • 46