PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper β’ 2510.14528 β’ Published Oct 16 β’ 98
Group-in-Group Policy Optimization for LLM Agent Training Paper β’ 2505.10978 β’ Published May 16 β’ 18
G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration Paper β’ 2508.11379 β’ Published Aug 15 β’ 12
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success Paper β’ 2508.04280 β’ Published Aug 6 β’ 35
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success Paper β’ 2508.04280 β’ Published Aug 6 β’ 35
Reinforcement Learning for Long-Horizon Interactive LLM Agents Paper β’ 2502.01600 β’ Published Feb 3 β’ 1
view article Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! Mar 7 β’ 88