MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering Paper • 2601.22859 • Published 8 days ago • 16
VLS: Steering Pretrained Robot Policies via Vision-Language Models Paper • 2602.03973 • Published 3 days ago • 20
SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation Paper • 2602.02402 • Published 4 days ago • 31
EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models Paper • 2602.04515 • Published 3 days ago • 33
MARS: Modular Agent with Reflective Search for Automated AI Research Paper • 2602.02660 • Published 4 days ago • 56
YOLOE-26: Integrating YOLO26 with YOLOE for Real-Time Open-Vocabulary Instance Segmentation Paper • 2602.00168 • Published 8 days ago • 1
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published 6 days ago • 217
Open-AgentRL Collection RLAnything & DemyAgent: Open-Source RL for LLMs and Agentic Scenarios • 12 items • Updated 4 days ago • 5
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System Paper • 2602.02488 • Published 4 days ago • 30
PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction Paper • 2601.22046 • Published 8 days ago • 21
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published 8 days ago • 81
DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation Paper • 2601.22904 • Published 8 days ago • 13
DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment Paper • 2601.20218 • Published 10 days ago • 15
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published 7 days ago • 129
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 18 items • Updated 2 days ago • 30