jina-embeddings-v5-omni: Text-Geometry-Preserving Multimodal Embeddings via Frozen-Tower Composition Paper • 2605.08384 • Published 7 days ago • 7
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 11 days ago • 305
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation Paper • 2604.28196 • Published 15 days ago • 71
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 16 days ago • 106
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 25 days ago • 45
view article Article Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs +3 lapp0, LouisCastricato, ScottieFox, shahbuland, xAesthetics • Apr 9 • 29
Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments Paper • 2602.11964 • Published Feb 12 • 13
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders Paper • 2412.09586 • Published Dec 12, 2024 • 6
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation Paper • 2510.04290 • Published Oct 5, 2025 • 21
story writing favourites Collection Models I personally liked for generating stories in the past. Not a recommendation, most of these are outdated. • 17 items • Updated Mar 2 • 102