view article Article European Vision-Language Model Optimization for Nuclear Regulatory Data Aug 26 • 6
CognitiveDrone: A VLA Model and Evaluation Benchmark for Real-Time Cognitive Task Solving and Reasoning in UAVs Paper • 2503.01378 • Published Mar 3 • 5
WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning Paper • 2509.22644 • Published Sep 26 • 20
view article Article UI-DETR-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions Oct 1 • 16
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 548
Multimodal DSE Retrievers Collection A collection of DSE models for multimodal retrieval • 5 items • Updated Apr 15 • 15
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face Jul 29 • 199
VisionDocumentRetrieval Datasets Collection Datasets for vision document retrieval (VDR) • 21 items • Updated 11 days ago • 7
view article Article Advancing European AI Sovereignty Through Racine.ai Flantier Open-Source Multimodal Models Mar 26 • 12