Democratizing Fine-grained Visual Recognition with Large Language Models Paper • 2401.13837 • Published Jan 24, 2024 • 1
Superpowering Open-Vocabulary Object Detectors for X-ray Vision Paper • 2503.17071 • Published Mar 21 • 1
Knowledge to Sight: Reasoning over Visual Attributes via Knowledge Decomposition for Abnormality Grounding Paper • 2508.04572 • Published Aug 6 • 1
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection Paper • 2405.10053 • Published May 16, 2024 • 1
Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery Paper • 2303.15975 • Published Mar 28, 2023 • 1
Organizing Unstructured Image Collections using Natural Language Paper • 2410.05217 • Published Oct 7, 2024 • 1
Test-time Vocabulary Adaptation for Language-driven Object Detection Paper • 2506.00333 • Published May 31 • 1
UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos Paper • 2510.15018 • Published 16 days ago • 1