Multi3DRefer: Grounding Text Description to Multiple 3D Objects Paper • 2309.05251 • Published Sep 11, 2023
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images Paper • 2406.11579 • Published Jun 17, 2024
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning Paper • 2503.07572 • Published Mar 10 • 47
An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion Paper • 2408.03178 • Published Aug 6, 2024 • 41
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Paper • 2402.09844 • Published Feb 15, 2024 • 21
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents Paper • 2306.16527 • Published Jun 21, 2023 • 46