Audio2Face-3D Collection Open-weight networks and a test dataset for the training framework • 8 items • Updated about 9 hours ago • 8
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds Paper • 2508.14879 • Published Aug 20 • 65
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V Paper • 2310.11441 • Published Oct 17, 2023 • 29