PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image Paper • 2511.13648 • Published 3 days ago • 43
Simulating the Visual World with Artificial Intelligence: A Roadmap Paper • 2511.08585 • Published 9 days ago • 28
Uniform Discrete Diffusion with Metric Path for Video Generation Paper • 2510.24717 • Published 23 days ago • 39
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16 • 65
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning Paper • 2410.15266 • Published Oct 20, 2024
NEO1_0 Collection From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated Oct 17 • 3
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16 • 65
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16 • 65 • 2
NEO1_0 Collection From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated Oct 17 • 3
NEO1_0 Collection From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated Oct 17 • 3
NEO1_0 Collection From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated Oct 17 • 3