OneWorld: Taming Scene Generation with 3D Unified Representation Autoencoder Paper • 2603.16099 • Published 8 days ago • 1
Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding Paper • 2510.15253 • Published Oct 17, 2025
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment Paper • 2505.21494 • Published May 27, 2025 • 8
Hyper3D: Efficient 3D Representation via Hybrid Triplane and Octree Feature for Enhanced 3D Shape Variational Auto-Encoders Paper • 2503.10403 • Published Mar 13, 2025
OneWorld: Taming Scene Generation with 3D Unified Representation Autoencoder Paper • 2603.16099 • Published 8 days ago • 1
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence Paper • 2510.23538 • Published Oct 27, 2025 • 98
Thinking with Geometry: Active Geometry Integration for Spatial Reasoning Paper • 2602.06037 • Published Feb 5 • 1
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders Paper • 2601.16208 • Published Jan 22 • 55
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies Paper • 2508.20072 • Published Aug 27, 2025 • 32
Build error 86 Super Resolution Anime Diffusion 🐉 86 Generate high-quality anime images with super resolution
FlashWorld: High-quality 3D Scene Generation within Seconds Paper • 2510.13678 • Published Oct 15, 2025 • 74
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper • 2512.16676 • Published Dec 18, 2025 • 221