 zzfive
			's Collections
			zzfive
			's Collections
			
			
				
				
 - TextureDreamer: Image-guided Texture Synthesis through Geometry-aware
  Diffusion- 
			Paper
			 •- 
			2401.09416
			 •
			Published
				
			•- 
				11
			 
 - SHINOBI: Shape and Illumination using Neural Object Decomposition via
  BRDF Optimization In-the-wild- 
			Paper
			 •- 
			2401.10171
			 •
			Published
				
			•- 
				14
			 
 - DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction
  Model- 
			Paper
			 •- 
			2311.09217
			 •
			Published
				
			•- 
				22
			 
 - GALA: Generating Animatable Layered Assets from a Single Scan- 
			Paper
			 •- 
			2401.12979
			 •
			Published
				
			•- 
				9
			 
 - ReplaceAnything3D:Text-Guided 3D Scene Editing with Compositional Neural
  Radiance Fields- 
			Paper
			 •- 
			2401.17895
			 •
			Published
				
			•- 
				16
			 
 - Advances in 3D Generation: A Survey- 
			Paper
			 •- 
			2401.17807
			 •
			Published
				
			•- 
				19
			 
 - AToM: Amortized Text-to-Mesh using 2D Diffusion- 
			Paper
			 •- 
			2402.00867
			 •
			Published
				
			•- 
				11
			 
 - GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object
  with Gaussian Splatting- 
			Paper
			 •- 
			2402.10259
			 •
			Published
				
			•- 
				16
			 
 - MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for
  Single or Sparse-view 3D Object Reconstruction- 
			Paper
			 •- 
			2402.12712
			 •
			Published
				
			•- 
				18
			 
 - FlashTex: Fast Relightable Mesh Texturing with LightControlNet- 
			Paper
			 •- 
			2402.13251
			 •
			Published
				
			•- 
				15
			 
 - Consolidating Attention Features for Multi-view Image Editing- 
			Paper
			 •- 
			2402.14792
			 •
			Published
				
			•- 
				8
			 
 - MVD^2: Efficient Multiview 3D Reconstruction for Multiview Diffusion- 
			Paper
			 •- 
			2402.14253
			 •
			Published
				
			•- 
				7
			 
 - ViewFusion: Towards Multi-View Consistency via Interpolated Denoising- 
			Paper
			 •- 
			2402.18842
			 •
			Published
				
			•- 
				15
			 
 - TripoSR: Fast 3D Object Reconstruction from a Single Image- 
			Paper
			 •- 
			2403.02151
			 •
			Published
				
			•- 
				16
			 
 - ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models- 
			Paper
			 •- 
			2403.01807
			 •
			Published
				
			•- 
				9
			 
 - CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction
  Model- 
			Paper
			 •- 
			2403.05034
			 •
			Published
				
			•- 
				22
			 
 - 3D-VLA: A 3D Vision-Language-Action Generative World Model- 
			Paper
			 •- 
			2403.09631
			 •
			Published
				
			•- 
				10
			 
 - GVGEN: Text-to-3D Generation with Volumetric Representation- 
			Paper
			 •- 
			2403.12957
			 •
			Published
				
			•- 
				6
			 
 - GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation- 
			Paper
			 •- 
			2403.12365
			 •
			Published
				
			•- 
				11
			 
 - TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation- 
			Paper
			 •- 
			2403.12906
			 •
			Published
				
			•- 
				7
			 
 - Compress3D: a Compressed Latent Space for 3D Generation from a Single
  Image- 
			Paper
			 •- 
			2403.13524
			 •
			Published
				
			•- 
				8
			 
 - VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation- 
			Paper
			 •- 
			2403.17001
			 •
			Published
				
			•- 
				6
			 
 - Gamba: Marry Gaussian Splatting with Mamba for single view 3D
  reconstruction- 
			Paper
			 •- 
			2403.18795
			 •
			Published
				
			•- 
				20
			 
 - GaussianCube: Structuring Gaussian Splatting using Optimal Transport for
  3D Generative Modeling- 
			Paper
			 •- 
			2403.19655
			 •
			Published
				
			•- 
				19
			 
 - Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field
  Representation and Generation- 
			Paper
			 •- 
			2403.19319
			 •
			Published
				
			•- 
				14
			 
 - FlexiDreamer: Single Image-to-3D Generation with FlexiCubes- 
			Paper
			 •- 
			2404.00987
			 •
			Published
				
			•- 
				23
			 
 - PointInfinity: Resolution-Invariant Point Diffusion Models- 
			Paper
			 •- 
			2404.03566
			 •
			Published
				
			•- 
				16
			 
 - Robust Gaussian Splatting- 
			Paper
			 •- 
			2404.04211
			 •
			Published
				
			•- 
				10
			 
 - Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion- 
			Paper
			 •- 
			2404.06429
			 •
			Published
				
			•- 
				7
			 
 - MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based
  Monocular Guidance- 
			Paper
			 •- 
			2404.08252
			 •
			Published
				
			•- 
				6
			 
 - CompGS: Efficient 3D Scene Representation via Compressed Gaussian
  Splatting- 
			Paper
			 •- 
			2404.09458
			 •
			Published
				
			•- 
				7
			 
 - Taming Latent Diffusion Model for Neural Radiance Field Inpainting- 
			Paper
			 •- 
			2404.09995
			 •
			Published
				
			•- 
				7
			 
 - MeshLRM: Large Reconstruction Model for High-Quality Mesh- 
			Paper
			 •- 
			2404.12385
			 •
			Published
				
			•- 
				27
			 
 - Interactive3D: Create What You Want by Interactive 3D Generation- 
			Paper
			 •- 
			2404.16510
			 •
			Published
				
			•- 
				21
			 
 - CAT3D: Create Anything in 3D with Multi-View Diffusion Models- 
			Paper
			 •- 
			2405.10314
			 •
			Published
				
			•- 
				48
			 
 - Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode
  Multi-view Latent Diffusion- 
			Paper
			 •- 
			2405.09874
			 •
			Published
				
			•- 
				20
			 
 - Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory
  Score Matching- 
			Paper
			 •- 
			2405.11252
			 •
			Published
				
			•- 
				16
			 
 - CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and
  Interactive Geometry Refiner- 
			Paper
			 •- 
			2405.14979
			 •
			Published
				
			•- 
				19
			 
 - HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed
  via Gaussian Splatting- 
			Paper
			 •- 
			2405.15125
			 •
			Published
				
			•- 
				8
			 
 - Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with
  Dynamic Gaussian Surfels- 
			Paper
			 •- 
			2405.16822
			 •
			Published
				
			•- 
				12
			 
 - Part123: Part-aware 3D Reconstruction from a Single-view Image- 
			Paper
			 •- 
			2405.16888
			 •
			Published
				
			•- 
				12
			 
 - GFlow: Recovering 4D World from Monocular Video- 
			Paper
			 •- 
			2405.18426
			 •
			Published
				
			•- 
				17
			 
 - 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian
  Splatting- 
			Paper
			 •- 
			2405.18424
			 •
			Published
				
			•- 
				9
			 
 - NPGA: Neural Parametric Gaussian Avatars- 
			Paper
			 •- 
			2405.19331
			 •
			Published
				
			•- 
				10
			 
 - GECO: Generative Image-to-3D within a SECOnd- 
			Paper
			 •- 
			2405.20327
			 •
			Published
				
			•- 
				11
			 
 - PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting- 
			Paper
			 •- 
			2405.19957
			 •
			Published
				
			•- 
				10
			 
 - 4Diffusion: Multi-view Video Diffusion Model for 4D Generation- 
			Paper
			 •- 
			2405.20674
			 •
			Published
				
			•- 
				15
			 
 - Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion- 
			Paper
			 •- 
			2406.03184
			 •
			Published
				
			•- 
				22
			 
 - 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion
  Models- 
			Paper
			 •- 
			2406.07472
			 •
			Published
				
			•- 
				13
			 
 - Physics3D: Learning Physical Properties of 3D Gaussians via Video
  Diffusion- 
			Paper
			 •- 
			2406.04338
			 •
			Published
				
			•- 
				39
			 
 - 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and
  Less Hallucination- 
			Paper
			 •- 
			2406.05132
			 •
			Published
				
			•- 
				30
			 
 - Real3D: Scaling Up Large Reconstruction Models with Real-World Images- 
			Paper
			 •- 
			2406.08479
			 •
			Published
				
			•- 
				7
			 
 - LRM-Zero: Training Large Reconstruction Models with Synthesized Data- 
			Paper
			 •- 
			2406.09371
			 •
			Published
				
			•- 
				5
			 
 - GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors- 
			Paper
			 •- 
			2406.10111
			 •
			Published
				
			•- 
				6
			 
 - MeshAnything: Artist-Created Mesh Generation with Autoregressive
  Transformers- 
			Paper
			 •- 
			2406.10163
			 •
			Published
				
			•- 
				33
			 
 - L4GM: Large 4D Gaussian Reconstruction Model- 
			Paper
			 •- 
			2406.10324
			 •
			Published
				
			•- 
				13
			 
 - ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians- 
			Paper
			 •- 
			2406.16815
			 •
			Published
				
			•- 
				7
			 
 - YouDream: Generating Anatomically Controllable Consistent Text-to-3D
  Animals- 
			Paper
			 •- 
			2406.16273
			 •
			Published
				
			•- 
				43
			 
 - GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly
  Enhanced Quality- 
			Paper
			 •- 
			2406.18462
			 •
			Published
				
			•- 
				12
			 
 - Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side
  Images- 
			Paper
			 •- 
			2407.06191
			 •
			Published
				
			•- 
				14
			 
 - RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models- 
			Paper
			 •- 
			2407.06938
			 •
			Published
				
			•- 
				25
			 
 - Controlling Space and Time with Diffusion Models- 
			Paper
			 •- 
			2407.07860
			 •
			Published
				
			•- 
				17
			 
 - StyleSplat: 3D Object Style Transfer with Gaussian Splatting- 
			Paper
			 •- 
			2407.09473
			 •
			Published
				
			•- 
				13
			 
 - CharacterGen: Efficient 3D Character Generation from Single Images with
  Multi-View Pose Canonicalization- 
			Paper
			 •- 
			2402.17214
			 •
			Published
				
			•- 
				2
			 
 - DreamCatalyst: Fast and High-Quality 3D Editing via Controlling
  Editability and Identity Preservation- 
			Paper
			 •- 
			2407.11394
			 •
			Published
				
			•- 
				12
			 
 - Animate3D: Animating Any 3D Model with Multi-view Video Diffusion- 
			Paper
			 •- 
			2407.11398
			 •
			Published
				
			•- 
				10
			 
 - Click-Gaussian: Interactive Segmentation to Any 3D Gaussians- 
			Paper
			 •- 
			2407.11793
			 •
			Published
				
			•- 
				3
			 
 - Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for
  Unconstrained Photo Collections- 
			Paper
			 •- 
			2407.12306
			 •
			Published
				
			•- 
				6
			 
 - Shape of Motion: 4D Reconstruction from a Single Video- 
			Paper
			 •- 
			2407.13764
			 •
			Published
				
			•- 
				20
			 
 - PlacidDreamer: Advancing Harmony in Text-to-3D Generation- 
			Paper
			 •- 
			2407.13976
			 •
			Published
				
			•- 
				5
			 
 - BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis
  in Large-scale Scenes- 
			Paper
			 •- 
			2407.15848
			 •
			Published
				
			•- 
				17
			 
 - HoloDreamer: Holistic 3D Panoramic World Generation from Text
  Descriptions- 
			Paper
			 •- 
			2407.15187
			 •
			Published
				
			•- 
				13
			 
 - Temporal Residual Jacobians For Rig-free Motion Transfer- 
			Paper
			 •- 
			2407.14958
			 •
			Published
				
			•- 
				5
			 
 - F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions- 
			Paper
			 •- 
			2407.12435
			 •
			Published
				
			•- 
				14
			 
 - SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View
  Consistency- 
			Paper
			 •- 
			2407.17470
			 •
			Published
				
			•- 
				16
			 
 - DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car
  Reconstruction- 
			Paper
			 •- 
			2407.16988
			 •
			Published
				
			•- 
				8
			 
 - Floating No More: Object-Ground Reconstruction from a Single Image- 
			Paper
			 •- 
			2407.18914
			 •
			Published
				
			•- 
				20
			 
 - Cycle3D: High-quality and Consistent Image-to-3D Generation via
  Generation-Reconstruction Cycle- 
			Paper
			 •- 
			2407.19548
			 •
			Published
				
			•- 
				27
			 
 - Expressive Whole-Body 3D Gaussian Avatar- 
			Paper
			 •- 
			2407.21686
			 •
			Published
				
			•- 
				8
			 
 - Improving 2D Feature Representations by 3D-Aware Fine-Tuning- 
			Paper
			 •- 
			2407.20229
			 •
			Published
				
			•- 
				7
			 
 - NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation
  Learning for Neural Radiance Fields- 
			Paper
			 •- 
			2404.01300
			 •
			Published
				
			•- 
				4
			 
 - SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and
  Illumination Disentanglement- 
			Paper
			 •- 
			2408.00653
			 •
			Published
				
			•- 
				32
			 
 - TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and
  Resampling- 
			Paper
			 •- 
			2408.01291
			 •
			Published
				
			•- 
				13
			 
 - MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh
  Tokenization- 
			Paper
			 •- 
			2408.02555
			 •
			Published
				
			•- 
				32
			 
 - An Object is Worth 64x64 Pixels: Generating 3D Object via Image
  Diffusion- 
			Paper
			 •- 
			2408.03178
			 •
			Published
				
			•- 
				40
			 
 - RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel
  View Synthesis- 
			Paper
			 •- 
			2408.03356
			 •
			Published
				
			•- 
				10
			 
 - Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields- 
			Paper
			 •- 
			2408.03822
			 •
			Published
				
			•- 
				14
			 
 - Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from
  User's Casual Sketches- 
			Paper
			 •- 
			2408.04567
			 •
			Published
				
			•- 
				26
			 
 - FruitNeRF: A Unified Neural Radiance Field based Fruit Counting
  Framework- 
			Paper
			 •- 
			2408.06190
			 •
			Published
				
			•- 
				18
			 
 - HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors- 
			Paper
			 •- 
			2408.06019
			 •
			Published
				
			•- 
				15
			 
 - SlotLifter: Slot-guided Feature Lifting for Learning Object-centric
  Radiance Fields- 
			Paper
			 •- 
			2408.06697
			 •
			Published
				
			•- 
				15
			 
 - 3D Gaussian Editing with A Single Image- 
			Paper
			 •- 
			2408.07540
			 •
			Published
				
			•- 
				12
			 
 - MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and
  3D Editing- 
			Paper
			 •- 
			2408.08000
			 •
			Published
				
			•- 
				9
			 
 - MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction
  Model- 
			Paper
			 •- 
			2408.10198
			 •
			Published
				
			•- 
				35
			 
 - SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse
  Views- 
			Paper
			 •- 
			2408.10195
			 •
			Published
				
			•- 
				13
			 
 - ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their
  Self-Supervised Pretraining- 
			Paper
			 •- 
			2408.10906
			 •
			Published
				
			•- 
				3
			 
 - DreamCinema: Cinematic Transfer with Free Camera and 3D Character- 
			Paper
			 •- 
			2408.12601
			 •
			Published
				
			•- 
				31
			 
 - Subsurface Scattering for 3D Gaussian Splatting- 
			Paper
			 •- 
			2408.12282
			 •
			Published
				
			•- 
				7
			 
 - LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation- 
			Paper
			 •- 
			2408.13252
			 •
			Published
				
			•- 
				26
			 
 - T3M: Text Guided 3D Human Motion Synthesis from Speech- 
			Paper
			 •- 
			2408.12885
			 •
			Published
				
			•- 
				13
			 
 - FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting
  for Customizable Rendering- 
			Paper
			 •- 
			2408.12894
			 •
			Published
				
			•- 
				6
			 
 - MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware
  Diffusion and Iterative Refinement- 
			Paper
			 •- 
			2408.14211
			 •
			Published
				
			•- 
				11
			 
 - Towards Realistic Example-based Modeling via 3D Gaussian Stitching- 
			Paper
			 •- 
			2408.15708
			 •
			Published
				
			•- 
				8
			 
 - ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion
  Model- 
			Paper
			 •- 
			2408.16767
			 •
			Published
				
			•- 
				32
			 
 - SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners- 
			Paper
			 •- 
			2408.16768
			 •
			Published
				
			•- 
				28
			 
 - 3D Reconstruction with Spatial Memory- 
			Paper
			 •- 
			2408.16061
			 •
			Published
				
			•- 
				15
			 
 - GST: Precise 3D Human Body from a Single Image with Gaussian Splatting
  Transformers- 
			Paper
			 •- 
			2409.04196
			 •
			Published
				
			•- 
				16
			 
 - UniDet3D: Multi-dataset Indoor 3D Object Detection- 
			Paper
			 •- 
			2409.04234
			 •
			Published
				
			•- 
				9
			 
 - Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video
  Diffusion Models- 
			Paper
			 •- 
			2409.07452
			 •
			Published
				
			•- 
				21
			 
 - FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally- 
			Paper
			 •- 
			2409.08270
			 •
			Published
				
			•- 
				12
			 
 - Phidias: A Generative Model for Creating 3D Content from Text, Image,
  and 3D Conditions with Reference-Augmented Diffusion- 
			Paper
			 •- 
			2409.11406
			 •
			Published
				
			•- 
				27
			 
 - SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction- 
			Paper
			 •- 
			2409.11211
			 •
			Published
				
			•- 
				9
			 
 - Vista3D: Unravel the 3D Darkside of a Single Image- 
			Paper
			 •- 
			2409.12193
			 •
			Published
				
			•- 
				10
			 
 - 3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive
  Diffusion- 
			Paper
			 •- 
			2409.12957
			 •
			Published
				
			•- 
				21
			 
 - FlexiTex: Enhancing Texture Generation with Visual Guidance- 
			Paper
			 •- 
			2409.12431
			 •
			Published
				
			•- 
				13
			 
 - 3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt- 
			Paper
			 •- 
			2409.12892
			 •
			Published
				
			•- 
				5
			 
 - Portrait Video Editing Empowered by Multimodal Generative Priors- 
			Paper
			 •- 
			2409.13591
			 •
			Published
				
			•- 
				17
			 
 - DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D
  Diffusion- 
			Paper
			 •- 
			2409.17145
			 •
			Published
				
			•- 
				15
			 
 - Game4Loc: A UAV Geo-Localization Benchmark from Game Data- 
			Paper
			 •- 
			2409.16925
			 •
			Published
				
			•- 
				8
			 
 - TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans- 
			Paper
			 •- 
			2409.16666
			 •
			Published
				
			•- 
				7
			 
 - LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with
  3D-awareness- 
			Paper
			 •- 
			2409.18125
			 •
			Published
				
			•- 
				34
			 
 - Disco4D: Disentangled 4D Human Generation and Animation from a Single
  Image- 
			Paper
			 •- 
			2409.17280
			 •
			Published
				
			•- 
				11
			 
 - MonST3R: A Simple Approach for Estimating Geometry in the Presence of
  Motion- 
			Paper
			 •- 
			2410.03825
			 •
			Published
				
			•- 
				19
			 
 - RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion
  Models- 
			Paper
			 •- 
			2409.19989
			 •
			Published
				
			•- 
				18
			 
 - Semantic Score Distillation Sampling for Compositional Text-to-3D
  Generation- 
			Paper
			 •- 
			2410.09009
			 •
			Published
				
			•- 
				15
			 
 - GS^3: Efficient Relighting with Triple Gaussian Splatting- 
			Paper
			 •- 
			2410.11419
			 •
			Published
				
			•- 
				12
			 
 - Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage
  Gaussian Splats- 
			Paper
			 •- 
			2410.12781
			 •
			Published
				
			•- 
				6
			 
 - FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without
  Learned Priors- 
			Paper
			 •- 
			2410.16271
			 •
			Published
				
			•- 
				84
			 
 - SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes- 
			Paper
			 •- 
			2410.17249
			 •
			Published
				
			•- 
				42
			 
 - 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with
  View-consistent 2D Diffusion Priors- 
			Paper
			 •- 
			2410.16266
			 •
			Published
				
			•- 
				5
			 
 - DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes- 
			Paper
			 •- 
			2410.18084
			 •
			Published
				
			•- 
				14
			 
 - LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias- 
			Paper
			 •- 
			2410.17242
			 •
			Published
				
			•- 
				5
			 
 - MotionCLR: Motion Generation and Training-free Editing via Understanding
  Attention Mechanisms- 
			Paper
			 •- 
			2410.18977
			 •
			Published
				
			•- 
				15
			 
 - Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling- 
			Paper
			 •- 
			2410.18912
			 •
			Published
				
			•- 
				6
			 
 - MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D- 
			Paper
			 •- 
			2411.02336
			 •
			Published
				
			•- 
				24
			 
 - GenXD: Generating Any 3D and 4D Scenes- 
			Paper
			 •- 
			2411.02319
			 •
			Published
				
			•- 
				20
			 
 - AutoVFX: Physically Realistic Video Editing from Natural Language
  Instructions- 
			Paper
			 •- 
			2411.02394
			 •
			Published
				
			•- 
				17
			 
 - DreamPolish: Domain Score Distillation With Progressive Geometry
  Generation- 
			Paper
			 •- 
			2411.01602
			 •
			Published
				
			•- 
				11
			 
 - GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single
  In-the-Wild Image using a Dataset with Levels of Details- 
			Paper
			 •- 
			2411.03047
			 •
			Published
				
			•- 
				9
			 
 - DimensionX: Create Any 3D and 4D Scenes from a Single Image with
  Controllable Video Diffusion- 
			Paper
			 •- 
			2411.04928
			 •
			Published
				
			•- 
				57
			 
 - StdGEN: Semantic-Decomposed 3D Character Generation from Single Images- 
			Paper
			 •- 
			2411.05738
			 •
			Published
				
			•- 
				15
			 
 - KMM: Key Frame Mask Mamba for Extended Motion Generation- 
			Paper
			 •- 
			2411.06481
			 •
			Published
				
			•- 
				5
			 
 - SAMPart3D: Segment Any Part in 3D Objects- 
			Paper
			 •- 
			2411.07184
			 •
			Published
				
			•- 
				28
			 
 - Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model
  with Compact Wavelet Encodings- 
			Paper
			 •- 
			2411.08017
			 •
			Published
				
			•- 
				11
			 
 - LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models- 
			Paper
			 •- 
			2411.09595
			 •
			Published
				
			•- 
				77
			 
 - GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D
  Generation- 
			Paper
			 •- 
			2411.08033
			 •
			Published
				
			•- 
				25
			 
 - VeGaS: Video Gaussian Splatting- 
			Paper
			 •- 
			2411.11024
			 •
			Published
				
			•- 
				7
			 
 - Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable
  Single-stage Image-to-3D Generation- 
			Paper
			 •- 
			2411.14384
			 •
			Published
				
			•- 
				9
			 
 - Material Anything: Generating Materials for Any 3D Object via Diffusion- 
			Paper
			 •- 
			2411.15138
			 •
			Published
				
			•- 
				50
			 
 - SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting
  Synthesis- 
			Paper
			 •- 
			2411.16443
			 •
			Published
				
			•- 
				12
			 
 - 
			Paper
			 •- 
			2411.13550
			 •
			Published
				
			•- 
				7
			 
 - TEXGen: a Generative Diffusion Model for Mesh Textures- 
			Paper
			 •- 
			2411.14740
			 •
			Published
				
			•- 
				18
			 
 - Learning 3D Representations from Procedural 3D Programs- 
			Paper
			 •- 
			2411.17467
			 •
			Published
				
			•- 
				9
			 
 - SAR3D: Autoregressive 3D Object Generation and Understanding via
  Multi-scale 3D VQVAE- 
			Paper
			 •- 
			2411.16856
			 •
			Published
				
			•- 
				13
			 
 - CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models- 
			Paper
			 •- 
			2411.18613
			 •
			Published
				
			•- 
				58
			 
 - MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D
  Content Creation- 
			Paper
			 •- 
			2411.17945
			 •
			Published
				
			•- 
				27
			 
 - Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready
  3D Characters- 
			Paper
			 •- 
			2411.18197
			 •
			Published
				
			•- 
				14
			 
 - DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow
  Decoding- 
			Paper
			 •- 
			2411.19527
			 •
			Published
				
			•- 
				11
			 
 - SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction
  with 3D Autonomous Characters- 
			Paper
			 •- 
			2412.00174
			 •
			Published
				
			•- 
				23
			 
 - World-consistent Video Diffusion with Explicit 3D Modeling- 
			Paper
			 •- 
			2412.01821
			 •
			Published
				
			•- 
				4
			 
 - Imagine360: Immersive 360 Video Generation from Perspective Anchor- 
			Paper
			 •- 
			2412.03552
			 •
			Published
				
			•- 
				29
			 
 - Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion- 
			Paper
			 •- 
			2412.03515
			 •
			Published
				
			•- 
				27
			 
 - Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene
  Understanding- 
			Paper
			 •- 
			2412.00493
			 •
			Published
				
			•- 
				17
			 
 - MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation- 
			Paper
			 •- 
			2412.03558
			 •
			Published
				
			•- 
				20
			 
 - Structured 3D Latents for Scalable and Versatile 3D Generation- 
			Paper
			 •- 
			2412.01506
			 •
			Published
				
			•- 
				83
			 
 - MV-Adapter: Multi-view Consistent Image Generation Made Easy- 
			Paper
			 •- 
			2412.03632
			 •
			Published
				
			•- 
				24
			 
 - Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large
  Scene Reconstruction- 
			Paper
			 •- 
			2412.04887
			 •
			Published
				
			•- 
				18
			 
 - 2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains
  for High-Fidelity Indoor Scene Reconstruction- 
			Paper
			 •- 
			2412.03428
			 •
			Published
				
			•- 
				11
			 
 - You See it, You Got it: Learning 3D Creation on Pose-Free Videos at
  Scale- 
			Paper
			 •- 
			2412.06699
			 •
			Published
				
			•- 
				13
			 
 - MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and
  Photorealism From Sparse Views- 
			Paper
			 •- 
			2412.06767
			 •
			Published
				
			•- 
				8
			 
 - Turbo3D: Ultra-fast Text-to-3D Generation- 
			Paper
			 •- 
			2412.04470
			 •
			Published
				
			•- 
				4
			 
 - Neural LightRig: Unlocking Accurate Object Normal and Material
  Estimation with Multi-Light Diffusion- 
			Paper
			 •- 
			2412.09593
			 •
			Published
				
			•- 
				18
			 
 - PIG: Physics-Informed Gaussians as Adaptive Parametric Mesh
  Representations- 
			Paper
			 •- 
			2412.05994
			 •
			Published
				
			•- 
				19
			 
 - GenEx: Generating an Explorable World- 
			Paper
			 •- 
			2412.09624
			 •
			Published
				
			•- 
				97
			 
 - IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and
  Illuminations- 
			Paper
			 •- 
			2412.12083
			 •
			Published
				
			•- 
				12
			 
 - GaussianProperty: Integrating Physical Properties to 3D Gaussians with
  LMMs- 
			Paper
			 •- 
			2412.11258
			 •
			Published
				
			•- 
				13
			 
 - DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation
  for High-quality 3D Asset Creation- 
			Paper
			 •- 
			2412.15200
			 •
			Published
				
			•- 
				9
			 
 - Sequence Matters: Harnessing Video Models in 3D Super-Resolution- 
			Paper
			 •- 
			2412.11525
			 •
			Published
				
			•- 
				11
			 
 - 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D
  Scene Understanding- 
			Paper
			 •- 
			2412.18450
			 •
			Published
				
			•- 
				36
			 
 - DepthLab: From Partial to Complete- 
			Paper
			 •- 
			2412.18153
			 •
			Published
				
			•- 
				36
			 
 - PartGen: Part-level 3D Generation and Reconstruction with Multi-View
  Diffusion Models- 
			Paper
			 •- 
			2412.18608
			 •
			Published
				
			•- 
				18
			 
 - Orient Anything: Learning Robust Object Orientation Estimation from
  Rendering 3D Models- 
			Paper
			 •- 
			2412.18605
			 •
			Published
				
			•- 
				22
			 
 - SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single
  Images- 
			Paper
			 •- 
			2501.04689
			 •
			Published
				
			•- 
				17
			 
 - Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation- 
			Paper
			 •- 
			2501.04144
			 •
			Published
				
			•- 
				19
			 
 - 
			Paper
			 •- 
			2501.07574
			 •
			Published
				
			•- 
				13
			 
 - CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities- 
			Paper
			 •- 
			2501.08983
			 •
			Published
				
			•- 
				20
			 
 - CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation- 
			Paper
			 •- 
			2501.09433
			 •
			Published
				
			•- 
				18
			 
 - GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar
  Editor- 
			Paper
			 •- 
			2501.09978
			 •
			Published
				
			•- 
				6
			 
 - Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D
  Assets Generation- 
			Paper
			 •- 
			2501.12202
			 •
			Published
				
			•- 
				47
			 
 - GSTAR: Gaussian Surface Tracking and Reconstruction- 
			Paper
			 •- 
			2501.10283
			 •
			Published
				
			•- 
				5
			 
 - Relightable Full-Body Gaussian Codec Avatars- 
			Paper
			 •- 
			2501.14726
			 •
			Published
				
			•- 
				10
			 
 - Multiview Equivariance Improves 3D Correspondence Understanding with
  Minimal Feature Finetuning- 
			Paper
			 •- 
			2411.19458
			 •
			Published
				
			•- 
				6
			 
 - DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian
  Splat Generation- 
			Paper
			 •- 
			2501.16764
			 •
			Published
				
			•- 
				22
			 
 - Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric
  Diffusion- 
			Paper
			 •- 
			2501.18804
			 •
			Published
				
			•- 
				5
			 
 - Fast Encoder-Based 3D from Casual Videos via Point Track Processing- 
			Paper
			 •- 
			2404.07097
			 •
			Published
				
			•- 
				4
			 
 - Text-to-CAD Generation Through Infusing Visual Feedback in Large
  Language Models- 
			Paper
			 •- 
			2501.19054
			 •
			Published
				
			•- 
				10
			 
 - DreamDPO: Aligning Text-to-3D Generation with Human Preferences via
  Direct Preference Optimization- 
			Paper
			 •- 
			2502.04370
			 •
			Published
				
			•- 
				7
			 
 - CAD-Editor: A Locate-then-Infill Framework with Automated Training Data
  Synthesis for Text-Based CAD Editing- 
			Paper
			 •- 
			2502.03997
			 •
			Published
				
			•- 
				9
			 
 - Exploring the Potential of Encoder-free Architectures in 3D LMMs- 
			Paper
			 •- 
			2502.09620
			 •
			Published
				
			•- 
				26
			 
 - TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified
  Flow Models- 
			Paper
			 •- 
			2502.06608
			 •
			Published
				
			•- 
				40
			 
 - Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and
  Texture Generation- 
			Paper
			 •- 
			2502.14247
			 •
			Published
				
			•- 
				6
			 
 - Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via
  Sparse Time-Variant Attribute Modeling- 
			Paper
			 •- 
			2502.20378
			 •
			Published
				
			•- 
				5
			 
 - Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models- 
			Paper
			 •- 
			2503.01774
			 •
			Published
				
			•- 
				44
			 
 - Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation- 
			Paper
			 •- 
			2503.01370
			 •
			Published
				
			•- 
				15
			 
 - RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling- 
			Paper
			 •- 
			2503.09601
			 •
			Published
				
			•- 
				16
			 
 - 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large
  Language Models- 
			Paper
			 •- 
			2503.10437
			 •
			Published
				
			•- 
				32
			 
 - TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree
  Sequencing- 
			Paper
			 •- 
			2503.11629
			 •
			Published
				
			•- 
				6
			 
 - Unleashing Vecset Diffusion Model for Fast Shape Generation- 
			Paper
			 •- 
			2503.16302
			 •
			Published
				
			•- 
				43
			 
 - DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement
  Learning- 
			Paper
			 •- 
			2503.15265
			 •
			Published
				
			•- 
				46
			 
 - DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis- 
			Paper
			 •- 
			2503.15667
			 •
			Published
				
			•- 
				8
			 
 - SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling- 
			Paper
			 •- 
			2503.21732
			 •
			Published
				
			•- 
				9
			 
 - Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal
  Bridging- 
			Paper
			 •- 
			2503.22236
			 •
			Published
				
			•- 
				11
			 
 - Progressive Rendering Distillation: Adapting Stable Diffusion for
  Instant Text-to-Mesh Generation without 3D Data- 
			Paper
			 •- 
			2503.21694
			 •
			Published
				
			•- 
				15
			 
 - MeshCraft: Exploring Efficient and Controllable Mesh Generation with
  Flow-based DiTs- 
			Paper
			 •- 
			2503.23022
			 •
			Published
				
			•- 
				6
			 
 - DSO: Aligning 3D Generators with Simulation Feedback for Physical
  Soundness- 
			Paper
			 •- 
			2503.22677
			 •
			Published
				
			•- 
				5
			 
 - VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in
  One Step- 
			Paper
			 •- 
			2504.01956
			 •
			Published
				
			•- 
				40
			 
 - HoloPart: Generative 3D Part Amodal Segmentation- 
			Paper
			 •- 
			2504.07943
			 •
			Published
				
			•- 
				28
			 
 - In-2-4D: Inbetweening from Two Single-View Images to 4D Generation- 
			Paper
			 •- 
			2504.08366
			 •
			Published
				
			•- 
				10
			 
 - InteractVLM: 3D Interaction Reasoning from 2D Foundational Models- 
			Paper
			 •- 
			2504.05303
			 •
			Published
				
			•- 
				5
			 
 - 3D CoCa: Contrastive Learners are 3D Captioners- 
			Paper
			 •- 
			2504.09518
			 •
			Published
				
			•- 
				5
			 
 - MCP Safety Audit: LLMs with the Model Context Protocol Allow Major
  Security Exploits- 
			Paper
			 •- 
			2504.03767
			 •
			Published
				
			•- 
				3
			 
 - Diffusion Distillation With Direct Preference Optimization For Efficient
  3D LiDAR Scene Completion- 
			Paper
			 •- 
			2504.11447
			 •
			Published
				
			•- 
				4
			 
 - Vivid4D: Improving 4D Reconstruction from Monocular Video by Video
  Inpainting- 
			Paper
			 •- 
			2504.11092
			 •
			Published
				
			•- 
				9
			 
 - BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via
  Adaptive Block-Based Gaussian Splatting- 
			Paper
			 •- 
			2504.09048
			 •
			Published
				
			•- 
				7
			 
 - HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation- 
			Paper
			 •- 
			2504.13072
			 •
			Published
				
			•- 
				13
			 
 - StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on
  3D Gaussians- 
			Paper
			 •- 
			2504.15281
			 •
			Published
				
			•- 
				23
			 
 - DiMeR: Disentangled Mesh Reconstruction Model- 
			Paper
			 •- 
			2504.17670
			 •
			Published
				
			•- 
				24
			 
 - HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene
  Generation- 
			Paper
			 •- 
			2504.21650
			 •
			Published
				
			•- 
				16
			 
 - Scenethesis: A Language and Vision Agentic Framework for 3D Scene
  Generation- 
			Paper
			 •- 
			2505.02836
			 •
			Published
				
			•- 
				8
			 
 - PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with
  Auto-Regressive Transformer- 
			Paper
			 •- 
			2505.04622
			 •
			Published
				
			•- 
				27
			 
 - 3D Scene Generation: A Survey- 
			Paper
			 •- 
			2505.05474
			 •
			Published
				
			•- 
				21
			 
 - PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes- 
			Paper
			 •- 
			2505.05288
			 •
			Published
				
			•- 
				14
			 
 - Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured
  3D Assets- 
			Paper
			 •- 
			2505.07747
			 •
			Published
				
			•- 
				61
			 
 - Constructing a 3D Town from a Single Image- 
			Paper
			 •- 
			2505.15765
			 •
			Published
				
			•- 
				24
			 
 - Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse
  Attention- 
			Paper
			 •- 
			2505.17412
			 •
			Published
				
			•- 
				21
			 
 - Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and
  Styles- 
			Paper
			 •- 
			2505.21060
			 •
			Published
				
			•- 
				4
			 
 - UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes- 
			Paper
			 •- 
			2505.23253
			 •
			Published
				
			•- 
				4
			 
 - CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian
  Splatting- 
			Paper
			 •- 
			2505.22854
			 •
			Published
				
			•- 
				4
			 
 - ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and
  Understanding- 
			Paper
			 •- 
			2506.01853
			 •
			Published
				
			•- 
				32
			 
 - Pro3D-Editor : A Progressive-Views Perspective for Consistent and
  Precise 3D Editing- 
			Paper
			 •- 
			2506.00512
			 •
			Published
				
			•- 
				5
			 
 - FlexPainter: Flexible and Multi-View Consistent Texture Generation- 
			Paper
			 •- 
			2506.02620
			 •
			Published
				
			•- 
				14
			 
 - Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting- 
			Paper
			 •- 
			2506.05327
			 •
			Published
				
			•- 
				11
			 
 - Aligning Text, Images, and 3D Structure Token-by-Token- 
			Paper
			 •- 
			2506.08002
			 •
			Published
				
			•- 
				21
			 
 - EmbodiedGen: Towards a Generative 3D World Engine for Embodied
  Intelligence- 
			Paper
			 •- 
			2506.10600
			 •
			Published
				
			•- 
				8
			 
 - StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated
  Video Streams- 
			Paper
			 •- 
			2506.08862
			 •
			Published
				
			•- 
				5
			 
 - Test3R: Learning to Reconstruct 3D at Test Time- 
			Paper
			 •- 
			2506.13750
			 •
			Published
				
			•- 
				27
			 
 - Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with
  Hybrid History Condition- 
			Paper
			 •- 
			2506.17201
			 •
			Published
				
			•- 
				56
			 
 - DreamCube: 3D Panorama Generation via Multi-plane Synchronization- 
			Paper
			 •- 
			2506.17206
			 •
			Published
				
			•- 
				23
			 
 - Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate
  Details- 
			Paper
			 •- 
			2506.16504
			 •
			Published
				
			•- 
				26
			 
 - Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with
  Production-Ready PBR Material- 
			Paper
			 •- 
			2506.15442
			 •
			Published
				
			•- 
				12
			 
 - 3D Arena: An Open Platform for Generative 3D Evaluation- 
			Paper
			 •- 
			2506.18787
			 •
			Published
				
			•- 
				13
			 
 - AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion
  Models- 
			Paper
			 •- 
			2506.19851
			 •
			Published
				
			•- 
				60
			 
 - PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for
  Realistic Articulated Object Modeling- 
			Paper
			 •- 
			2506.20936
			 •
			Published
				
			•- 
				12
			 
 - BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing- 
			Paper
			 •- 
			2506.17450
			 •
			Published
				
			•- 
				63
			 
 - LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with
  TriMap Video Diffusion- 
			Paper
			 •- 
			2507.02813
			 •
			Published
				
			•- 
				60
			 
 - SeqTex: Generate Mesh Textures in Video Sequence- 
			Paper
			 •- 
			2507.04285
			 •
			Published
				
			•- 
				9
			 
 - LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+
  FPS- 
			Paper
			 •- 
			2507.07136
			 •
			Published
				
			•- 
				38
			 
 - From One to More: Contextual Part Latents for 3D Generation- 
			Paper
			 •- 
			2507.08772
			 •
			Published
				
			•- 
				25
			 
 - Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos
  with Spatio-Temporal Diffusion Models- 
			Paper
			 •- 
			2507.13344
			 •
			Published
				
			•- 
				56
			 
 - Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with
  Regularized Score Distillation Sampling- 
			Paper
			 •- 
			2507.11061
			 •
			Published
				
			•- 
				37
			 
 - Gaussian Splatting with Discretized SDF for Relightable Assets- 
			Paper
			 •- 
			2507.15629
			 •
			Published
				
			•- 
				23
			 
 - Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention- 
			Paper
			 •- 
			2507.17745
			 •
			Published
				
			•- 
				34
			 
 - Elevating 3D Models: High-Quality Texture and Geometry Refinement from a
  Low-Quality Model- 
			Paper
			 •- 
			2507.11465
			 •
			Published
				
			•- 
				17
			 
 - HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D
  Worlds from Words or Pixels- 
			Paper
			 •- 
			2507.21809
			 •
			Published
				
			•- 
				131
			 
 - BANG: Dividing 3D Assets via Generative Exploded Dynamics- 
			Paper
			 •- 
			2507.21493
			 •
			Published
				
			•- 
				64
			 
 - 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding- 
			Paper
			 •- 
			2507.23478
			 •
			Published
				
			•- 
				15
			 
 - Dens3R: A Foundation Model for 3D Geometry Prediction- 
			Paper
			 •- 
			2507.16290
			 •
			Published
				
			•- 
				8
			 
 - Gaussian Variation Field Diffusion for High-fidelity Video-to-4D
  Synthesis- 
			Paper
			 •- 
			2507.23785
			 •
			Published
				
			•- 
				18
			 
 - DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior- 
			Paper
			 •- 
			2508.00599
			 •
			Published
				
			•- 
				7
			 
 - Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D
  Generation- 
			Paper
			 •- 
			2508.00428
			 •
			Published
				
			•- 
				3
			 
 - MeshLLM: Empowering Large Language Models to Progressively Understand
  and Generate 3D Mesh- 
			Paper
			 •- 
			2508.01242
			 •
			Published
				
			•- 
				10
			 
 - Matrix-3D: Omnidirectional Explorable 3D World Generation- 
			Paper
			 •- 
			2508.08086
			 •
			Published
				
			•- 
				75
			 
 - VertexRegen: Mesh Generation with Continuous Level of Detail- 
			Paper
			 •- 
			2508.09062
			 •
			Published
				
			•- 
				37
			 
 - StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image
  Translation- 
			Paper
			 •- 
			2508.11203
			 •
			Published
				
			•- 
				10
			 
 - TexVerse: A Universe of 3D Objects with High-Resolution Textures- 
			Paper
			 •- 
			2508.10868
			 •
			Published
				
			•- 
				17
			 
 - 4DNeX: Feed-Forward 4D Generative Modeling Made Easy- 
			Paper
			 •- 
			2508.13154
			 •
			Published
				
			•- 
				62
			 
 - SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass- 
			Paper
			 •- 
			2508.15769
			 •
			Published
				
			•- 
				19
			 
 - MV-RAG: Retrieval Augmented Multiview Diffusion- 
			Paper
			 •- 
			2508.16577
			 •
			Published
				
			•- 
				38
			 
 - VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D
  Space- 
			Paper
			 •- 
			2508.19247
			 •
			Published
				
			•- 
				41
			 
 - Pixie: Fast and Generalizable Supervised Learning of 3D Physics from
  Pixels- 
			Paper
			 •- 
			2508.17437
			 •
			Published
				
			•- 
				36
			 
 - FastMesh:Efficient Artistic Mesh Generation via Component Decoupling- 
			Paper
			 •- 
			2508.19188
			 •
			Published
				
			•- 
				16
			 
 - ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion
  Models- 
			Paper
			 •- 
			2508.18271
			 •
			Published
				
			•- 
				8
			 
 - Collaborative Multi-Modal Coding for High-Quality 3D Generation- 
			Paper
			 •- 
			2508.15228
			 •
			Published
				
			•- 
				4
			 
 - P3-SAM: Native 3D Part Segmentation- 
			Paper
			 •- 
			2509.06784
			 •
			Published
				
			•- 
				22
			 
 - X-Part: high fidelity and structure coherent shape decomposition- 
			Paper
			 •- 
			2509.08643
			 •
			Published
				
			•- 
				26
			 
 - 3D Aware Region Prompted Vision Language Model- 
			Paper
			 •- 
			2509.13317
			 •
			Published
				
			•- 
				13
			 
 - SPATIALGEN: Layout-guided 3D Indoor Scene Generation- 
			Paper
			 •- 
			2509.14981
			 •
			Published
				
			•- 
				26
			 
 - Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model
  Self-Distillation- 
			Paper
			 •- 
			2509.19296
			 •
			Published
				
			•- 
				22
			 
 - GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface
  Reconstruction- 
			Paper
			 •- 
			2509.18090
			 •
			Published
				
			•- 
				2
			 
 - NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks- 
			Paper
			 •- 
			2510.15019
			 •
			Published
				
			•- 
				55
			 
 - FlashWorld: High-quality 3D Scene Generation within Seconds- 
			Paper
			 •- 
			2510.13678
			 •
			Published
				
			•- 
				69