Roblox
/

cube3d-v0.5

Model card Files Files and versions Community

cube3d-v0.5 / README.md

animan123's picture

Initial commit

8cab488 7 days ago

|

history blame contribute delete

2.64 kB

	---
	license: openrail
	---

	# Cube 3D: Foundational Model for Generative AI

	<p align="center">
	<img src="./assets/3d_composition.gif">
	</p>


	<div align="center">
	<a href=https://corp.roblox.com/newsroom/2025/03/introducing-roblox-cube target="_blank"><img src=https://img.shields.io/badge/Roblox-Blog-000000.svg?logo=Roblox height=22px style="display: inline-block; vertical-align: middle;"></a>
	<a href=https://github.com/Roblox/cube target="_blank"><img src=https://img.shields.io/badge/Github-Source-blue.svg?logo=github height=22px style="display: inline-block; vertical-align: middle;"></a>
	<a href=https://arxiv.org/abs/2503.15475 target="_blank"><img src=https://img.shields.io/badge/ArXiv-Report-b5212f.svg?logo=arxiv height=22px style="display: inline-block; vertical-align: middle;"></a>
	</div>

	Cube 3D is our first step towards 3D intelligence, which involves a shape tokenizer and a text-to-shape generation model. We are unlocking the power of generating 3D assets and enhancing creativity for all artists. Our latest version of Cube 3D is now accessible to individuals, creators, researchers and businesses of all sizes so that they can experiment, innovate and scale their ideas responsibly.

	This release includes model weights and starting code for our v0.5 model. With this update, we introduce two upgrades to the auto-regressive base model for 3D geometry generation from text: higher fidelity 3D compositions and bounding box conditioning

	The example gif above shows the model's capacity to generate 3D shapes capturing mixtures of concepts expressed in text, for example mechanical lobster with mechanical tank treads. The v0.5 model also shows significantly better text adherence, for example the lowpoly paper craft victorian rabbit.

	With bounding box conditioning, we observe novel 3D generations where the model balances between the two conditioning inputs -- text prompt and global aspect ratio. In the gif below, notice how the model creatively interprets the the seashell or tall pagoda prompts into distinct 3D shapes. The model sometimes struggles when the bounding box is too extreme for a given prompt, for example the cat, where it can produce disconnected components or generates it along a diagonal to fit the bounding box constraints.
	<p align="center">
	<img src="./assets/bbox_conditioning.gif" width="600" style="margin: 5px;">
	</p>

	For code and details on how to use the models, please refer to our [Github repository](https://github.com/Roblox/cube).

	For technical details, please see our [tech report](https://arxiv.org/abs/2503.15475)