VAST-AI
/

TripoSG

TripoSGPipeline

Model card Files Files and versions Community

TripoSG / README.md

pookiefoof's picture

Initial release

2c1c516 4 months ago

|

history blame contribute delete

1.47 kB

	---
	license: mit
	pipeline_tag: image-to-3d
	tags:
	- triposg
	- 3d-generation
	- rectified-flow
	---
	# TripoSG - High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

	TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images.

	## Model Description

	### Model Architecture

	TripoSG utilizes a novel architecture combining:
	- Rectified Flow (RF) based Transformer for stable, linear trajectory modeling
	- Advanced VAE with SDF-based representation and hybrid geometric supervision
	- Cross-attention mechanism for image feature condition
	- 1.5B parameters operating on 2048 latent tokens

	## Intended Uses

	This model is designed for:
	- Converting single images to high-quality 3D meshes
	- Creative and design applications
	- Gaming and VFX asset creation
	- Prototyping and visualization

	## Requirements

	- CUDA-capable GPU (>8GB VRAM)

	## Usage

	For detailed usage instructions, please visit our [GitHub repository](https://github.com/VAST-AI-Research/TripoSG).

	## About

	TripoSG is developed by [Tripo](https://www.tripo3d.ai), [VAST AI Research](https://github.com/orgs/VAST-AI-Research), pushing the boundaries of 3D Generative AI.
	For more information:
	- [GitHub Repository](https://github.com/VAST-AI-Research/TripoSG)
	- [Paper](https://arxiv.org/abs/2502.06608)
	- [Gradio Demo](https://huggingface.co/spaces/VAST-AI/TripoSG)