0xLDF
/

Seg2Any

Model card Files Files and versions

Seg2Any / README.md

0xLDF's picture

Update README.md

444dde1 verified 25 days ago

|

history blame contribute delete

1.29 kB

	---
	license: apache-2.0
	---

	<h1 align="center">Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control</h1>


	<div align="center">
	<!-- <a href='LICENSE'><img src='https://img.shields.io/badge/license-MIT-yellow'></a> -->
	<a href='https://seg2any.github.io'><img src='https://img.shields.io/badge/Project-Page-Green'></a>
	<a href='https://arxiv.org/abs/2506.00596'><img src='https://img.shields.io/badge/Paper-Arxiv-red'></a>
	<a href='https://github.com/0xLDF/Seg2Any'><img src='https://img.shields.io/badge/⭐_GitHub-Code-blue' alt='GitHub'></a>
	<a href="https://huggingface.co/datasets/0xLDF/SACap-1M"><img src="https://img.shields.io/badge/🤗_HuggingFace-Dataset-ffbd45.svg" alt="HuggingFace"></a>
	<a href="https://huggingface.co/datasets/0xLDF/SACap-eval"><img src="https://img.shields.io/badge/🤗_HuggingFace-Benchmark-ffbd45.svg" alt="HuggingFace"></a>

	</div>

	We release model weights trained on three distinct datasets: ADE20K, COCO-Stuff, and SACap-1M. The SACap-1M version is the most popular, offering fine-grained regional text prompts.

	For detailed usage instructions, please refer to the [GitHub](https://github.com/0xLDF/Seg2Any).

	<br>

	<p align="center">
	<img src="assets/demo.png" width="90%" height="90%">
	</p>