InstantX

community

https://huggingface.co/InstantX

instantx_ai

instantX-research

AI & ML interests

We open source generative models

Recent Activity

zhen-nan authored a paper 3 days ago

DiP: Taming Diffusion Models in Pixel Space

xingpng authored a paper about 2 months ago

InstantIR: Blind Image Restoration with Instant Generative Reference

xingpng authored a paper about 2 months ago

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly

View all activity

zhen-nan

authored a paper 3 days ago

DiP: Taming Diffusion Models in Pixel Space

Paper • 2511.18822 • Published 10 days ago • 23

xingpng

authored 6 papers about 2 months ago

InstantIR: Blind Image Restoration with Instant Generative Reference

Paper • 2410.06551 • Published Oct 9, 2024 • 6

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly

Paper • 2502.05761 • Published Feb 9 • 7

Dynamic Pyramid Network for Efficient Multimodal Large Language Model

Paper • 2503.20322 • Published Mar 26 • 1

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Paper • 2506.07977 • Published Jun 9 • 41

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14 • 143

WithAnyone: Towards Controllable and ID Consistent Image Generation

Paper • 2510.14975 • Published Oct 16 • 83

multimodalart

posted an update about 2 months ago

Post

5721

Want to iterate on a Hugging Face Space with an LLM?

Now you can easily convert any HF entire repo (Model, Dataset or Space) to a text file and feed it to a language model!

multimodalart/repo2txt

wanghaofan

updated a model 3 months ago

InstantX/Qwen-Image-ControlNet-Inpainting

Image-to-Image • Updated Sep 12 • 3.9k • 81

wanghaofan

published a model 3 months ago

InstantX/Qwen-Image-ControlNet-Inpainting

Image-to-Image • Updated Sep 12 • 3.9k • 81

wanghaofan

updated a Space 3 months ago

README

wanghaofan

in InstantX/Qwen-Image-ControlNet-Inpainting 3 months ago

Upload 2 files

#1 opened 3 months ago by

wanghaofan

updated a Space 3 months ago

InstantID

Generate images preserving face identity

wanghaofan

updated a model 3 months ago

InstantX/Qwen-Image-ControlNet-Union

Image-to-Image • Updated Aug 26 • 14.5k • 97

wanghaofan

published a model 4 months ago

InstantX/Qwen-Image-ControlNet-Union

Image-to-Image • Updated Aug 26 • 14.5k • 97

wangqixun

authored a paper 4 months ago

InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework

Paper • 2504.12395 • Published Apr 16 • 16

wanghaofan

authored a paper 5 months ago

Calligrapher: Freestyle Text Image Customization

Paper • 2506.24123 • Published Jun 30 • 37

multimodalart

posted an update 6 months ago

Post

17855

Self-Forcing - a real-time video distilled model from Wan 2.1 by @adobe is out, and they open sourced it 🐐

I've built a live real time demo on Spaces 📹💨

multimodalart/self-forcing

6 replies

·

wanghaofan

authored 2 papers 6 months ago

GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains

Paper • 2505.18700 • Published May 24 • 4

EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering

Paper • 2505.24417 • Published May 30 • 13