Spaces:
Running
on
Zero
Running
on
Zero
A newer version of the Gradio SDK is available:
5.45.0
IC-Custom Application
A sophisticated image customization tool powered by advanced AI models.
πΊ App Guide:
For a fast overview of how to use the app, watch this video:
IC-Custom App Usage Guide (YouTube)
π Quick Start
python src/app/app.py \
--config configs/app/app.yaml \
--hf_token $HF_TOKEN \
--hf_cache_dir $HF_CACHE_DIR \
--assets_cache_dir results/app \
--enable_ben2_for_mask_ref False \
--enable_vlm_for_prompt False \
--save_results True
βοΈ Configuration & CLI Arguments
Argument | Type | Required | Default | Description |
---|---|---|---|---|
--config |
str | β | - | Path to app YAML config file |
--hf_token |
str | β | - | Hugging Face access token. |
--hf_cache_dir |
str | β | ~/.cache/huggingface/hub |
HF assets cache directory |
--assets_cache_dir |
str | β | results/app |
Output images & metadata directory |
--save_results |
bool | β | False |
Save generated results |
--enable_ben2_for_mask_ref |
bool | β | False |
Enable BEN2 background removal |
--enable_vlm_for_prompt |
bool | β | False |
Enable VLM prompt generation |
Environment Variables
HF_TOKEN
β--hf_token
HF_HUB_CACHE
β--hf_cache_dir
π₯ Model Downloads
Model checkpoints are required before running the app.
All required models will be automatically downloaded when you run the app, or you can manually download them and specify paths inconfigs/app/app.yaml
.
Required Models
The following models are automatically downloaded when running the app:
Model | Purpose | Source |
---|---|---|
IC-Custom | Our customization model | TencentARC/IC-Custom |
CLIP | Vision-language understanding | openai/clip-vit-large-patch14 |
T5 | Text processing | google/t5-v1_1-xxl |
SigLIP | Image understanding | google/siglip-so400m-patch14-384 |
Autoencoder | Image encoding/decoding | black-forest-labs/FLUX.1-Fill-dev |
DIT | Diffusion model | black-forest-labs/FLUX.1-Fill-dev |
Redux | Image processing | black-forest-labs/FLUX.1-Redux-dev |
SAM-vit-h | Image segmentation | HCMUE-Research/SAM-vit-h |
Optional Models (Selective Download)
BEN2 and Qwen2.5-VL models are disabled by default and only downloaded when explicitly enabled:
Model | Flag | Source | Purpose |
---|---|---|---|
BEN2 | --enable_ben2_for_mask_ref True |
PramaLLC/BEN2 | Background removal |
Qwen2.5-VL | --enable_vlm_for_prompt True |
Qwen/Qwen2.5-VL-7B-Instruct | Prompt generation |
Manual Configuration
Alternative: Manually download all models and specify paths in configs/app/app.yaml
:
checkpoint_config:
# Required models
dit_path: "/path/to/flux1-fill-dev.safetensors"
ae_path: "/path/to/ae.safetensors"
t5_path: "/path/to/t5-v1_1-xxl"
clip_path: "/path/to/clip-vit-large-patch14"
siglip_path: "/path/to/siglip-so400m-patch14-384"
redux_path: "/path/to/flux1-redux-dev.safetensors"
# IC-Custom models
lora_path: "/path/to/dit_lora_0x1561.safetensors"
img_txt_in_path: "/path/to/dit_txt_img_in_0x1561.safetensors"
boundary_embeddings_path: "/path/to/dit_boundary_embeddings_0x1561.safetensors"
task_register_embeddings_path: "/path/to/dit_task_register_embeddings_0x1561.safetensors"
# APP interactive models
sam_path: "/path/to/sam_vit_h_4b8939.pth"
# Optional models
ben2_path: "/path/to/BEN2_Base.pth"
vlm_path: "/path/to/Qwen2.5-VL-7B-Instruct"
APP Overview