Generate expressive speech from text with emotion control
infinite-length audio-driven avatar video generation model
Part-level image-to-3D generation.