facebook/wav2vec2-base-10k-voxpopuli-ft-it
Automatic Speech Recognition
ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures