Spaces:

Skywork
/

SkyCaptioner-V1

Running on Zero

Apply for community grant: Academic project (gpu)

by pinoo - opened May 7

Skywork org May 7

SkyCaptioner-V1 is a structural video captioning model designed to generate high-quality, structural descriptions for video data. It integrates specialized sub-expert models and multimodal large language models (MLLMs) with human annotations to address the limitations of general captioners in capturing professional film-related details.

hysts

May 7

Hi @pinoo , we've assigned ZeroGPU to this Space. Please check the compatibility and usage sections of this page so your Space can run on ZeroGPU.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment