Running
2
Step-3
🤖
Interact with a multimodal AI assistant using text and images
None defined yet.
Interact with a multimodal AI assistant using text and images
Modify images based on text prompts
image2mesh
A demo of StepFun step-3 reasoning model with visible chain-
Edit an image based on the given instruction.
Generate audio responses from text or audio
Extract text from images using various OCR modes