-
facebook/vjepa2-vitl-fpc64-256
Video Classification β’ 0.3B β’ Updated β’ 75.2k β’ 184 -
microsoft/xclip-base-patch32
Video Classification β’ 0.2B β’ Updated β’ 145k β’ 109 -
MCG-NJU/videomae-base
Video Classification β’ 94.2M β’ Updated β’ 71.4k β’ 50 -
OpenGVLab/VideoMAEv2-Base
Video Classification β’ 86.2M β’ Updated β’ 15.8k β’ 12
Alban NYANTUDRE
AI & ML interests
Recent Activity
Organizations
-
anyantudre/MooreSpeechCorpora
Viewer β’ Updated β’ 5.54k β’ 7 β’ 3 - Running4
Moore Language Space
π4Demo Space for MoorΓ© language TTS, ASR and translation
-
anyantudre/moore-speech-contes
Viewer β’ Updated β’ 5.96k β’ 7 β’ 1 - Running1
Moore translation Leaderboard
π1Text2text Machine Translation for Moore language
- Running on CPU Upgrade1k
Open VLM Leaderboard
π1kVLMEvalKit Evaluation Results Collection
- Running on ZeroFeatured421
moondream1
π421Generate coherent text continuations from prompts
- Runtime error20
Ovis2 1B
π¦«20Small model can do big things.
- Running on Zero4
VQA Autonomous Driving SmolVLM2
π4Visual Question Answering - Autonomous Driving - SmolVLM2
- Running on L40S555
MinerU OCR
π555A data extraction tool to convert PDF to Markdown and JSON
- Running on ZeroFeatured448
DeepSeek OCR 2 Demo
π448Try out DeepSeek-OCR-2 on your PDFs or images
- Running on ZeroFeatured269
granite-docling-258M demo
π269Extract and convert document content from images
- Running on Zero38
Multimodal RAG with Granite Vision
π38RAG example using Granite [vision, embedding, instruct]
-
OpenGVLab/InternVL3-1B
Image-Text-to-Text β’ 0.9B β’ Updated β’ 101k β’ 79 -
vikhyatk/moondream2
Image-Text-to-Text β’ 2B β’ Updated β’ 5.16M β’ 1.39k -
microsoft/Florence-2-base
Image-Text-to-Text β’ 0.2B β’ Updated β’ 738k β’ 354 -
HuggingFaceTB/SmolVLM2-256M-Video-Instruct
Image-Text-to-Text β’ 0.3B β’ Updated β’ 154k β’ 98
-
facebook/vjepa2-vitl-fpc64-256
Video Classification β’ 0.3B β’ Updated β’ 75.2k β’ 184 -
microsoft/xclip-base-patch32
Video Classification β’ 0.2B β’ Updated β’ 145k β’ 109 -
MCG-NJU/videomae-base
Video Classification β’ 94.2M β’ Updated β’ 71.4k β’ 50 -
OpenGVLab/VideoMAEv2-Base
Video Classification β’ 86.2M β’ Updated β’ 15.8k β’ 12
- Running on L40S555
MinerU OCR
π555A data extraction tool to convert PDF to Markdown and JSON
- Running on ZeroFeatured448
DeepSeek OCR 2 Demo
π448Try out DeepSeek-OCR-2 on your PDFs or images
- Running on ZeroFeatured269
granite-docling-258M demo
π269Extract and convert document content from images
- Running on Zero38
Multimodal RAG with Granite Vision
π38RAG example using Granite [vision, embedding, instruct]
-
anyantudre/MooreSpeechCorpora
Viewer β’ Updated β’ 5.54k β’ 7 β’ 3 - Running4
Moore Language Space
π4Demo Space for MoorΓ© language TTS, ASR and translation
-
anyantudre/moore-speech-contes
Viewer β’ Updated β’ 5.96k β’ 7 β’ 1 - Running1
Moore translation Leaderboard
π1Text2text Machine Translation for Moore language
-
OpenGVLab/InternVL3-1B
Image-Text-to-Text β’ 0.9B β’ Updated β’ 101k β’ 79 -
vikhyatk/moondream2
Image-Text-to-Text β’ 2B β’ Updated β’ 5.16M β’ 1.39k -
microsoft/Florence-2-base
Image-Text-to-Text β’ 0.2B β’ Updated β’ 738k β’ 354 -
HuggingFaceTB/SmolVLM2-256M-Video-Instruct
Image-Text-to-Text β’ 0.3B β’ Updated β’ 154k β’ 98
- Running on CPU Upgrade1k
Open VLM Leaderboard
π1kVLMEvalKit Evaluation Results Collection
- Running on ZeroFeatured421
moondream1
π421Generate coherent text continuations from prompts
- Runtime error20
Ovis2 1B
π¦«20Small model can do big things.
- Running on Zero4
VQA Autonomous Driving SmolVLM2
π4Visual Question Answering - Autonomous Driving - SmolVLM2