AI & ML interests
Large Multimodal Models
Organizations
None yet
Zhang199/TinyLLaVA-Qwen2-0.5B-SigLIP
Image-Text-to-Text
• 1B • Updated
• 506
• 7
Zhang199/EDGE-GRPO-Qwen-1.5B
Text Generation
• 2B • Updated
Zhang199/EDGE-GRPO-Qwen-7B
Text Generation
• 8B • Updated
• 1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-16-512
Video-Text-to-Text
• 4B • Updated
• 494
• 1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Naive-16-512
Video-Text-to-Text
• 4B • Updated
• 2
Zhang199/TinyLLaVA-Video-Phi2-Naive-16-512
Video-Text-to-Text
• 3B • Updated
• 7
Zhang199/TinyLLaVA-Qwen2.5-3B-SigLIP
Image-Text-to-Text
• 4B • Updated
• 494
Zhang199/TinyLLaVA-Video-R1
Video-Text-to-Text
• 4B • Updated
• 7
• 4
Zhang199/TinyLLaVA-Video-Coldstart_NextQA_16
Video-Text-to-Text
• 4B • Updated
• 11
• 1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-1fps-512
Video-Text-to-Text
• 4B • Updated
• 4
Zhang199/subject_bert_mmmu
Text Classification
• 0.1B • Updated
• 1