Multimodal - MLX Collection Language Models that takes vision input and/or audio input, hand picked by Nexa Team. • 9 items • Updated 10 days ago • 3
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 549