Doge Face

community

https://huggingface.co/SmallDoge

SmallDoges

Activity Feed Request to join this org

AI & ML interests

A Family of Dynamic UltraFast Small Language Models Ready for Embodied Artificial General Intelligence!

Recent Activity

JingzeShi new activity 1 day ago

SmallDoge/Doge-60M:ImportError: cannot import name 'LossKwargs' from 'transformers.utils'

JingzeShi updated a model 1 day ago

SmallDoge/Doge-320M-Instruct-SFT

JingzeShi updated a model 1 day ago

SmallDoge/Doge-160M-Instruct

View all activity

prithivMLmods

posted an update about 15 hours ago

Post

452

I've added the demo of the openbmb/MiniCPM-V-4 model to the Hugging Face Space:
prithivMLmods/Multimodal-VLM-Thinking

✨ MiniCPM-V 4.0 is the latest efficient model in the MiniCPM-V series. The model is built based on SigLIP2-400M and MiniCPM4-3B, with a total of 4.1B parameters. It inherits the strong single-image, multi-image, and video understanding performance of MiniCPM-V 2.6 with largely improved efficiency.

✨ With only 4.1B parameters, MiniCPM-V 4.0 achieves an average score of 69.0 on OpenCompass, a comprehensive evaluation of 8 popular benchmarks. This performance surpasses GPT-4.1-mini-20250414, MiniCPM-V 2.6 (8.1B parameters, OpenCompass 65.2), and Qwen2.5-VL-3B-Instruct (3.8B parameters, OpenCompass 64.5). It also shows good performance in multi-image and video understanding.

The community GPU grant was given by Hugging Face — special thanks to them. 🤗🚀

To know more about it, visit the model card of the respective model. !!

JingzeShi

in SmallDoge/Doge-60M 1 day ago

ImportError: cannot import name 'LossKwargs' from 'transformers.utils'

#2 opened 3 months ago by

Alan109440

JingzeShi

updated 9 models 1 day ago

Evanwu50020

authored a paper 2 days ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published 5 days ago • 12

JingzeShi

updated a dataset 2 days ago

SmallDoge/Doge

Viewer • Updated 2 days ago • 8M • 166

JingzeShi

published a dataset 2 days ago

SmallDoge/Doge

Viewer • Updated 2 days ago • 8M • 166

JingzeShi

authored a paper 2 days ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published 5 days ago • 12

wubingheng

authored a paper 2 days ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published 5 days ago • 12

KingNish

posted an update 4 days ago

Post

360

Wan 2.2 fast upto 10x faster than original wan 2.2

Model: FastVideo/FastWan2.2-TI2V-5B-FullAttn-Diffusers

Space: KingNish/wan2-2-fast

prithivMLmods

posted an update 4 days ago

Post

4119

Qwen Image – The Latest Image Generation Model🔥

Below are some samples generated using the Qwen Image Diffusion Model. Qwen-Image, a 20B MMDiT model for next-generation text-to-image generation, preserves typographic details, layout coherence, and contextual harmony with stunning accuracy. It is especially strong at creating stunning graphic posters with native text. The model is now open-source. [ 𝚀𝚠𝚎𝚗-𝙸𝚖𝚊𝚐𝚎 : Qwen/Qwen-Image ]

⤷ Try the Qwen Image demo here: prithivMLmods/Qwen-Image-Diffusion

⤷ Qwen-Image Technical Report : Qwen-Image Technical Report (2508.02324)
⤷ Qwen Image [GitHub] : https://github.com/QwenLM/Qwen-Image

Even more impressively, it demonstrates a strong ability to understand images. The model supports a wide range of vision-related tasks such as object detection, semantic segmentation, depth and edge (Canny) estimation, novel view synthesis, and image super-resolution. While each task is technically distinct, they can all be viewed as advanced forms of intelligent image editing driven by deep visual understanding. Collectively, these capabilities position Qwen-Image as more than just a tool for generating appealing visuals, it serves as a versatile foundation model for intelligent visual creation and transformation, seamlessly blending language, layout, and imagery.

Qwen-Image uses a dual-stream MMDiT architecture with a frozen Qwen2.5-VL, VAE encoder, RMSNorm for QK-Norm, LayerNorm elsewhere, and a custom MSRoPE scheme for joint image-text positional encoding.

.
.
.
To know more about it, visit the model card of the respective model. !!