Zhukov's picture

Zhukov

Geximus

·

AI & ML interests

None yet

Recent Activity

new activity 9 days ago

MiniMaxAI/MiniMax-M2.7:Prevent whitespace leakage in beginning of prompt

upvoted an article 19 days ago

Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs

new activity 26 days ago

demon-zombie/MiniMax-M2.7-AWQ-4bit:These are NOT actual AWQ-quantized models.

View all activity

Organizations

None yet

New activity in MiniMaxAI/MiniMax-M2.7 9 days ago

Prevent whitespace leakage in beginning of prompt

#22 opened 25 days ago by

upvoted an article 19 days ago

Article

Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs

lujangusface

•

Apr 3

• 8

New activity in demon-zombie/MiniMax-M2.7-AWQ-4bit 26 days ago

These are NOT actual AWQ-quantized models.

#1 opened 27 days ago by

New activity in MiniMaxAI/MiniMax-M2.7 27 days ago

MiniMax-M2.7 is highly verbose and slow

#18 opened 28 days ago by

New activity in cyankiwi/MiniMax-M2.7-AWQ-4bit 27 days ago

thanks for 4bit awq!

#1 opened 29 days ago by

upvoted an article 28 days ago

Article

2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5

lujangusface

•

Apr 9

• 3

liked a model 29 days ago

cyankiwi/MiniMax-M2.7-AWQ-4bit

Text Generation • 37B • Updated 29 days ago • 251k • 30

New activity in cyankiwi/MiniMax-M2.5-AWQ-4bit 29 days ago

Is minimax 2.7 on the way?

#3 opened 29 days ago by

New activity in MiniMaxAI/MiniMax-M2.5 about 1 month ago

Minimax 2.7???

#53 opened about 2 months ago by

New activity in togethercomputer/Aurora-Spec-Minimax-M2.5 about 1 month ago

Perfomance question

#4 opened about 1 month ago by

liked a model 2 months ago

cyankiwi/Qwen3.5-122B-A10B-AWQ-8bit

Image-Text-to-Text • 39B • Updated Mar 26 • 4.74k • 4

liked a model 3 months ago

cyankiwi/Qwen3-Coder-Next-AWQ-4bit

Text Generation • 14B • Updated Mar 26 • 96.9k • 28

New activity in Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice 3 months ago

Low generation speed and low GPU utilization (~12%) during inference

#18 opened 4 months ago by

liked 2 models 4 months ago

cyankiwi/Qwen3-30B-A3B-Instruct-2507-AWQ-4bit

Text Generation • 5B • Updated 5 days ago • 55k • 31

ai-sage/GigaAM-v3

Automatic Speech Recognition • Updated Nov 19, 2025 • 105k • 98

New activity in black-forest-labs/FLUX.2-dev 5 months ago

Why is Flux 2 so slow in Img2Img even though everything is in CUDA?

#22 opened 5 months ago by

New activity in cyankiwi/Qwen3-Next-80B-A3B-Instruct-AWQ-4bit 5 months ago

Perfomance of this model is one of the best

#13 opened 5 months ago by

liked a Space 5 months ago

Qwen TTS Clone Demo

Create a custom voice clone and synthesize speech

New activity in cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit 6 months ago

why recently re-uploaded the core?

#7 opened 6 months ago by

liked a model 6 months ago

cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-8bit

Text Generation • 84B • Updated 5 days ago • 53 • 5