4 1 6

ZizhengZhan

Anditty

AI & ML interests

None yet

Recent Activity

reacted to AdinaY's post with 👍 2 days ago

KAT-V1 🔥 a LLM that tackles overthinking by switching between reasoning and direct answers, by Kuaishou. https://huggingface.co/Kwaipilot/KAT-V1-40B ✨ 40B ✨ Step-SRPO: smarter reasoning control via RL ✨ MTP + Distillation: efficient training, lower cost

upvoted a paper about 2 months ago

KAT-V1: Kwai-AutoThink Technical Report

liked a model 3 months ago

Kwaipilot/KwaiCoder-AutoThink-preview

View all activity

Organizations

reacted to AdinaY's post with 👍 2 days ago

Post

2687

KAT-V1 🔥 a LLM that tackles overthinking by switching between reasoning and direct answers, by Kuaishou.

Kwaipilot/KAT-V1-40B

✨ 40B
✨ Step-SRPO: smarter reasoning control via RL
✨ MTP + Distillation: efficient training, lower cost

upvoted a paper about 2 months ago

KAT-V1: Kwai-AutoThink Technical Report

Paper • 2507.08297 • Published Jul 11 • 6

liked a model 3 months ago

Kwaipilot/KwaiCoder-AutoThink-preview

41B • Updated Jun 10 • 34 • 49

published a model 3 months ago

Kwaipilot/KwaiCoder-AutoThink-preview

41B • Updated Jun 10 • 34 • 49

updated a model 3 months ago

Kwaipilot/KwaiCoder-AutoThink-preview

41B • Updated Jun 10 • 34 • 49

liked 2 models 6 months ago

Kwaipilot/OASIS-code-embedding-1.5B

Kwaipilot/KwaiCoder-23B-A4B-v1

Text Generation • 23B • Updated Jan 24 • 5 • 14

updated 2 models 6 months ago

Kwaipilot/OASIS-code-embedding-1.5B

Kwaipilot/OASIS-code-1.3B

liked 2 models 6 months ago

Kwaipilot/OASIS-code-1.3B

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 521k • • 12.7k

New activity in Kwaipilot/KwaiCoder-23B-A4B-v1 7 months ago

prompt format in code completion

#1 opened 7 months ago by

LeiLeier

liked a model over 1 year ago

Qwen/Qwen-72B-Chat-Int4

Text Generation • 12B • Updated Jan 4, 2024 • 93 • 46

New activity in codellama/codellama-playground almost 2 years ago

Should add <bos_id> token with infilling

#11 opened almost 2 years ago by

Anditty

New activity in bigcode/starcoder about 2 years ago

starcoder uses Megatron-LM?

#27 opened over 2 years ago by

senxiangms

ZizhengZhan

AI & ML interests

Recent Activity

Organizations

Anditty's activity

prompt format in code completion

Should add <bos_id> token with infilling

starcoder uses Megatron-LM?