Chong Ruan's picture

34

Chong Ruan

Chester111

·

AI & ML interests

AGI & LLM

Recent Activity

authored a paper 23 days ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

authored a paper 4 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

authored a paper 5 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

View all activity

Organizations

Chester111's activity

authored a paper 23 days ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 24 days ago • 63

authored a paper 4 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 160

authored a paper 5 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 401

New activity in deepseek-ai/DeepSeek-R1 5 months ago

Update README.md

#16 opened 5 months ago by

New activity in deepseek-ai/DeepSeek-R1-Zero 5 months ago

Update README.md

#12 opened 5 months ago by

New activity in deepseek-ai/DeepSeek-R1 5 months ago

Tag Model as MIT license

#12 opened 5 months ago by

New activity in deepseek-ai/DeepSeek-R1-Zero 5 months ago

add library name & auto-tag

#10 opened 5 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-32B 5 months ago

add library tag for better code snippets and tags

#3 opened 5 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-8B 5 months ago

add library tag for better code snippets and tags

#1 opened 5 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-70B 5 months ago

add library tag for better code snippets and tags

#3 opened 5 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B 5 months ago

add library tag for better code snippets and tags

#1 opened 5 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-7B 5 months ago

add library tag for better code snippets and tags

#1 opened 5 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-14B 5 months ago

add library tag for better code snippets and tags

#1 opened 5 months ago by

updated a collection 5 months ago

DeepSeek-R1

10 items • Updated 9 days ago • 707