MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (Paper 2601.07832)
FineWeb: decanting the web for the finest text data at scale 🍷. Generate high-quality text data for LLMs using FineWeb.
The Ultra-Scale Playbook. The ultimate guide to training LLMs on large GPU clusters.
The Smol Training Playbook. The secrets to building world-class LLMs.