Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
JunxiongWang 's Collections
M1
MambaInLlama_MATH_Reasoning
MambaInLlama-dpo
MambaInLlama-distill
Mamba2InLlama3.2-3B
Mamba-In-Zephyr
Mamba-In-Llama3
Mamba2-In-Llama3
MambaByte

M1

updated Jul 3

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449

Upvote
-

  • togethercomputer/M1-3B

    Text Generation • 3B • Updated 6 days ago • 1.19k • 4

  • JunxiongWang/M1-3B

    Text Generation • 3B • Updated Apr 16 • 1.14k • 1

  • JunxiongWang/M1-3B-SFT

    Text Generation • 3B • Updated Apr 16 • 6 • 1

  • JunxiongWang/R1_Sythetic_SFT

    Viewer • Updated Apr 16 • 1M • 47

  • JunxiongWang/R1_GR_SFT

    Viewer • Updated Apr 16 • 44k • 23

  • JunxiongWang/R1_SFT

    Updated Apr 16 • 52

  • JunxiongWang/R1_OpenThoughts_SFT

    Viewer • Updated Apr 7 • 862k • 17

  • JunxiongWang/R1_am_SFT

    Viewer • Updated Apr 1 • 1.4M • 14

  • JunxiongWang/MATH_SFT

    Viewer • Updated Apr 15 • 19.1M • 70
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs