Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lasgroup 's Collections
Test-Time Curricula for Targeted RL
Test-Time Model Merging (TTMM)
Test-Time Curricula for Targeted RL (Qwen3-8B)
Test-Time Curricula for Targeted RL (Qwen3-4B-Instruct-2507)
Test-Time Curricula for Targeted RL (Qwen3-8B-Base)
Test-Time Curricula for Targeted RL (AIME25)

Test-Time Model Merging (TTMM)

updated 27 days ago

Collection of fine-tuned models and expert adapters from "Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging" (COLM 25)

Upvote
1

  • Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging

    Paper • 2505.14136 • Published May 20

  • rbertolissi/Llama-3.2-1B-Wikipedia

    Updated Jul 28

  • rbertolissi/Llama-3.2-1B-TTMM-Wikipedia

    Updated Jul 29

  • rbertolissi/Qwen2.5-1.5B-Wikipedia

    Updated Jul 28

  • rbertolissi/Qwen2.5-1.5B-TTMM-Wikipedia

    Updated Jul 29

  • rbertolissi/Llama-3.2-1B-GitHub-Python

    Updated Jul 28

  • rbertolissi/Qwen2.5-1.5B-TTMM-GitHub-Python

    Updated Jul 29

  • rbertolissi/Qwen2.5-1.5B-GitHub-Python

    Updated Jul 28

  • rbertolissi/Llama-3.2-1B-TTMM-GitHub-Python

    Updated Jul 29

  • rbertolissi/Llama-3.2-1B-MMLU

    Updated Jul 29

  • rbertolissi/Llama-3.2-1B-TTMM-MMLU

    Updated Jul 29
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs