Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deqing 's Collections
Fourier Language Model
Convergent Evolution
Convergent Evolution (Addition)
Convergent Evolution (Architecture and Optimizer)
Convergent Evolution (Data)

Convergent Evolution (Data)

updated 3 days ago
Upvote
-

  • deqing/convergent-llama-300M-muon-original

    Text Generation • 0.3B • Updated 14 days ago • 798

  • deqing/convergent-llama-300M-muon-unigram

    Text Generation • 0.3B • Updated 14 days ago • 268

  • deqing/convergent-llama-300M-muon-isolate-1

    Text Generation • 0.3B • Updated 13 days ago • 6.7k

  • deqing/convergent-llama-300M-muon-swap_numbers

    Text Generation • 0.3B • Updated 14 days ago • 301

  • deqing/convergent-llama-300M-muon-isolate-2

    Text Generation • 0.3B • Updated 12 days ago • 1.24k

  • deqing/convergent-llama-300M-muon-isolate-8

    Text Generation • 0.3B • Updated 12 days ago • 2.19k • 1

  • deqing/convergent-llama-300M-muon-window-2

    Text Generation • 0.3B • Updated 12 days ago • 7.64k

  • deqing/convergent-llama-300M-muon-window-4

    Text Generation • 0.3B • Updated 13 days ago • 7.93k

  • deqing/convergent-llama-300M-muon-window-8

    Text Generation • 0.3B • Updated 13 days ago • 3.78k

  • deqing/convergent-llama-300M-muon-window-64

    Text Generation • 0.3B • Updated 12 days ago • 1.21k • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs