
LCM-Lab/moba_llama

Text Generation
Transformers
PyTorch
English
llama
conversational
text-generation-inference

Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers


Collection including LCM-Lab/moba_llama

Elastic-Attention

Collection
Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers • 17 items • Updated 1 day ago

Paper for LCM-Lab/moba_llama

Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers

Paper • 2601.17367 • Published 5 days ago