mengfanxu

fxmeng
https://fxmeng.github.io
  • fxmeng

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention
authored a paper 3 days ago
LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning
authored a paper 3 days ago
LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning

Organizations

None yet

commented a paper 7 months ago

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference

Paper • 2508.15881 • Published Aug 21, 2025 • 10 • 2
commented 3 papers about 1 year ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11, 2025 • 69 • 9
New activity in MMMU/MMMU over 2 years ago

Question about "Text as Input"

#4 opened over 2 years ago by fxmeng