mengfanxu's picture

mengfanxu

fxmeng

·

https://fxmeng.github.io

fxmeng

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention

authored a paper 4 days ago

LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning

authored a paper 4 days ago

LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning

View all activity

Organizations

None yet

authored 5 papers 4 days ago

LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning

Paper • 2502.14644 • Published Feb 20, 2025

LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning

Paper • 2412.13626 • Published Dec 18, 2024

Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners

Paper • 2305.14825 • Published May 24, 2023 • 1

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 154

HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention

Paper • 2603.28458 • Published 6 days ago • 34

authored a paper 7 months ago

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill \& Decode Inference

Paper • 2508.15881 • Published Aug 21, 2025 • 10

authored 2 papers about 1 year ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11, 2025 • 69

CLOVER: Constrained Learning with Orthonormal Vectors for Eliminating Redundancy

Paper • 2411.17426 • Published Nov 26, 2024

authored a paper almost 2 years ago

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models

Paper • 2404.02948 • Published Apr 3, 2024 • 4