LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning Paper • 2502.14644 • Published Feb 20, 2025
LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning Paper • 2412.13626 • Published Dec 18, 2024
Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners Paper • 2305.14825 • Published May 24, 2023 • 1
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published Dec 31, 2025 • 154
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention Paper • 2603.28458 • Published 6 days ago • 34
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference Paper • 2508.15881 • Published Aug 21, 2025 • 10
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published Feb 11, 2025 • 69
CLOVER: Constrained Learning with Orthonormal Vectors for Eliminating Redundancy Paper • 2411.17426 • Published Nov 26, 2024
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models Paper • 2404.02948 • Published Apr 3, 2024 • 4