view article Article The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ 23 days ago โข 47
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper โข 2502.11089 โข Published Feb 16, 2025 โข 167
Retentive Network: A Successor to Transformer for Large Language Models Paper โข 2307.08621 โข Published Jul 17, 2023 โข 173