SparseLLMs

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

authored a paper 3 months ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 320

authored 2 papers 8 months ago

PipeLLM: Fast and Confidential Large Language Model Services with Speculative Pipelined Encryption

Paper • 2411.03357 • Published Nov 4, 2024

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28, 2025 • 58

authored a paper 8 months ago

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28, 2025 • 58

authored 4 papers 9 months ago

ConPET: Continual Parameter-Efficient Tuning for Large Language Models

Paper • 2309.14763 • Published Sep 26, 2023 • 1

ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs

Paper • 2402.03804 • Published Feb 6, 2024 • 4

ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models

Paper • 2402.13516 • Published Feb 21, 2024 • 1

BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

Paper • 2507.08771 • Published Jul 11, 2025 • 10

updated 6 models 9 months ago

SparseLLM/BlockFFN-Small

Text Generation • Updated Jul 14, 2025 • 7

SparseLLM/BlockFFN-Medium

Text Generation • Updated Jul 14, 2025 • 3

SparseLLM/BlockFFN-Large

Text Generation • Updated Jul 14, 2025 • 6

SparseLLM/BlockFFN-XLarge

Text Generation • Updated Jul 14, 2025 • 12

SparseLLM/BlockFFN-3B-SFT-EAGLE

Text Generation • Updated Jul 14, 2025 • 5

SparseLLM/BlockFFN-3B-SFT

Text Generation • Updated Jul 14, 2025 • 5 • 1

published 6 models 9 months ago

SparseLLM/BlockFFN-Small

Text Generation • Updated Jul 14, 2025 • 7

SparseLLM/BlockFFN-Medium

Text Generation • Updated Jul 14, 2025 • 3

SparseLLM/BlockFFN-Large

Text Generation • Updated Jul 14, 2025 • 6

SparseLLM/BlockFFN-XLarge

Text Generation • Updated Jul 14, 2025 • 12

SparseLLM/BlockFFN-3B-SFT-EAGLE

Text Generation • Updated Jul 14, 2025 • 5

SparseLLM/BlockFFN-3B-SFT

Text Generation • Updated Jul 14, 2025 • 5 • 1