Lexo-Sort SLMs trained to perform lexograp vijay-ravichander/Qwen2.5-0.5B-Lexo-Sort-SFT-v0 Text Generation • 0.5B • Updated Jul 1, 2025 • 3 vijay-ravichander/Qwen2.5-0.5B-Lexo-Sort-SFT-v1 Text Generation • 0.5B • Updated Jul 4, 2025 • 4 vijay-ravichander/Qwen2.5-0.5B-Lexo-Sort 0.5B • Updated Jul 4, 2025 • 1 vijay-ravichander/V3-lexo-sort Viewer • Updated Jul 8, 2025 • 1k • 14
LLM FocusLLM: Scaling LLM's Context by Parallel Decoding Paper • 2408.11745 • Published Aug 21, 2024 • 25 LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58
FocusLLM: Scaling LLM's Context by Parallel Decoding Paper • 2408.11745 • Published Aug 21, 2024 • 25
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58
ColSmol256 Distill Models vijay-ravichander/Qwen-KL-Distill 0.2B • Updated Apr 30, 2025 • 7 vijay-ravichander/Smol-Pairwise-Distill 0.2B • Updated Apr 30, 2025 • 4 vijay-ravichander/Qwen-MMSE-Distill 0.2B • Updated Apr 30, 2025 • 3 vijay-ravichander/Qwen-Pairwise-Distill 0.2B • Updated Apr 30, 2025 • 5
Lexo-Sort SLMs trained to perform lexograp vijay-ravichander/Qwen2.5-0.5B-Lexo-Sort-SFT-v0 Text Generation • 0.5B • Updated Jul 1, 2025 • 3 vijay-ravichander/Qwen2.5-0.5B-Lexo-Sort-SFT-v1 Text Generation • 0.5B • Updated Jul 4, 2025 • 4 vijay-ravichander/Qwen2.5-0.5B-Lexo-Sort 0.5B • Updated Jul 4, 2025 • 1 vijay-ravichander/V3-lexo-sort Viewer • Updated Jul 8, 2025 • 1k • 14
ColSmol256 Distill Models vijay-ravichander/Qwen-KL-Distill 0.2B • Updated Apr 30, 2025 • 7 vijay-ravichander/Smol-Pairwise-Distill 0.2B • Updated Apr 30, 2025 • 4 vijay-ravichander/Qwen-MMSE-Distill 0.2B • Updated Apr 30, 2025 • 3 vijay-ravichander/Qwen-Pairwise-Distill 0.2B • Updated Apr 30, 2025 • 5
LLM FocusLLM: Scaling LLM's Context by Parallel Decoding Paper • 2408.11745 • Published Aug 21, 2024 • 25 LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58
FocusLLM: Scaling LLM's Context by Parallel Decoding Paper • 2408.11745 • Published Aug 21, 2024 • 25
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58