mistralai/Mistral-7B-Instruct-v0.3
7B • Updated • 3.41M • 2.54k
black-forest-labs/FLUX.1-dev
Text-to-Image • Updated • 731k • 12.7k
PKU-Alignment/align-anything
Viewer • Updated • 69.4k • 3.08k • 48
NousResearch/hermes-function-calling-v1
Viewer • Updated • 11.6k • 9.87k • 403
google-bert/bert-base-uncased
Fill-Mask • 0.1B • Updated • 59.2M • 2.64k
ThorBaller/Minstral_pubmed_gguf
Question Answering • 7B • Updated • 610
Any-to-Any
• Updated
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 629
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper • 2501.08313 • Published • 302
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Paper • 2501.04519 • Published • 290
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 448
papercup-ai/multilingual-pl-bert
Updated • 72
Raiff1982/recursivetraining
Viewer • Updated • 40 • 9
Raiff1982/codette-training-lab