Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
reaperdoesntknow
/
DiscoverLM-70M
like
0
Text Generation
Transformers
TensorBoard
Safetensors
nohurry/Opus-4.6-Reasoning-3000x-filtered
openbmb/UltraData-Math
yahma/alpaca-cleaned
English
moa_metric
trl
sft
metric-attention
mixture-of-attentions
triangle-inequality
blackhole-rope
discrepancy-calculus
discover
License:
cc
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
main
DiscoverLM-70M
281 MB
Ctrl+K
Ctrl+K
1 contributor
History:
12 commits
reaperdoesntknow
Update README.md
b6d5c2d
verified
11 days ago
.gitattributes
Safe
1.52 kB
initial commit
11 days ago
README.md
11.1 kB
Update README.md
11 days ago
config.json
1.6 kB
Upload MoAMetricLM
11 days ago
events.out.tfevents.1772979692.a28ffe9e0143.11703 (1).0
201 kB
xet
Upload 2 files
11 days ago
events.out.tfevents.1772979692.a28ffe9e0143.11703 (2).0
201 kB
xet
Upload events.out.tfevents.1772979692.a28ffe9e0143.11703 (2).0
11 days ago
generation_config.json
204 Bytes
Upload MoAMetricLM
11 days ago
model.safetensors
277 MB
xet
Upload MoAMetricLM
11 days ago
tokenizer.json
3.38 MB
Upload tokenizer
11 days ago
tokenizer_config.json
349 Bytes
Upload tokenizer
11 days ago
trainer_state.json
148 kB
Rename trainer_state (2).json to trainer_state.json
11 days ago