Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
280
9
15
Edward Beeching
PRO
edbeeching
Follow
aractingi's profile picture
Arshiaboss's profile picture
MarcusAGray's profile picture
245 followers
Β·
30 following
https://edbeeching.github.io/
edbeeching
AI & ML interests
None yet
Recent Activity
updated
a dataset
1 day ago
synthetic-data-universe/gpt-oss-20b-gen4
published
a dataset
1 day ago
synthetic-data-universe/gpt-oss-20b-gen4
updated
a Space
2 days ago
synthetic-data-universe/synth
View all activity
Organizations
edbeeching
's models
372
Sort:Β Recently updated
edbeeching/Qwen2.5-1.5B-Open-R1-Distill-dev
Updated
Jul 25
edbeeching/OpenR1-Distill-7B-packing-benchmarks
8B
β’
Updated
Jun 9
β’
11
edbeeching/OpenR1-Distill-7B
Text Generation
β’
8B
β’
Updated
Jun 7
β’
7
edbeeching/SmolLM3-3B-instruct
Updated
Jun 2
edbeeching/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
β’
2B
β’
Updated
Jun 2
β’
10
edbeeching/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO
2B
β’
Updated
May 22
β’
9
edbeeching/Qwen2.5-7B-Instruct-GRPO
8B
β’
Updated
Mar 25
β’
10
edbeeching/Qwen2.5-Math-7B-Instruct-SFT
Text Generation
β’
8B
β’
Updated
Mar 25
β’
12
edbeeching/Qwen2.5-1.5B-Open-R1-Code-GRPO
Updated
Mar 11
edbeeching/Qwen2.5-Coder-3B-Instruct-sft
Text Generation
β’
3B
β’
Updated
Feb 22
β’
9
edbeeching/pythia-1b-deduped-tldr-online-dpo
Updated
Feb 19
edbeeching/DeepSeek-R1-Distill-Qwen-1.5-GRPO
2B
β’
Updated
Feb 7
β’
8
edbeeching/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
Jan 30
edbeeching/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
Jan 30
edbeeching/gkd-model-compile
Updated
Oct 17, 2024
edbeeching/gkd-model-no-compile
Updated
Oct 17, 2024
edbeeching/EleutherAI_pythia-1b
Text Generation
β’
1B
β’
Updated
Aug 1, 2024
β’
7
edbeeching/EleutherAI_pythia-2.8b
Text Generation
β’
3B
β’
Updated
Aug 1, 2024
β’
4
edbeeching/dpo_tldr_1b
Text Generation
β’
1B
β’
Updated
Aug 1, 2024
β’
4
edbeeching/EleutherAI_pythia-6.9b
Updated
Jul 26, 2024
edbeeching/online_dpo_tldr_6.9b
Text Generation
β’
7B
β’
Updated
Jul 25, 2024
β’
4
edbeeching/dpo_tldr_6.9b
Updated
Jul 25, 2024
edbeeching/vsft-llava_builder_Meta-Llama-3-8B
Image-to-Text
β’
8B
β’
Updated
Apr 23, 2024
β’
4
edbeeching/vsft-llava_builder-meta-Llama-3-8B
Updated
Apr 23, 2024
edbeeching/vsft-llava_builder_zephyr-7b-beta
Image-to-Text
β’
8B
β’
Updated
Apr 20, 2024
β’
3
edbeeching/vsft-llava_builder
Updated
Apr 19, 2024
edbeeching/atari_2B_atari_stargunner_2222
Reinforcement Learning
β’
Updated
Apr 16, 2024
β’
4
edbeeching/atari_2B_atari_stargunner_1111
Reinforcement Learning
β’
Updated
Apr 16, 2024
β’
4
edbeeching/atari_2B_atari_spaceinvaders_2222
Reinforcement Learning
β’
Updated
Apr 16, 2024
β’
6
edbeeching/atari_2B_atari_spaceinvaders_1111
Reinforcement Learning
β’
Updated
Apr 16, 2024
β’
7
Previous
1
2
3
...
13
Next