Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
130.7
TFLOPS
83
106
184
Daniel Fox
FlameF0X
Follow
Gargaz's profile picture
daya-s's profile picture
hugo5454's profile picture
53 followers
Ā·
31 following
https://flamef0x.github.io
FlameF0X
AI & ML interests
Pre-training text generator. (Brother, im 18) Please don't try to contact me.
Recent Activity
reacted
to
anakin87
's
post
with ā¤ļø
about 2 hours ago
A small model that struggled against a random opponent now beats GPT-5-mini at tic-tac-toe I took https://huggingface.co/LiquidAI/LFM2-2.6B and trained it through play. š§āš³ Here's how: 1ļøā£ Build a solid RL env with Verifiers (Prime Intellect) 2ļøā£ Generate synthetic data: <200 games sampled from GPT-5-mini playing in the env 3ļøā£ SFT warm-up to teach format 4ļøā£ Group-based RL (CISPO) against opponents making 20-70% random moves 5ļøā£ RL again with stronger opponents (0-25% random moves) + 1.25 temperature to push exploration and shake off suboptimal strategies Done! Beats GPT-5-mini š --- š® Play against the model: https://huggingface.co/spaces/anakin87/LFM2-2.6B-mr-tictactoe š¤ Model: https://huggingface.co/anakin87/LFM2-2.6B-mr-tictactoe š Walkthrough/course: https://github.com/anakin87/llm-rl-environments-lil-course š¤ Dataset and checkpoints: https://huggingface.co/collections/anakin87/lfm2-26b-mr-tic-tac-toe
liked
a model
about 4 hours ago
deepseek-ai/DeepSeek-V4-Pro
liked
a model
about 4 hours ago
deepseek-ai/DeepSeek-V4-Flash
View all activity
Organizations
FlameF0X
's datasets
17
Sort:Ā Recently updated
FlameF0X/TinyTask-BM
Viewer
ā¢
Updated
7 days ago
ā¢
1.48k
ā¢
11
ā¢
1
FlameF0X/Traces-Test
Viewer
ā¢
Updated
16 days ago
ā¢
1
ā¢
34
FlameF0X/Claude-Corpus
Viewer
ā¢
Updated
Mar 21
ā¢
41
ā¢
16
FlameF0X/Chess
Viewer
ā¢
Updated
Mar 7
ā¢
200
ā¢
10
FlameF0X/coherence-RLAIF
Viewer
ā¢
Updated
Mar 1
ā¢
49
ā¢
12
FlameF0X/agentic-code
Viewer
ā¢
Updated
Feb 18
ā¢
47.8k
ā¢
100
FlameF0X/arXiv-AI-ML
Viewer
ā¢
Updated
Feb 16
ā¢
2.5k
ā¢
79
FlameF0X/TinyTask2-BM
Viewer
ā¢
Updated
Feb 13
ā¢
28
ā¢
26
FlameF0X/YAAR-data
Viewer
ā¢
Updated
Jan 27
ā¢
120
ā¢
9
FlameF0X/MedWiki
Viewer
ā¢
Updated
Jan 24
ā¢
1.83k
ā¢
20
FlameF0X/test
Updated
Dec 28, 2025
ā¢
5
FlameF0X/Heuristic-Epistemic-Reasoning
Viewer
ā¢
Updated
Dec 26, 2025
ā¢
50
ā¢
9
FlameF0X/Lime
Preview
ā¢
Updated
Dec 13, 2025
ā¢
6
FlameF0X/Safety_Alignment_Benchmark
Viewer
ā¢
Updated
Dec 6, 2025
ā¢
155
ā¢
8
FlameF0X/i3-chat
Viewer
ā¢
Updated
Dec 5, 2025
ā¢
50
ā¢
7
FlameF0X/chat-pretrain
Updated
Dec 5, 2025
ā¢
3
FlameF0X/Lang2Lang
Viewer
ā¢
Updated
Aug 10, 2025
ā¢
92
ā¢
2