Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
130.7
TFLOPS
83
106
184
Daniel Fox
FlameF0X
Follow
ninajamy's profile picture
jatsalkes's profile picture
JBNC's profile picture
53 followers
Ā·
31 following
https://flamef0x.github.io
FlameF0X
AI & ML interests
Pre-training text generator. (Brother, im 18) Please don't try to contact me.
Recent Activity
reacted
to
anakin87
's
post
with ā¤ļø
about 1 hour ago
A small model that struggled against a random opponent now beats GPT-5-mini at tic-tac-toe I took https://huggingface.co/LiquidAI/LFM2-2.6B and trained it through play. š§āš³ Here's how: 1ļøā£ Build a solid RL env with Verifiers (Prime Intellect) 2ļøā£ Generate synthetic data: <200 games sampled from GPT-5-mini playing in the env 3ļøā£ SFT warm-up to teach format 4ļøā£ Group-based RL (CISPO) against opponents making 20-70% random moves 5ļøā£ RL again with stronger opponents (0-25% random moves) + 1.25 temperature to push exploration and shake off suboptimal strategies Done! Beats GPT-5-mini š --- š® Play against the model: https://huggingface.co/spaces/anakin87/LFM2-2.6B-mr-tictactoe š¤ Model: https://huggingface.co/anakin87/LFM2-2.6B-mr-tictactoe š Walkthrough/course: https://github.com/anakin87/llm-rl-environments-lil-course š¤ Dataset and checkpoints: https://huggingface.co/collections/anakin87/lfm2-26b-mr-tic-tac-toe
liked
a model
about 3 hours ago
deepseek-ai/DeepSeek-V4-Pro
liked
a model
about 3 hours ago
deepseek-ai/DeepSeek-V4-Flash
View all activity
Organizations
FlameF0X
's buckets
1
Sort:Ā Recently updated
FlameF0X/test
14.6 kB