Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
demystify-long-cot
community
Activity Feed
Follow
4
AI & ML interests
None defined yet.
Recent Activity
yuexiang96
authored
a paper
2 months ago
Small Models Struggle to Learn from Strong Reasoners
yuexiang96
authored
a paper
2 months ago
Evaluating Vision-Language Models as Evaluators in Path Planning
yuexiang96
authored
a paper
2 months ago
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search
View all activity
Team members
2
demystify-long-cot
's models
29
Sort: Recently updated
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n2-raw-sft-ppo
8B
•
Updated
Jan 20
•
5
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n1-raw-sft-ppo
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n8-rft
Updated
Jan 20
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n4-rft
Updated
Jan 20
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n2-rft
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n8-rft
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n4-rft
8B
•
Updated
Jan 20
•
4
demystify-long-cot/llama-3.1-8b-webit462k-qwq-n1-raw-sft
8B
•
Updated
Jan 20
•
7
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n4-raw-sft
8B
•
Updated
Jan 20
•
5
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n2-raw-sft
8B
•
Updated
Jan 20
•
5
demystify-long-cot/llama-3.1-8b-webit231k-qwq-n1-raw-sft
8B
•
Updated
Jan 20
•
4
demystify-long-cot/llama-3.1-8b-math-qwen-n256-rft
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-math-qwen-n32-rft-ppo
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-math-qwen-n32-rft
8B
•
Updated
Jan 20
•
2
demystify-long-cot/llama-3.1-8b-math-qwen-n64-rft-ppo
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-math-qwen-n64-rft
8B
•
Updated
Jan 20
•
2
demystify-long-cot/llama-3.1-8b-math-qwen-n128-rft-ppo
8B
•
Updated
Jan 20
•
7
demystify-long-cot/llama-3.1-8b-math-qwen-n128-rft
8B
•
Updated
Jan 20
•
7
demystify-long-cot/llama-3.1-8b-math-qwen-n16-rft
8B
•
Updated
Jan 20
•
2
demystify-long-cot/llama-3.1-8b-math-qwq-n192-rft-ppo
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-math-qwq-n192-rft
8B
•
Updated
Jan 20
•
4
demystify-long-cot/llama-3.1-8b-math-qwq-n16-rft
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-math-qwq-n64-rft-ppo
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-math-qwq-n64-rft
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-math-qwq-n32-rft-ppo
8B
•
Updated
Jan 20
•
2
demystify-long-cot/llama-3.1-8b-math-qwq-n32-rft
8B
•
Updated
Jan 20
•
3
demystify-long-cot/llama-3.1-8b-math-qwq-n256-rft
8B
•
Updated
Jan 20
•
4
demystify-long-cot/llama-3.1-8b-math-qwq-n128-rft-ppo
8B
•
Updated
Jan 20
•
5
demystify-long-cot/llama-3.1-8b-math-qwq-n128-rft
8B
•
Updated
Jan 20
•
6