The collection for the Paper "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
HKUST NLP Group
university
The collection for the Paper "Pitfalls of Rule- and Model-based Verifiers: A Case Study on Mathematical Reasoning."
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 11 • 1
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 10 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 7
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 6
models 62
hkust-nlp/WebExplorer-8B
Image-Text-to-Text • 8B • Updated • 31 • 6
hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier
Reinforcement Learning • 8B • Updated • 7
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 6
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 7
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 10 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 11 • 1
hkust-nlp/Laser-DE-L4096-1.5B
2B • Updated • 1.27k
hkust-nlp/Laser-DE-L2048-1.5B
2B • Updated • 8
hkust-nlp/Laser-DE-L1024-1.5B
2B • Updated • 8
hkust-nlp/Laser-D-L4096-1.5B
2B • Updated • 22
datasets 28
hkust-nlp/WebExplorer-QA
Viewer • Updated • 100 • 109 • 4
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw
Updated • 48 • 2
hkust-nlp/CodeIO-PyEdu-Reasoning
Preview • Updated • 119 • 53
hkust-nlp/rl-verifier-pitfalls_hacking_data
Viewer • Updated • 6.12k • 19 • 1
hkust-nlp/deepscaler_simplelr
Viewer • Updated • 40.3k • 35
hkust-nlp/Laser-Deepscaler-Dataset
Viewer • Updated • 40.8k • 82
hkust-nlp/LeetCode-O
Preview • Updated • 52
hkust-nlp/GUIMid
Viewer • Updated • 1.85M • 105 • 5
hkust-nlp/SimpleRL-Zoo-Data
Viewer • Updated • 53.1k • 552 • 6
hkust-nlp/PreSelect-100B
Viewer • Updated • 54.5M • 3.32k • 11