WPRM
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
53

WPRM/qwen2.5-ar-reward-rejected-action-ablation-1
3B
•
Updated
•
12

WPRM/llama-3.1-8b-ar-rm-mtl
8B
•
Updated
•
8

WPRM/qwen3-8b-ar-reward-cot-mtl-checklist-enhanced
8B
•
Updated
•
2

WPRM/qwen-3b-ar-reward-cot-mtl-checklist-enhanced
3B
•
Updated
•
5

WPRM/qwen3-8b-checklist-enhanced
8B
•
Updated
•
5

WPRM/qwen3-ar-reward-cot-mtl-same-ratio-epoch2
8B
•
Updated
•
4

WPRM/qwen3-ar-reward-cot-mtl
8B
•
Updated
•
4

WPRM/qwen3-ar-reward-cot-mtl-epoch1
8B
•
Updated
•
6

WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_mtl
4B
•
Updated
•
7

WPRM/qwen2.5-ar-reward-cot-mtl
3B
•
Updated
•
6
datasets
118
WPRM/gitlab_failed_data
Viewer
•
Updated
•
16
•
70
WPRM/ours_8b_mtl_enhanced_annotated_workarena_checklist
Viewer
•
Updated
•
334
•
68
WPRM/ours_3b_mtl_enhanced_annotated_workarena_checklist
Viewer
•
Updated
•
334
•
56
WPRM/4omini_obs_annotated_workarena_checklist
Viewer
•
Updated
•
334
•
56
WPRM/ours_llama_8b_annotated_walite_combined_checklist
Viewer
•
Updated
•
812
•
58
WPRM/workarena_checklist_raw
Viewer
•
Updated
•
334
•
52
WPRM/human_dataset_sample_50
Viewer
•
Updated
•
50
•
58
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-3
Viewer
•
Updated
•
21.8k
•
42
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-2
Viewer
•
Updated
•
18.1k
•
50
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-1
Viewer
•
Updated
•
12.1k
•
47