WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents
Paper
•
2601.21872
•
Published
None defined yet.
WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents
GroundedPRM: Tree-Guided and Fidelity-Aware Process Reward Modeling for Step-Level Reasoning