WebLINX
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue • Paper • 2402.05930 • Published Feb 8, 2024 • 40
McGill-NLP/WebLINX • Updated Dec 7, 2024 • 79.8k • 426 • 62
McGill-NLP/WebLINX-full • Updated Mar 7 • 66.7k • 6
McGill-NLP/weblinx-browsergym • Updated Dec 7, 2024 • 627 • 4
AgentRewardBench
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories • Paper • 2504.08942 • Published Apr 11 • 27
McGill-NLP/agent-reward-bench • Updated Apr 21 • 1.41k • 12.9k • 4
Agent Reward Bench Demo: Visualize agent interactions with WebArena tasks
Agent Reward Bench Leaderboard: Leaderboard for AgentRewardBench