arxiv:2412.04537
rokosbasilisk
rb
AI & ML interests
Large multi-modal world models
Recent Activity
updated a dataset about 13 hours ago: antieval/frontier_sweep_evals
published a dataset 2 days ago: antieval/frontier_sweep_evals
updated a dataset 3 days ago: antieval/swebench-trajectories