arxiv:2412.04537
rokosbasilisk
rb
AI & ML interests
Large multi-modal world models
Recent Activity
updated a dataset 1 day ago: antieval/frontier_sweep_evals
published a dataset 1 day ago: antieval/frontier_sweep_evals
updated a dataset 2 days ago: antieval/swebench-trajectories