Ai2 Open Coding Agents - Django, Sphinx, Sympy Data
AI & ML interests
Building breatkthrough AI to solve the world's biggest problems.
Recent Activity
Papers
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics
How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs
Organization Card
spaces 13
pinned
Running
20
AstaBench Leaderboard
🥇
View benchmark leaderboards
pinned
Running
421
Reward Bench Leaderboard
📐
Explore RewardBench model rankings and scores
pinned
Running
2
HREF Leaderboard
📐
Browse and search HREF leaderboard data
pinned
Running
91
Zebra Logic Bench
🦓
Show leaderboard and explore model puzzle results
pinned
Running
3
SUPER Leaderboard
🤖
Display a static leaderboard from a JSON file
pinned
Running
53
ZeroEval Leaderboard
📊
Embed ZeroEval for evaluation
models 852
allenai/Flex-pes2o-2x7B-1T
Text Generation • 12B • Updated
• 170 • 2
allenai/Flex-news-2x7B-1T
Text Generation • 12B • Updated
• 165 • 2
allenai/Flex-creative-2x7B-1T
Text Generation • 12B • Updated
• 275 • 5
allenai/Flex-public-7B-1T
Text Generation • 7B • Updated
• 267 • 5
allenai/Flex-code-2x7B-1T
Text Generation • 12B • Updated
• 393 • 2
allenai/Flex-math-2x7B-1T
Text Generation • 12B • Updated
• 368 • 3
allenai/olmo-3-tokenizer-instruct-release
Updated
• 1
allenai/olmOCR-2-7B-1025-FP8
Image-Text-to-Text • 8B • Updated
• 228k • 203
allenai/Olmo-3-7B-RL-Zero-General
Text Generation • 528k • Updated
• 112 • 7
allenai/Olmo-3-7B-RL-Zero-IF
Text Generation • 528k • Updated
• 113 • 6
datasets 368
allenai/prescience
Viewer
• Updated
• 839k • 43 • 9
allenai/dolma3_pool
Preview
• Updated
• 125k • 32
allenai/dolma3_longmino_mix-100B-1125
Preview
• Updated
• 18.2k • 11
allenai/dolma3_dolmino_mix-100B-1125
Preview
• Updated
• 226k • 19
allenai/asta-summary-citation-counts
Viewer
• Updated
• 47M • 390 • 8
allenai/olmix
Preview
• Updated
• 277 • 38
allenai/Dolci-Instruct-DPO
Viewer
• Updated
• 260k • 2.26k • 7
allenai/olmOCR-bench
Benchmark
• Updated
• 2.16k • 104
allenai/Molmo2-MultiImageQA
Viewer
• Updated
• 44.7k • 159 • 2
allenai/molmospaces
Viewer
• Updated
• 772k • 1.24k • 39