GEM benchmark

https://gem-benchmark.com

Activity Feed Request to join this org

AI & ML interests

We develop infrastructure for the evaluation of generated text.

Recent Activity

lewtun submitted a paper 14 days ago

Single-minus gluon tree amplitudes are nonzero

lewtun submitted a paper 15 days ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

gentaiscool authored a paper about 1 month ago

PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

View all activity

GEM 's models

None public yet