GEM benchmark
https://gem-benchmark.com
AI & ML interests
We develop infrastructure for the evaluation of generated text.
Recent Activity
lewtun submitted a paper 14 days ago: Single-minus gluon tree amplitudes are nonzero
lewtun submitted a paper 15 days ago: Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
gentaiscool authored a paper about 1 month ago: PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues
Team members: 100
GEM's models: None public yet