eleutherai/lm-evaluation-harness
steadyA framework for few-shot evaluation of language models.
Python
View on GitHub
Stars
13,076
Forks
3,362
Open issues
570
24h
+14
+0.1%
7d
+114
+0.9%
Refresh
2h
Star history (7 days)
Last checked
8m ago
Last pushed
1d ago
Next check
just now