huggingface/trl
steadyTrain transformer language models with reinforcement learning.
Python
View on GitHub
Stars
18,719
Forks
2,806
Open issues
298
24h
+11
+0.1%
7d
+69
+0.4%
Refresh
2h
Star history (7 days)
Last checked
1h ago
Last pushed
3h ago
Next check
just now