trl

Community

Train transformer language models with reinforcement learning.
