This repository was archived by the owner on Apr 25, 2023. It is now read-only.

skyhookadventure/soft_optim

283 Commits

About

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
