skyhookadventure/soft_optim

This repository was archived by the owner on Apr 25, 2023. It is now read-only.

About

A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)