Providing various large model fine-tuning methods, including but not limited to SFT, RLHF, offline RL, and more.
phonism / fusionft Goto Github PK
View Code? Open in Web Editor NEWProviding various large model fine-tuning methods, including but not limited to SFT, RLHF, offline RL, and more.
License: MIT License