Implementation of the Llama architecture with RLHF + Q-learning
kirilcvetkov92 / llama-qrlhf
This project is forked from lucidrains/llama-qrlhf
License: MIT License