mowenyii / d3po Goto Github PK
View Code? Open in Web Editor NEWThis project forked from yk7333/d3po
Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Home Page: https://arxiv.org/abs/2311.13231
License: MIT License