Commit Graph

6 Commits

Author   SHA1        Message                    Date
hiyouga  747db40172  ppo support rm server      2023-12-03 21:38:51 +08:00
hiyouga  7df4f3ab20  implement rm server #1543  2023-12-03 20:52:54 +08:00
hiyouga  1740131d63  fix #1558                  2023-11-19 14:15:47 +08:00
hiyouga  ff52b1779c  fix bug in freeze tuning   2023-11-16 14:25:11 +08:00
hiyouga  856522a3df  fix bug in PPO training    2023-11-16 02:32:54 +08:00
hiyouga  35b91ea34c  fix import bug             2023-11-16 02:27:03 +08:00