This website requires JavaScript.
Explore
Help
Register
Sign In
kyy
/
llm_trainer
Watch
1
Star
0
Fork
0
You've already forked llm_trainer
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
9658c63cd94d28bba730a19f73397580b9865d6b
llm_trainer
/
examples
/
lora_single_gpu
History
hiyouga
d1587c80de
update examples
2024-03-06 13:14:57 +08:00
..
dpo.sh
update examples
2024-03-06 13:14:57 +08:00
ppo.sh
update examples
2024-03-06 13:14:57 +08:00
predict.sh
update examples
2024-03-06 13:14:57 +08:00
pretrain.sh
update examples
2024-03-06 13:14:57 +08:00
README.md
add examples
2024-03-05 03:16:35 +08:00
reward.sh
update examples
2024-03-06 13:14:57 +08:00
sft.sh
update examples
2024-03-06 13:14:57 +08:00
README.md
Usage:
pretrain.sh
sft.sh
->
reward.sh
->
ppo.sh
sft.sh
->
dpo.sh
->
predict.sh
Reference in New Issue
View Git Blame
Copy Permalink