This website requires JavaScript.
Explore
Help
Register
Sign In
kyy
/
llm_trainer
Watch
1
Star
0
Fork
0
You've already forked llm_trainer
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
9658c63cd94d28bba730a19f73397580b9865d6b
llm_trainer
/
examples
/
lora_single_gpu
/
README.md
hiyouga
76f31b18eb
add examples
2024-03-05 03:16:35 +08:00
101 B
Raw
Blame
History
Usage:
pretrain.sh
sft.sh
->
reward.sh
->
ppo.sh
sft.sh
->
dpo.sh
->
predict.sh
Reference in New Issue
View Git Blame
Copy Permalink