hiyouga
|
4736344eb1
|
disentangle model from tuner and rename modules
|
2023-11-15 16:29:09 +08:00 |
|
hiyouga
|
4bd8e3906d
|
fix flashattn warning
|
2023-11-10 18:34:54 +08:00 |
|
hiyouga
|
2818af0b09
|
refactor model_dtype, fix PPO trainer
|
2023-10-11 23:16:01 +08:00 |
|
hiyouga
|
0a356bc897
|
fix flash shift short attention
|
2023-10-09 17:54:48 +08:00 |
|
hiyouga
|
ab65c3063b
|
fix shift short attention
|
2023-10-09 17:07:46 +08:00 |
|
hiyouga
|
5d4118b096
|
tiny fix
|
2023-09-28 01:03:04 +08:00 |
|
hiyouga
|
d2ebd225db
|
tiny fix
|
2023-09-28 01:02:11 +08:00 |
|
hiyouga
|
c902236397
|
fix #1064
|
2023-09-28 00:53:29 +08:00 |
|
hiyouga
|
84b7486885
|
fix layer norm dtype
|
2023-09-28 00:25:55 +08:00 |
|
hiyouga
|
90375f600d
|
support LongLoRA
|
2023-09-27 21:55:50 +08:00 |
|