hiyouga | e3d8fc75eb | support badam for all stages | 2024-04-16 17:44:48 +08:00
hiyouga | 92dab8a90b | simplify readme | 2024-04-02 20:07:43 +08:00
hiyouga | 4a6ca621c0 | fix #3083 | 2024-04-01 22:53:52 +08:00
hiyouga | 5b9b40403d | fix IPO and ORPO loss | 2024-04-01 14:37:53 +08:00
hiyouga | 5907216a1c | fix plots | 2024-03-31 19:43:48 +08:00
hiyouga | 17bf8a2c3a | support ORPO | 2024-03-31 18:29:50 +08:00
hiyouga | ca793028c6 | release v0.6.1 | 2024-03-29 11:36:08 +08:00
hiyouga | 8c77b10912 | update trainers | 2024-03-28 18:16:27 +08:00
hoshi-hiyouga | 3bcd41b639 | fix ds optimizer | 2024-03-26 23:39:56 +08:00
hiyouga | 511f675402 | fix #2961 | 2024-03-26 17:26:14 +08:00
hiyouga | 9bec3c98a2 | fix #2777 #2895 | 2024-03-20 17:59:45 +08:00
hiyouga | 8664262cde | support layerwise galore | 2024-03-10 00:24:11 +08:00
hiyouga | bdb496644c | allow non-packing pretraining | 2024-03-09 22:21:46 +08:00
hiyouga | 28f7862188 | support galore | 2024-03-07 22:41:36 +08:00
hiyouga | 4e5fae2fac | fix #2649 | 2024-03-01 13:02:41 +08:00
hiyouga | 3cc10a01a7 | fix #2532 | 2024-02-21 21:55:14 +08:00
hiyouga | 638234ceee | format style | 2024-01-20 20:15:56 +08:00
hiyouga | f6d6e00337 | fix tests | 2024-01-20 19:58:04 +08:00
hiyouga | 38af076a75 | support longlora for main branch | 2024-01-20 19:25:22 +08:00
hiyouga | d9f1cae351 | support function calling | 2024-01-18 09:54:23 +08:00
hiyouga | 4b2d11ec28 | fix #2164 | 2024-01-12 00:27:57 +08:00
hiyouga | 074745b170 | fix dpo trainer | 2023-12-23 01:51:55 +08:00
hiyouga | 7aad0b889d | support unsloth | 2023-12-23 00:14:33 +08:00
hiyouga | b87c74289d | support dpo-ftx | 2023-12-16 19:21:41 +08:00
hiyouga | 7df4f3ab20 | implement rm server #1543 | 2023-12-03 20:52:54 +08:00
hiyouga | 1740131d63 | fix #1558 | 2023-11-19 14:15:47 +08:00
hiyouga | 35b91ea34c | fix import bug | 2023-11-16 02:27:03 +08:00
hiyouga | ce78303600 | support full-parameter PPO | 2023-11-16 02:08:04 +08:00
hiyouga | 4736344eb1 | disentangle model from tuner and rename modules | 2023-11-15 16:29:09 +08:00