hiyouga
|
54c6905937
|
add docstrings, refactor logger
|
2024-09-08 00:56:56 +08:00 |
|
hiyouga
|
e2a28f51c6
|
add adam_mini to readme
|
2024-08-09 20:02:03 +08:00 |
|
moontidef
|
82bc15dc79
|
feat: add support for adammini
|
2024-08-07 10:08:22 +08:00 |
|
moontidef
|
40908a36fa
|
fix: rename optimzer to optimizer
|
2024-08-07 10:05:01 +08:00 |
|
hiyouga
|
29ebcd75d5
|
fix up
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
8baf3b22b0
|
refactor pissa, improve llamaboard
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
095fab58d3
|
tiny fix about badam
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
d0f953bf5b
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
4cff6a4ad5
|
fix templates
|
2024-06-19 17:44:05 +08:00 |
|
Jonery
|
5c2ff1b749
|
Cleaner integration.
|
2024-06-19 12:29:40 +08:00 |
|
Jonery
|
ea1f3ba5e0
|
Merge remote-tracking branch 'upstream/main'
|
2024-06-17 18:44:51 +08:00 |
|
hiyouga
|
46093b5786
|
fix tol
|
2024-06-16 01:38:44 +08:00 |
|
hiyouga
|
8c1046d78a
|
support pissa
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
d87108daa6
|
add license
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
cf9f2d6c42
|
fix #4209
DeepSpeed ZeRO3 has inflight param error when calling model.eval()
|
2024-06-13 02:25:50 +08:00 |
|
hiyouga
|
89f2bd8c8c
|
fix #4198
|
2024-06-11 15:38:38 +08:00 |
|
hiyouga
|
f9e818d79c
|
fix #4120
|
2024-06-07 04:18:05 +08:00 |
|
hiyouga
|
74f96efef9
|
rename files
|
2024-06-07 00:09:06 +08:00 |
|