hiyouga | 2f4b89ace1 | loose gemma2 attention | 2024-06-29 01:42:14 +08:00
hiyouga | 4d35e218b1 | bf16 by default, gemma2 attns. Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674 | 2024-06-28 06:00:26 +08:00
stceum | 3ed063f281 | Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this. | 2024-06-24 20:39:31 +08:00
hiyouga | d87108daa6 | add license | 2024-06-15 17:54:33 +08:00
hiyouga | 2ed8270112 | clean code | 2024-06-13 01:58:16 +08:00
hiyouga | 74f96efef9 | rename files | 2024-06-07 00:09:06 +08:00