Commit Graph

  • 6f2b563f12 release v0.6.0 hiyouga 2024-03-25 22:38:56 +08:00
  • bb4ca1691a Update README_zh.md Tsumugii24 2024-03-25 22:31:03 +08:00
  • f33a3dfadc Merge pull request #2963 from rkinas/patch-1 hoshi-hiyouga 2024-03-25 21:49:34 +08:00
  • b02899bf89 Update requirements.txt Remek Kinas 2024-03-25 14:30:58 +01:00
  • 558a538724 tiny fix hiyouga 2024-03-25 21:18:08 +08:00
  • 49f9dbb4b1 Merge pull request #2945 from marko1616/bugfix/lora-model-merge hoshi-hiyouga 2024-03-25 13:36:08 +08:00
  • c8f0d99704 pass ruff check marko1616 2024-03-24 16:12:10 +08:00
  • 6f080fdba3 fix Llama lora merge crash marko1616 2024-03-24 03:06:11 +08:00
  • 51349ea1cc fix Llama lora merge crash marko1616 2024-03-24 02:55:23 +08:00
  • c1e2c4ea45 fix Llama lora merge crash marko1616 2024-03-24 02:44:35 +08:00
  • 140ad4ad56 fix #2936 hiyouga 2024-03-24 00:43:21 +08:00
  • 7afbc85dae fix #2928 hiyouga 2024-03-24 00:34:54 +08:00
  • a1c8c98c5f fix #2941 hiyouga 2024-03-24 00:28:44 +08:00
  • 564d57aa23 Update wechat.jpg hiyouga 2024-03-22 14:00:37 +08:00
  • ce261fdd64 Merge pull request #2919 from 0xez/main hoshi-hiyouga 2024-03-22 12:12:24 +08:00
  • be0360303d Update README_zh.md, fix the release date of the paper 0xez 2024-03-22 10:41:17 +08:00
  • 675ba41562 Update README.md, fix the release date of the paper 0xez 2024-03-21 22:14:48 +08:00
  • 96702620c4 move file hiyouga 2024-03-21 17:05:17 +08:00
  • 5eaa50fa01 add citation hiyouga 2024-03-21 17:04:10 +08:00
  • 0581bfdbc7 paper release hiyouga 2024-03-21 13:49:17 +08:00
  • bfe7a91289 update readme hiyouga 2024-03-21 00:48:42 +08:00
  • 8408225162 support fsdp + qlora hiyouga 2024-03-21 00:36:06 +08:00
  • 3271af2afc add orca_dpo_pairs dataset hiyouga 2024-03-20 20:09:06 +08:00
  • b2dfbd728f Merge pull request #2905 from SirlyDreamer/main hoshi-hiyouga 2024-03-20 18:09:54 +08:00
  • 9bec3c98a2 fix #2777 #2895 hiyouga 2024-03-20 17:59:45 +08:00
  • 7b8f502901 fix #2346 hiyouga 2024-03-20 17:56:33 +08:00
  • e165965341 Follow HF_ENDPOINT environment variable SirlyDreamer 2024-03-20 08:31:30 +00:00
  • a773035709 Merge pull request #2903 from khazic/main hoshi-hiyouga 2024-03-20 16:13:44 +08:00
  • 8d10fa71c2 Updated README with new information khazic 2024-03-20 14:38:08 +08:00
  • 0531dac30d Updated README with new information khazic 2024-03-20 14:21:16 +08:00
  • df9b4fb90a Updated README with new information 刘一博 2024-03-20 14:11:28 +08:00
  • bea31b9b12 Update wechat.jpg hiyouga 2024-03-18 16:48:32 +08:00
  • 8e04794b2d fix packages hiyouga 2024-03-17 22:32:03 +08:00
  • 85c376fc1e fix patcher hiyouga 2024-03-15 19:18:42 +08:00
  • 113cc04719 Merge pull request #2849 from S3Studio/DockerizeSupport hoshi-hiyouga 2024-03-15 19:16:02 +08:00
  • 6bc2c23b6d fix export hiyouga 2024-03-15 15:06:30 +08:00
  • e75407febd Use official Nvidia base image S3Studio 2024-03-14 18:03:33 +08:00
  • 6a5693d11d improve Docker build and runtime parameters S3Studio 2024-03-12 14:05:10 +08:00
  • 6ebde4f23e tiny fix hiyouga 2024-03-14 21:19:06 +08:00
  • 3b4a59bfb1 fix export hiyouga 2024-03-14 18:17:01 +08:00
  • 8172530d54 fix bug hiyouga 2024-03-13 23:55:31 +08:00
  • 714d936dfb fix bug hiyouga 2024-03-13 23:43:42 +08:00
  • 72367307df improve lora+ impl. hiyouga 2024-03-13 23:32:51 +08:00
  • 4e5e99af43 Merge pull request #2830 from qibaoyuan/lora_plus hoshi-hiyouga 2024-03-13 20:15:46 +08:00
  • a0965cd62c [FEATURE]: ADD LORA+ ALGORITHM 齐保元 2024-03-13 19:43:27 +08:00
  • dfd451b722 Update wechat.jpg hiyouga 2024-03-13 19:03:00 +08:00
  • 0b4a5bf509 fix #2817 hiyouga 2024-03-13 12:42:03 +08:00
  • b9f87cdc11 fix #2802 hiyouga 2024-03-13 12:33:45 +08:00
  • 96ce76cd27 fix kv cache hiyouga 2024-03-13 01:21:50 +08:00
  • 19ef482649 support QDoRA hiyouga 2024-03-12 22:12:42 +08:00
  • 70a3052dd8 patch for gemma cpt hiyouga 2024-03-12 21:21:54 +08:00
  • 60cc17f3a8 fix plot issues hiyouga 2024-03-12 18:41:35 +08:00
  • b3247d6a16 support olmo hiyouga 2024-03-12 18:30:38 +08:00
  • 8d8956bad5 fix #2802 hiyouga 2024-03-12 17:08:34 +08:00
  • 06c97083e1 fix #2803 hiyouga 2024-03-12 16:57:39 +08:00
  • 07f9b754a7 fix #2782 #2798 hiyouga 2024-03-12 15:53:29 +08:00
  • c901aa63ff Merge pull request #2743 from S3Studio/DockerizeSupport hoshi-hiyouga 2024-03-12 00:05:49 +08:00
  • e874c00906 fix #2775 hiyouga 2024-03-11 00:42:54 +08:00
  • 352693e2dc tiny fix hiyouga 2024-03-11 00:17:18 +08:00
  • be99799413 update parser hiyouga 2024-03-10 13:35:20 +08:00
  • 8664262cde support layerwise galore hiyouga 2024-03-10 00:24:11 +08:00
  • 18ffce36b5 fix #2732 hiyouga 2024-03-09 22:37:16 +08:00
  • bdb496644c allow non-packing pretraining hiyouga 2024-03-09 22:21:46 +08:00
  • 412c52e325 fix #2766 hiyouga 2024-03-09 21:35:24 +08:00
  • af0e370fb1 use default arg for freeze tuning hiyouga 2024-03-09 06:08:48 +08:00
  • 818726e9bc add GaLore results hiyouga 2024-03-09 04:11:55 +08:00
  • 393c2de27c update hardware requirements hiyouga 2024-03-09 03:58:18 +08:00
  • 4c00bcdcae update examples hiyouga 2024-03-09 02:30:37 +08:00
  • e8dd38b7fd fix #2756 , patch #2746 hiyouga 2024-03-09 02:01:26 +08:00
  • 516d0ddc66 Merge pull request #2746 from stephen-nju/main hoshi-hiyouga 2024-03-09 01:37:00 +08:00
  • 74ff8664d7 Update setup.py hiyouga 2024-03-09 00:14:48 +08:00
  • 10be2f0ecc fix aqlm version hiyouga 2024-03-09 00:09:09 +08:00
  • 8a45213440 fix example params hiyouga 2024-03-08 20:41:43 +08:00
  • aa71571b77 update stephen_zhu 2024-03-08 12:47:44 +08:00
  • cdb7f82869 fix ppo runtime error stephen 2024-03-08 11:48:26 +08:00
  • 3d911ae713 Add dockerize support S3Studio 2024-03-08 10:47:28 +08:00
  • 4a2cc60b94 update readme hiyouga 2024-03-08 03:06:21 +08:00
  • 5d956e2a51 fix chat engine, update webui hiyouga 2024-03-08 03:01:53 +08:00
  • 5cd4947650 Update setup.py hiyouga 2024-03-08 01:23:00 +08:00
  • 0ac6b40a47 update galore args hiyouga 2024-03-08 01:17:32 +08:00
  • 33a4c24a8a fix galore hiyouga 2024-03-08 00:44:51 +08:00
  • 57452a4aa1 add Yi-9B model hiyouga 2024-03-07 23:11:57 +08:00
  • 7230e1177d add galore examples hiyouga 2024-03-07 22:53:45 +08:00
  • 28f7862188 support galore hiyouga 2024-03-07 22:41:36 +08:00
  • 725f7cd70f update readme hiyouga 2024-03-07 20:34:49 +08:00
  • 77211d9843 tiny fix hiyouga 2024-03-07 20:29:34 +08:00
  • a0dc721816 Merge pull request #2739 from hiyouga/dev-vllm hoshi-hiyouga 2024-03-07 20:28:18 +08:00
  • d07ad5cc1c support vllm hiyouga 2024-03-07 20:26:31 +08:00
  • f74f804a71 fix #2735 hiyouga 2024-03-07 16:15:53 +08:00
  • 2185855bdb Merge pull request #2730 from cx2333-gt/main hoshi-hiyouga 2024-03-07 14:37:18 +08:00
  • 94b7a1b915 revert choice name cx2333 2024-03-07 14:28:55 +08:00
  • 921ee82267 fix chatglm3 template hiyouga 2024-03-07 14:26:16 +08:00
  • 08d7dc06f2 Update wechat.jpg hiyouga 2024-03-07 13:14:10 +08:00
  • a8889498fa fix flash_attn in train_web cx2333 2024-03-07 10:13:55 +08:00
  • 0048a2021e tiny fix hiyouga 2024-03-06 17:25:08 +08:00
  • 3e84f430b1 export use balanced gpu hiyouga 2024-03-06 16:33:14 +08:00
  • 9658c63cd9 fix add tokens hiyouga 2024-03-06 15:04:02 +08:00
  • 3016e65657 fix version checking hiyouga 2024-03-06 14:51:51 +08:00
  • d1587c80de update examples hiyouga 2024-03-06 13:14:57 +08:00
  • e0c47358f9 fix arg dtype hiyouga 2024-03-05 20:53:30 +08:00