Commit Graph

  • 898ec3696a fix #2161 hiyouga 2024-01-11 17:04:13 +08:00
  • 1653c22438 improve web ui hiyouga 2024-01-10 12:37:45 +08:00
  • 05ed4e8028 improve model export hiyouga 2024-01-09 22:26:24 +08:00
  • 6b0705bed8 Update wechat.jpg hiyouga 2024-01-09 22:10:41 +08:00
  • 919acc2b0b modify weight name hiyouga 2024-01-09 20:22:47 +08:00
  • 4571068e1e fix #1789 hiyouga 2024-01-09 18:31:27 +08:00
  • ebee4f6a2a fix #2127 hiyouga 2024-01-09 14:49:13 +08:00
  • 3ae735ffe8 fix #2125 hiyouga 2024-01-08 21:42:25 +08:00
  • 0ed526cedf Merge branch 'main' of https://github.com/hiyouga/LLaMA-Factory hiyouga 2024-01-08 14:31:04 +08:00
  • 5e09862b90 Update wechat.jpg hiyouga 2024-01-08 14:30:47 +08:00
  • 379c0ae750 Merge pull request #2117 from dasdristanta13/main hoshi-hiyouga 2024-01-07 23:56:53 +08:00
  • e4cde81851 Update requirements.txt With einops dependency Dristanta Das 2024-01-07 21:03:30 +05:30
  • 0a9986160c tiny fix hiyouga 2024-01-07 17:17:18 +08:00
  • 08464183b9 fix api server hiyouga 2024-01-07 17:14:42 +08:00
  • d2a676c8ba improve model export hiyouga 2024-01-05 18:51:49 +08:00
  • f6fdd83f8a fix #2098 hiyouga 2024-01-05 17:11:26 +08:00
  • ed216bbc46 fix qwen template hiyouga 2024-01-05 16:14:56 +08:00
  • 33f2c0d4f8 fix #2081 hiyouga 2024-01-04 23:19:08 +08:00
  • cc275abe09 fix #2090 hiyouga 2024-01-04 23:05:08 +08:00
  • 368b31f6b7 fix #2067 hiyouga 2024-01-04 22:53:03 +08:00
  • 1696698eb9 fix dispatch hiyouga 2024-01-03 16:33:16 +08:00
  • 24d8d6f224 fix valuehead patch hiyouga 2024-01-03 16:19:23 +08:00
  • 55021097d5 fix rm server hiyouga 2024-01-03 15:30:46 +08:00
  • 3014e3c189 Update wechat.jpg hiyouga 2023-12-31 15:05:59 +08:00
  • 4519d95923 Update wechat.jpg hiyouga 2023-12-29 15:26:22 +08:00
  • ce2156eaa8 fix #2014 hiyouga 2023-12-29 15:17:22 +08:00
  • c7ea17d616 add yuan model hiyouga 2023-12-29 13:50:24 +08:00
  • 47da742fc9 fix version hiyouga 2023-12-29 04:53:36 +08:00
  • 65c5b0477c fix args hiyouga 2023-12-28 18:47:19 +08:00
  • e165354fac fix export format hiyouga 2023-12-28 18:40:46 +08:00
  • 5431be42f9 fix ppo trainer hiyouga 2023-12-28 18:09:28 +08:00
  • db6cb2d0e7 add model link hiyouga 2023-12-25 19:44:38 +08:00
  • 5b93d545e2 tiny update hiyouga 2023-12-25 18:29:34 +08:00
  • e4bb846c43 fix bug hiyouga 2023-12-24 19:20:12 +08:00
  • 6629087e12 update loader hiyouga 2023-12-24 19:10:23 +08:00
  • e44b82ee24 update patcher hiyouga 2023-12-23 15:24:27 +08:00
  • 0bbf7118df fix #1909 hiyouga 2023-12-23 14:42:20 +08:00
  • 0ad86a4f62 update readme hiyouga 2023-12-23 02:17:41 +08:00
  • 779cfefb78 fix unsloth dtype hiyouga 2023-12-23 01:59:49 +08:00
  • 074745b170 fix dpo trainer hiyouga 2023-12-23 01:51:55 +08:00
  • 9a18a85639 llama board: add unsloth hiyouga 2023-12-23 00:35:53 +08:00
  • 7aad0b889d support unsloth hiyouga 2023-12-23 00:14:33 +08:00
  • 315b8367cb Merge pull request #1953 from ShaneTian/model-load-bf16 hoshi-hiyouga 2023-12-22 17:29:54 +08:00
  • d032daa4bd Fix slow model initialization in bfloat16 dtype. ShaneTian 2023-12-21 21:25:20 +08:00
  • ba69378841 fix param type hiyouga 2023-12-21 17:33:01 +08:00
  • 083355fc05 fix ds zero3 check hiyouga 2023-12-21 01:19:22 +08:00
  • af0194e6d9 match version hiyouga 2023-12-20 22:17:35 +08:00
  • ba4d32bf59 Merge pull request #1932 from ShaneTian/main hoshi-hiyouga 2023-12-20 22:13:28 +08:00
  • 390f0caf7f Update transformers to 4.36.2 to resolve bug when saving a checkpoint in the multi-node setting. ShaneTian 2023-12-20 22:00:41 +08:00
  • 7910dbae92 Update wechat.jpg hiyouga 2023-12-20 19:24:37 +08:00
  • dec360d5ae fix stop words hiyouga 2023-12-20 19:06:43 +08:00
  • 5af8841c4f fix yi template #1895 hiyouga 2023-12-20 18:58:16 +08:00
  • 624cc21281 improve quantization hiyouga 2023-12-20 18:27:16 +08:00
  • c4a3977ad7 add max_memory for gptq #1923 hiyouga 2023-12-20 18:15:17 +08:00
  • 31165a9822 fix #1073 #1462 #1735 #1908 hiyouga 2023-12-20 17:15:40 +08:00
  • ec1fe1daa9 optimize data loading logic hiyouga 2023-12-20 16:15:41 +08:00
  • c6abbbfe90 fix #1909 hiyouga 2023-12-20 16:11:07 +08:00
  • f86857bd9e fix mixtral inference #1821 hiyouga 2023-12-20 15:11:15 +08:00
  • 0c6ab7c75e fix #1900 hiyouga 2023-12-19 17:21:46 +08:00
  • edb7d177c2 update readme hiyouga 2023-12-18 22:29:45 +08:00
  • a67a440644 add codegeex template hiyouga 2023-12-18 19:52:35 +08:00
  • 2df923540c add xverse-65B-2 model hiyouga 2023-12-18 19:24:09 +08:00
  • 709ac8870a add models hiyouga 2023-12-18 19:09:31 +08:00
  • 71a9c16171 fix tokenizer for Yi chat models #1617 #1875 hiyouga 2023-12-18 17:18:11 +08:00
  • 2b4e5f0d32 update readme hiyouga 2023-12-18 15:46:45 +08:00
  • c46879575f fix llama board hiyouga 2023-12-16 22:17:37 +08:00
  • 870426ff70 fix #1742 hiyouga 2023-12-16 20:50:45 +08:00
  • 7ae6919b9b add xverse-65b-chat model hiyouga 2023-12-16 20:21:29 +08:00
  • 328ad06bd4 set version hiyouga 2023-12-16 20:17:51 +08:00
  • a66186b872 add noisy mean initialization #1815 hiyouga 2023-12-16 19:47:51 +08:00
  • b87c74289d support dpo-ftx hiyouga 2023-12-16 19:21:41 +08:00
  • 71389be37c support autogptq in llama board #246 hiyouga 2023-12-16 16:31:30 +08:00
  • 93f64ce9a8 Merge pull request #1868 from yhyu13/improve_hfargparser hoshi-hiyouga 2023-12-16 16:06:09 +08:00
  • fc70a92cb6 Use llmtuner logger yhyu13 2023-12-16 07:15:27 +00:00
  • 26817143ff Improve logging for unknown args yhyu13 2023-12-16 05:16:29 +00:00
  • 3551171d49 update tips hiyouga 2023-12-15 23:52:50 +08:00
  • 439a26c276 fix #1770 hiyouga 2023-12-15 23:50:15 +08:00
  • 3524aa1e58 support quantization in export model hiyouga 2023-12-15 23:44:50 +08:00
  • 87ef3f47b5 update dc link hiyouga 2023-12-15 22:11:31 +08:00
  • e2bd597b3c Merge pull request #1864 from hiyouga/dev hoshi-hiyouga 2023-12-15 22:06:56 +08:00
  • 00c77104f8 fix bug hiyouga 2023-12-15 21:54:02 +08:00
  • 9e509b99af fix bug hiyouga 2023-12-15 21:49:26 +08:00
  • 2740aa9cbb add configurer hiyouga 2023-12-15 21:46:40 +08:00
  • 0716f5e470 refactor adapter hparam hiyouga 2023-12-15 20:53:11 +08:00
  • d4c351f1ec add loftq hiyouga 2023-12-14 21:53:56 +08:00
  • bfdee1608f fix valuehead model hiyouga 2023-12-14 20:15:20 +08:00
  • bf2d9c8feb Update wechat.jpg hoshi-hiyouga 2023-12-13 18:23:18 +08:00
  • 81167cd19d tiny fix hoshi-hiyouga 2023-12-13 17:32:36 +08:00
  • 9b0630f84f revert peft version hoshi-hiyouga 2023-12-13 10:49:45 +08:00
  • 573a12c86b update peft version hoshi-hiyouga 2023-12-13 10:23:51 +08:00
  • 6953096c9d tiny fix hoshi-hiyouga 2023-12-13 10:21:29 +08:00
  • 1fcd545c3d fix #1819 hoshi-hiyouga 2023-12-13 10:14:01 +08:00
  • 3a8a50d4d4 remove loftq hiyouga 2023-12-13 01:53:46 +08:00
  • 2c8e88f9c1 fix sharegpt loading hiyouga 2023-12-13 00:56:16 +08:00
  • 3552035d7e add model urls hiyouga 2023-12-13 00:09:17 +08:00
  • 28cc07868c update readme hiyouga 2023-12-12 23:30:29 +08:00
  • 6219dfbd93 support loftq hiyouga 2023-12-12 22:47:06 +08:00
  • ada0e536c9 fix #1795 hiyouga 2023-12-12 19:58:34 +08:00
  • 0a9c6e0146 support system column #1765 hiyouga 2023-12-12 19:45:59 +08:00
  • d5b2c57a35 fix modelscope data hub hiyouga 2023-12-12 18:33:06 +08:00