Commit Graph

  • 7159bc54ed add datasets hiyouga 2023-07-19 20:59:15 +08:00
  • 925a790bc9 fix #196 hiyouga 2023-07-19 17:35:38 +08:00
  • 8f7819fcaa fix #194 hiyouga 2023-07-19 17:07:33 +08:00
  • 7a3ade8c69 support LLaMA-2 hiyouga 2023-07-19 16:42:14 +08:00
  • 38eb1aaf55 add LLaMA2 template hiyouga 2023-07-19 00:44:49 +08:00
  • 29af67b015 fix API hiyouga 2023-07-19 00:01:14 +08:00
  • fe2887ca13 support dev set in web ui hiyouga 2023-07-18 20:40:49 +08:00
  • b447fa85aa add web demo hiyouga 2023-07-18 17:21:16 +08:00
  • bdf91846da update baichuan template hiyouga 2023-07-18 16:43:51 +08:00
  • d1ae428c6e fix template hiyouga 2023-07-18 16:37:23 +08:00
  • cadeac0f44 fix #176 hiyouga 2023-07-18 16:36:24 +08:00
  • 6f9360c0bd fix webUI, fix #171 #177 hiyouga 2023-07-18 15:51:48 +08:00
  • 12d8a8633f update webUI, fix #179 hiyouga 2023-07-18 15:35:17 +08:00
  • b9fe83fb75 tiny fix hiyouga 2023-07-18 00:52:31 +08:00
  • 262252d67b a monkey patch for lora_target hiyouga 2023-07-18 00:31:40 +08:00
  • f8193e8009 release v0.1.0 hiyouga 2023-07-18 00:18:25 +08:00
  • 85c2210452 fix #175 hiyouga 2023-07-17 18:07:17 +08:00
  • 1e1358431d fix saving custom code hiyouga 2023-07-16 18:04:41 +08:00
  • 2c867b9bb1 add custom baichuan-13B code supports left-padding hiyouga 2023-07-15 22:37:17 +08:00
  • 552d773dad fix callback hiyouga 2023-07-15 22:01:43 +08:00
  • 8528a84e74 update stream_chat hiyouga 2023-07-15 19:51:02 +08:00
  • 657cf0f55a create chat model hiyouga 2023-07-15 19:26:20 +08:00
  • d640c5545f Update callbacks.py hiyouga 2023-07-15 17:39:16 +08:00
  • 1e2b7e0c4b Update README.md hiyouga 2023-07-15 17:20:39 +08:00
  • 22d9a9c2af fix callback hiyouga 2023-07-15 17:18:16 +08:00
  • f751376613 modity code structure hiyouga 2023-07-15 16:54:28 +08:00
  • 2a0f1f8398 Update wechat.jpg hiyouga 2023-07-14 17:29:43 +08:00
  • c30db9f1f0 fix eval and pred loss hiyouga 2023-07-14 13:11:57 +08:00
  • a04115ec27 fix pretrain hiyouga 2023-07-13 23:41:54 +08:00
  • 08439d29b2 fix Baichuan-13B hiyouga 2023-07-13 23:08:45 +08:00
  • 8cd76ef3c3 Merge pull request #156 from ZhengJun-AI/main hoshi-hiyouga 2023-07-12 20:11:19 +08:00
  • 4955dc9eed Support for WebNovel dataset zxbsmk 2023-07-12 17:29:47 +08:00
  • 894f13e41f Merge pull request #145 from elicassion/patch-1 hoshi-hiyouga 2023-07-12 13:50:39 +08:00
  • dc1e8b7181 Fix typo in common.py Jinghuan Shang 2023-07-11 18:03:53 -04:00
  • b2f7cb4465 fix sft encode hiyouga 2023-07-11 19:50:33 +08:00
  • 1af031c02b add baichuan template hiyouga 2023-07-11 18:57:50 +08:00
  • f936a7af0b support Baichuan-13B hiyouga 2023-07-11 16:16:14 +08:00
  • 8447206bbc Update README.md hiyouga 2023-07-10 23:09:11 +08:00
  • 061c324972 Update wechat.jpg hiyouga 2023-07-10 18:41:53 +08:00
  • 4182c7aa8b Update README.md hiyouga 2023-07-09 14:57:13 +08:00
  • 84a06318d4 update api to match langchain hiyouga 2023-07-07 20:35:39 +08:00
  • 233f20864b Update README.md hiyouga 2023-07-07 12:06:28 +08:00
  • a2f507c562 support InternLM hiyouga 2023-07-07 11:02:28 +08:00
  • caa00d3ac2 fix rouge score hiyouga 2023-07-06 14:28:34 +08:00
  • 89c623e4bf update readme hiyouga 2023-07-05 23:03:58 +08:00
  • 4abd2485e1 fix streaming response in API hiyouga 2023-07-05 22:42:31 +08:00
  • e6603977f6 fix freeze tuning hiyouga 2023-07-05 21:18:28 +08:00
  • a2ba69183b fix bug in PPO stage hiyouga 2023-07-05 19:14:10 +08:00
  • 8e3540c62d fix compute dtype hiyouga 2023-07-05 15:13:00 +08:00
  • c136f362c1 support falcon model #72 hiyouga 2023-07-05 15:00:06 +08:00
  • 966b5c70fc Update wechat.jpg hiyouga 2023-07-05 00:22:22 +08:00
  • cac87fd553 fix bleu score hiyouga 2023-07-05 00:11:21 +08:00
  • 395ed1cf1b set use_cache before saving model hiyouga 2023-07-04 23:18:20 +08:00
  • 65e9ce2cdd fix seq2seq predictions hiyouga 2023-07-04 22:56:51 +08:00
  • cb26f78923 Merge pull request #119 from codemayq/main hoshi-hiyouga 2023-07-03 19:51:46 +08:00
  • d3b30ecde3 add the pre-built version of bitsandbytes library for windows user codemayq 2023-07-03 13:58:10 +08:00
  • 0db9d29111 Update auto_gptq.py hiyouga 2023-07-02 20:56:11 +08:00
  • cf6d57fd3e add autogptq hiyouga 2023-07-02 20:36:37 +08:00
  • b8e1f09a2e Update wechat.jpg hiyouga 2023-06-30 15:45:20 +08:00
  • 92fa515e97 fix typo hiyouga 2023-06-30 10:09:59 +08:00
  • 021b035c1e Update README.md hiyouga 2023-06-29 19:36:22 +08:00
  • f14bd729a8 rename evaluate.py hiyouga 2023-06-29 15:40:39 +08:00
  • 23a7266272 Update evaluate.py hiyouga 2023-06-29 15:40:03 +08:00
  • 70592035b8 Update README.md hiyouga 2023-06-29 15:37:19 +08:00
  • 3154fec979 add open assistant dataset hiyouga 2023-06-28 23:09:33 +08:00
  • 4d0fddba21 update loading logic hiyouga 2023-06-28 12:07:16 +08:00
  • 0a46313cca fix loading best model hiyouga 2023-06-28 01:55:12 +08:00
  • 7826a8ca77 fix RM accuracy hiyouga 2023-06-28 01:40:13 +08:00
  • 9cb1af71f3 add star history hiyouga 2023-06-27 23:56:29 +08:00
  • 450910c1db tiny fix hiyouga 2023-06-27 23:54:24 +08:00
  • 18f87c1b25 fix initializing data arguments hiyouga 2023-06-27 22:50:23 +08:00
  • 2e01abfda5 support save full model, replace BOS token hiyouga 2023-06-27 21:40:11 +08:00
  • 1c732e2537 fix decoding in seq2seq hiyouga 2023-06-27 19:33:08 +08:00
  • 33f2141507 Update wechat.jpg hiyouga 2023-06-27 16:41:09 +08:00
  • 4f3772b342 Update evaluate.py hiyouga 2023-06-26 23:41:33 +08:00
  • 5a0a9daf74 Create evaluate.py hiyouga 2023-06-26 23:30:18 +08:00
  • 907e065454 Merge pull request #86 from Jingsong-Yan/main hoshi-hiyouga 2023-06-26 20:14:40 +08:00
  • 90bb5b6f37 Update README.md with baichuan-7b-rtx3090 Jingsong-Yan 2023-06-26 19:45:41 +08:00
  • 993cabdd4c Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning hiyouga 2023-06-26 18:07:09 +08:00
  • 1175948029 fix generation in seq2seq.py hiyouga 2023-06-26 18:07:06 +08:00
  • 95b057f5af Merge pull request #84 from wu-yy/patch-1 hoshi-hiyouga 2023-06-26 15:39:08 +08:00
  • e2a16d549e Update requirements.txt 蓝鲸123 2023-06-26 15:36:19 +08:00
  • cec9760eb8 support prefixes, loading multiple local files hiyouga 2023-06-26 15:32:40 +08:00
  • f030b09924 update api hiyouga 2023-06-26 13:39:57 +08:00
  • d21cc71750 Update wechat.jpg hiyouga 2023-06-25 23:41:11 +08:00
  • 0697643358 update readme hiyouga 2023-06-23 00:17:05 +08:00
  • 614d3a996c update API hiyouga 2023-06-22 20:46:24 +08:00
  • 76ecb8c222 match api with OpenAI format hiyouga 2023-06-22 20:27:00 +08:00
  • 9324940b76 Merge pull request #68 from mMrBun/main hoshi-hiyouga 2023-06-22 15:52:34 +08:00
  • 6e4db0903f Compatible with OpenAI API. Bun 2023-06-21 14:45:04 +08:00
  • ded5aa3c3d Update wechat.jpg hiyouga 2023-06-19 19:46:04 +08:00
  • f621f7631a add default template hiyouga 2023-06-16 21:12:17 +08:00
  • 334d1a6d26 add belle multiturn dataset hiyouga 2023-06-16 20:01:16 +08:00
  • a6c4b141cd fix freeze layers hiyouga 2023-06-16 17:38:21 +08:00
  • fc4d8155b3 add source prefix hiyouga 2023-06-16 16:32:17 +08:00
  • 0574b590ef support loading lora from hub hiyouga 2023-06-16 00:02:17 +08:00
  • 0cee6ad67f support baichuan model hiyouga 2023-06-15 16:02:01 +08:00
  • c527399424 fix bug in template vanilla hiyouga 2023-06-15 14:36:55 +08:00
  • 0a36658bb6 Update wechat.jpg hiyouga 2023-06-15 13:48:53 +08:00
  • d668f8b501 add BOS token in pre-training hiyouga 2023-06-15 01:46:17 +08:00