Commit Graph

  • 382319915c Merge pull request #1802 from tastelikefeet/feat/support_ms hoshi-hiyouga 2023-12-12 17:58:37 +08:00
  • 6382efec52 Merge branch 'main' into feat/support_ms hoshi-hiyouga 2023-12-12 17:55:32 +08:00
  • e6ddebd3ae fix webui hiyouga 2023-12-12 15:27:40 +08:00
  • e80a989d49 modify guanaco xingjun.wang 2023-12-12 15:00:37 +08:00
  • 73b50a26b9 update dataset info xingjun.wang 2023-12-12 14:53:59 +08:00
  • adc98c86da add use_streaming xingjun.wang 2023-12-12 14:23:05 +08:00
  • 1909f0d117 fix cache dir xingjun.wang 2023-12-12 14:21:33 +08:00
  • 168321a4da add print info for test xingjun.wang 2023-12-12 14:14:40 +08:00
  • edc82b923a update cache dir xingjun.wang 2023-12-12 13:08:18 +08:00
  • 09533e95ed update args for MsDataset.load xingjun.wang 2023-12-12 13:02:54 +08:00
  • fe4acc66b0 add new datasets xingjun.wang 2023-12-12 12:44:15 +08:00
  • 0ce18a3782 add open orca xingjun.wang 2023-12-12 12:34:04 +08:00
  • cfba1009d0 update xingjun.wang 2023-12-12 12:03:23 +08:00
  • 5b979147f0 for test xingjun.wang 2023-12-12 11:52:59 +08:00
  • 8a908a8c64 for test xingjun.wang 2023-12-12 11:47:59 +08:00
  • 8cace77808 update readme hiyouga 2023-12-12 11:44:30 +08:00
  • 96380f5e18 support mixtral hiyouga 2023-12-12 11:39:04 +08:00
  • f4657de7d5 fix baichuan resize hiyouga 2023-12-11 20:55:50 +08:00
  • 0239d29fa0 tiny fix hiyouga 2023-12-11 18:09:40 +08:00
  • 64744dde89 support resize embeddings #1786 hiyouga 2023-12-11 17:50:02 +08:00
  • 9ce1b0e2f2 use peft 0.7.0, fix #1561 #1764 hiyouga 2023-12-11 17:13:40 +08:00
  • 28d5de7e78 fix #1784 hiyouga 2023-12-09 20:53:18 +08:00
  • e4cf2a75ca fix typo yuze.zyz 2023-12-08 18:13:26 +08:00
  • 9c2247d700 support ms dataset yuze.zyz 2023-12-08 18:00:57 +08:00
  • d42c0b1d34 fix #1771 and temporarily fix #1764 hiyouga 2023-12-08 16:26:20 +08:00
  • 3378337b1a Update wechat.jpg hiyouga 2023-12-07 22:05:04 +08:00
  • d60cf551a1 Update wechat.jpg hiyouga 2023-12-06 17:10:02 +08:00
  • e25f7bae16 add models hiyouga 2023-12-06 13:33:18 +08:00
  • d3dccd0693 fix ppo trainer save logic hiyouga 2023-12-04 19:00:19 +08:00
  • 997b65f291 update readme hiyouga 2023-12-04 11:22:01 +08:00
  • 8ede3128df update readme hiyouga 2023-12-04 11:02:29 +08:00
  • c9b166615c fix #1715 hiyouga 2023-12-03 22:35:47 +08:00
  • 438dea679b release v0.3.3 hiyouga 2023-12-03 21:59:45 +08:00
  • 8b681ee273 fix bug hiyouga 2023-12-03 21:40:40 +08:00
  • 747db40172 ppo support rm server hiyouga 2023-12-03 21:38:51 +08:00
  • 7df4f3ab20 implement rm server #1543 hiyouga 2023-12-03 20:52:54 +08:00
  • 03d05991f8 fix #1707 #1710 hiyouga 2023-12-03 11:33:12 +08:00
  • 5b78e269b6 add logo hiyouga 2023-12-02 01:31:24 +08:00
  • b69763ff92 fix #1642 hiyouga 2023-12-02 00:37:53 +08:00
  • 6e7af11b98 add xuanyuan models hiyouga 2023-12-02 00:35:29 +08:00
  • f57445c7a0 fix gptq training hiyouga 2023-12-02 00:27:15 +08:00
  • a973ce6e89 tiny fix hiyouga 2023-12-01 23:37:10 +08:00
  • 01e6c539b0 fix gptq model inference hiyouga 2023-12-01 23:34:14 +08:00
  • 0cb260f453 update readme hiyouga 2023-12-01 22:58:29 +08:00
  • 662d9a3a4e fix #1703 hiyouga 2023-12-01 22:55:41 +08:00
  • bd42c229b0 patch modelscope hiyouga 2023-12-01 22:53:15 +08:00
  • 3a64506031 Merge pull request #1700 from tastelikefeet/feat/support_ms hoshi-hiyouga 2023-12-01 20:25:18 +08:00
  • 00f5c9ee16 Merge branch 'main' into feat/support_ms hoshi-hiyouga 2023-12-01 20:23:46 +08:00
  • 5a2392f105 remove useless code yuze.zyz 2023-12-01 17:28:23 +08:00
  • d9e52957e2 fix bug tastelikefeet 2023-12-01 17:27:00 +08:00
  • a5a248d569 fix err hint hiyouga 2023-12-01 17:13:22 +08:00
  • a51b8ec620 add err hint hiyouga 2023-12-01 17:04:37 +08:00
  • aec946b119 Merge pull request #1699 from Samge0/patch-1 hoshi-hiyouga 2023-12-01 16:52:57 +08:00
  • 7cabb9903d Update .gitignore SamgeShao 2023-12-01 16:37:41 +08:00
  • 5aa6751e52 add readme yuze.zyz 2023-12-01 16:11:30 +08:00
  • e597d3c084 tiny fix hiyouga 2023-12-01 15:58:50 +08:00
  • fbc6220692 Merge pull request #1695 from Samge0/dev hoshi-hiyouga 2023-12-01 15:56:18 +08:00
  • d043a4e7ba Merge pull request #1690 from billvsme/main hoshi-hiyouga 2023-12-01 15:44:35 +08:00
  • bf6f6aeefe fix #1696 hiyouga 2023-12-01 15:34:50 +08:00
  • 8ce4d11e38 add model tastelikefeet 2023-12-01 15:06:17 +08:00
  • a0fde6e421 Merge pull request #1689 from mlinmg/patch-2 hoshi-hiyouga 2023-12-01 14:29:36 +08:00
  • 421d4de604 Improve:"CUDA_VISIBLE_DEVICES" read from the env samge 2023-12-01 11:35:02 +08:00
  • 9468ee9012 Update dataset_info.json Marco 2023-11-30 16:21:34 +01:00
  • 40dfcbc3d4 improve get_current_device billvsme 2023-11-30 22:40:35 +08:00
  • 327d7f7efe fix #1597 hiyouga 2023-11-30 21:47:06 +08:00
  • 1585962eb7 fix #1668 hiyouga 2023-11-30 21:02:00 +08:00
  • a38dbf55e3 fix #1682 hiyouga 2023-11-30 20:03:32 +08:00
  • 509abe8864 add models hiyouga 2023-11-30 19:16:13 +08:00
  • fb2204c183 fix yuze.zyz 2023-11-29 21:43:58 +08:00
  • d38a2e7341 support ms yuze.zyz 2023-11-29 20:36:55 +08:00
  • 9d38e5687d add gpu requirement #1657 hiyouga 2023-11-29 12:05:03 +08:00
  • 77d1b14fc2 fix #1658 hiyouga 2023-11-28 20:57:24 +08:00
  • 475a3fa0f4 fix #1659 hiyouga 2023-11-28 20:52:28 +08:00
  • c2d4300ac4 Update wechat.jpg hiyouga 2023-11-28 17:27:23 +08:00
  • 859a6ea942 support export size setting hiyouga 2023-11-26 18:34:09 +08:00
  • ff1c289229 support Yi-34B-Chat models hiyouga 2023-11-23 19:31:49 +08:00
  • 5085b00a1d update readme hiyouga 2023-11-21 13:15:46 +08:00
  • 35c2da3eba set version hiyouga 2023-11-20 22:57:44 +08:00
  • 9ea9380145 support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569 hiyouga 2023-11-20 22:52:11 +08:00
  • 5021062493 update ppo trainer hiyouga 2023-11-20 21:39:15 +08:00
  • 48211e3799 Merge pull request #1553 from hannlp/hans hoshi-hiyouga 2023-11-20 20:32:55 +08:00
  • 2a36fd5064 fix value head model resuming hiyouga 2023-11-20 19:01:37 +08:00
  • 99a3f06377 fix #1567 hiyouga 2023-11-20 18:46:36 +08:00
  • 00baaa990e better data streaming hiyouga 2023-11-19 23:32:47 +08:00
  • 211b2db5a8 fix model card network issue hiyouga 2023-11-19 23:03:19 +08:00
  • bfb9433165 fix Mistral template hiyouga 2023-11-19 16:29:30 +08:00
  • 065bfaeed4 fix #1263 hiyouga 2023-11-19 16:05:18 +08:00
  • 1740131d63 fix #1558 hiyouga 2023-11-19 14:15:47 +08:00
  • ff6056405d fix evaluator and cached_file in 4.31.0 hiyouga 2023-11-18 19:39:23 +08:00
  • a2019c8b61 update benchmark hiyouga 2023-11-18 11:30:01 +08:00
  • 90212280d6 update readme hiyouga 2023-11-18 11:15:56 +08:00
  • 329134f58c add benchmark hiyouga 2023-11-18 11:09:52 +08:00
  • 7b1aa6f63c update dataset hiyouga 2023-11-17 23:19:12 +08:00
  • ccb0f58e22 fix quantization hiyouga 2023-11-17 22:21:29 +08:00
  • 1bbc1be95e fix #1550 hiyouga 2023-11-17 17:23:13 +08:00
  • 7cab47b822 Update README_zh.md Yuchen Han 2023-11-17 00:18:07 -08:00
  • c9b499fa7e Update README.md Yuchen Han 2023-11-17 00:17:36 -08:00
  • eeb5249d0b Update workflow.py Yuchen Han 2023-11-17 00:16:27 -08:00
  • b24635d22b Update finetuning_args.py Yuchen Han 2023-11-17 00:15:51 -08:00
  • 999bc0ed93 fix packages hiyouga 2023-11-17 16:11:48 +08:00