Commit Graph

  • 8b588c7224 fix #5307 hiyouga 2024-08-30 02:45:40 +08:00
  • 3382317e32 refactor mm training hiyouga 2024-08-30 02:14:31 +08:00
  • 727e184840 Merge pull request #5290 from simonJJJ/qwen2_vl hoshi-hiyouga 2024-08-30 02:10:36 +08:00
  • a8f22d8895 fix bug hoshi-hiyouga 2024-08-30 02:05:26 +08:00
  • a7dd7d325e update liger kernel hiyouga 2024-08-29 20:46:08 +08:00
  • aa1afdc756 fix #5292 hiyouga 2024-08-29 20:37:47 +08:00
  • ad72f3e065 fix #5295 hiyouga 2024-08-29 20:30:18 +08:00
  • 364b757e30 fix #5305 hiyouga 2024-08-29 20:16:01 +08:00
  • 734e019cc1 update simonJJJ 2024-08-28 20:22:46 +08:00
  • aeb85f200b initial-commit simonJJJ 2024-08-28 16:51:35 +08:00
  • 0f5a0f64f7 update wechat hiyouga 2024-08-27 12:55:23 +08:00
  • d14edd350d add extra requires hiyouga 2024-08-27 12:52:12 +08:00
  • f6ae4e75dd tiny fix hiyouga 2024-08-27 12:49:32 +08:00
  • dbe886ae5c Merge pull request #5237 from marko1616/patch-1 hoshi-hiyouga 2024-08-27 12:24:43 +08:00
  • df8d5b6985 ruff pass. marko1616 2024-08-27 11:30:16 +08:00
  • 1545684c3f Update chat.py marko1616 2024-08-27 11:27:56 +08:00
  • 72bc8f0111 support liger kernel hiyouga 2024-08-27 11:20:14 +08:00
  • 3a28521710 Force re check. marko1616 2024-08-23 14:43:18 +08:00
  • 8eb2092921 Update chat.py marko1616 2024-08-22 12:24:34 +08:00
  • a4f1de9d82 Update chat.py marko1616 2024-08-22 12:14:34 +08:00
  • 36039b0fe0 Merge pull request #5230 from MengqingCao/image hoshi-hiyouga 2024-08-21 22:13:07 +08:00
  • 8907150c1b update wechat hiyouga 2024-08-21 22:07:34 +08:00
  • b3f4acd1b4 update npu base image MengqingCao 2024-08-21 09:12:38 +00:00
  • c8b4c7fee5 tiny fix hiyouga 2024-08-20 00:10:52 +08:00
  • 15be296347 Merge pull request #5156 from YeQiuO/main hoshi-hiyouga 2024-08-20 00:09:03 +08:00
  • ec72eeca52 Update template.py hoshi-hiyouga 2024-08-20 00:03:33 +08:00
  • da335d42c3 Merge pull request #5163 from liu-zichen/fix_ppo_optim hoshi-hiyouga 2024-08-19 23:56:24 +08:00
  • f59c9bef31 Merge pull request #5185 from chenhuiyu/feature/add-sailorllm-template hoshi-hiyouga 2024-08-19 23:51:49 +08:00
  • d39f4a62d3 Merge pull request #5188 from Zxilly/main hoshi-hiyouga 2024-08-19 23:51:39 +08:00
  • 5d5bfc83e6 Merge pull request #5193 from Ricardo-L-C/main hoshi-hiyouga 2024-08-19 23:40:59 +08:00
  • 5f3300ec5d Update template.py hoshi-hiyouga 2024-08-19 23:40:16 +08:00
  • 3804ddec9e update readme hiyouga 2024-08-19 23:32:04 +08:00
  • 384ab8db84 _is_bf16_available judgment supports npu Ricardo 2024-08-16 02:58:22 +00:00
  • dc36fcc3de fix: report correct device count for intel xpu Zxilly 2024-08-15 08:30:43 +00:00
  • 2502833a77 Add SailorLLM template Huiyu Chen 2024-08-15 15:10:14 +08:00
  • ddee718b31 fix lr not change liu-zichen 2024-08-13 16:33:34 +08:00
  • 625a0e32c4 add tutorial and doc links codingma 2024-08-13 16:13:10 +08:00
  • 5b9d99ebc6 update wechat.jpg codingma 2024-08-13 16:12:36 +08:00
  • bcbbf45063 fix Llama-template's system prompt bug “Wzw” 2024-08-12 19:22:12 +08:00
  • c93d55bfb0 update readme hiyouga 2024-08-10 10:17:35 +08:00
  • 576a894f77 update readme hiyouga 2024-08-09 20:46:02 +08:00
  • c75b5b83c4 add magpie ultra dataset hiyouga 2024-08-09 20:28:55 +08:00
  • dc770efb14 add qwen2 math models hiyouga 2024-08-09 20:20:35 +08:00
  • 0a690ada6f update examples hiyouga 2024-08-09 20:13:46 +08:00
  • e2a28f51c6 add adam_mini to readme hiyouga 2024-08-09 20:02:03 +08:00
  • ef482394f0 Merge pull request #5095 from relic-yuexi/feat-optimizer hoshi-hiyouga 2024-08-09 19:51:33 +08:00
  • 86f7099fa3 update scripts hiyouga 2024-08-09 19:16:23 +08:00
  • c87023d539 follow #5115 hiyouga 2024-08-09 18:03:00 +08:00
  • 51542cb15f Merge pull request #5115 from YeQiuO/main hoshi-hiyouga 2024-08-09 17:58:27 +08:00
  • 984961c550 Merge pull request #5072 from relic-yuexi/main hoshi-hiyouga 2024-08-09 16:35:21 +08:00
  • 4f62e1cb24 Update template.py hoshi-hiyouga 2024-08-09 16:27:42 +08:00
  • 2fa1e0b2ad mask_history args verify valid “Wzw” 2024-08-08 10:12:01 +08:00
  • b5ca86cc07 fix mask_history tiny bug “Wzw” 2024-08-08 10:09:33 +08:00
  • 18e455c232 Merge pull request #5109 from codemayq/fix-example codingma 2024-08-07 18:30:05 +08:00
  • 9a48f7e957 update wechat.jpg codingma 2024-08-07 18:29:48 +08:00
  • 823e7c122b fix eval_dataset in example codingma 2024-08-07 18:24:19 +08:00
  • 82bc15dc79 feat: add support for adammini moontidef 2024-08-07 10:08:22 +08:00
  • 40908a36fa fix: rename optimzer to optimizer moontidef 2024-08-07 10:05:01 +08:00
  • 55f32dfbf9 Merge branch 'hiyouga:main' into main moontidef 2024-08-06 00:18:45 +08:00
  • b82ecbedd0 fix: fix the deepseekcoder template to avoid repeat problem moontidef 2024-08-05 23:55:45 +08:00
  • b7ca6c8dc1 fix #5048 hiyouga 2024-08-05 23:48:19 +08:00
  • c2921b9960 Merge pull request #5037 from codemayq/feature-gemma-2-2b hoshi-hiyouga 2024-08-05 23:27:37 +08:00
  • dc09d454f2 support gemma-2-2b codingma 2024-08-01 13:45:48 +08:00
  • 1c05b847b2 update wechat.jpg codingma 2024-08-01 09:51:47 +08:00
  • 3885949a9d update wechat_npu.jpg codingma 2024-07-30 13:45:47 +08:00
  • cd420c1938 Merge pull request #5010 from Eruly/main hoshi-hiyouga 2024-07-30 01:55:54 +08:00
  • 06e17eb462 Merge pull request #4996 from LDLINGLINGLING/main hoshi-hiyouga 2024-07-30 01:55:30 +08:00
  • 3a49c76b65 Update README_zh.md hoshi-hiyouga 2024-07-30 01:55:13 +08:00
  • 9e409eadb0 Update README.md hoshi-hiyouga 2024-07-30 01:53:19 +08:00
  • 8d5a41f2cd Update README.md hoshi-hiyouga 2024-07-30 01:52:35 +08:00
  • daa62db06f Merge pull request #4995 from codemayq/fix-pissa hoshi-hiyouga 2024-07-30 01:47:25 +08:00
  • 371009e522 Add Korean web UI (llamafactory-cli webui) eruly 2024-07-29 13:47:13 +00:00
  • b9ed9d45cc 增加了MiniCPM在页面首页的支持列表,MiniCPM官方github也放了LLama_factory的友情链接 liudan 2024-07-29 10:58:28 +08:00
  • 2c1ca9f742 fix pissa save codingma 2024-07-29 10:44:34 +08:00
  • 668654b5ad tiny fix hiyouga 2024-07-26 11:51:00 +08:00
  • 8a2846cfe1 Merge pull request #4892 from piamo/main hoshi-hiyouga 2024-07-26 11:49:34 +08:00
  • 9839c6d1f6 Merge pull request #4950 from liuwwang/main and fi hoshi-hiyouga 2024-07-26 11:48:56 +08:00
  • b8896b9b8b Merge pull request #4970 from HardAndHeavy/add-rocm hoshi-hiyouga 2024-07-26 11:41:23 +08:00
  • 3c424cf69a Merge pull request #4961 from khazic/main hoshi-hiyouga 2024-07-26 11:32:29 +08:00
  • 77e7bfee79 Update README_zh.md hoshi-hiyouga 2024-07-26 11:30:57 +08:00
  • 1186ad53d4 Update README.md hoshi-hiyouga 2024-07-26 11:29:28 +08:00
  • f97beca23a Update README.md hoshi-hiyouga 2024-07-26 11:29:09 +08:00
  • 024c49d4e0 update wechat.jpg codemayq 2024-07-26 10:01:10 +08:00
  • c8e18a669a Add ROCm support HardAndHeavy 2024-07-25 21:29:28 +03:00
  • ceba96f9ed Added the reference address for TRL PPO details. khazic 2024-07-25 09:03:21 +08:00
  • 77cff78863 fix #4959 hiyouga 2024-07-24 23:44:00 +08:00
  • 30f8149d11 update webui hiyouga 2024-07-24 21:11:51 +08:00
  • 71d3e60713 Update README_zh.md hoshi-hiyouga 2024-07-24 21:08:42 +08:00
  • 5626bdc56d Update README.md hoshi-hiyouga 2024-07-24 21:07:14 +08:00
  • ace1d44857 tiny fix hiyouga 2024-07-24 18:33:39 +08:00
  • 091010492b fix #4928 hiyouga 2024-07-24 17:00:29 +08:00
  • 935b22d93e fix #4925 hiyouga 2024-07-24 16:56:58 +08:00
  • 1bbd49faae fix #4944 hiyouga 2024-07-24 16:42:51 +08:00
  • 1550fe7331 add mistral nemo model hiyouga 2024-07-24 16:25:53 +08:00
  • 26533c0604 add llama3.1 hiyouga 2024-07-24 16:20:11 +08:00
  • f91a9a250a fix: Repair the issue where quantization failed after merging the adapter. Liuww 2024-07-24 14:31:29 +08:00
  • bb0a37dc06 Update wechat_npu.jpg hiyouga 2024-07-22 21:17:22 +08:00
  • 5665062ca0 tiny fix hiyouga 2024-07-22 21:10:15 +08:00
  • 26082fc6c9 fix #4917 hoshi-hiyouga 2024-07-22 11:28:31 +08:00
  • c333e2f49d tiny fix hiyouga 2024-07-22 00:06:03 +08:00