Commit Graph

  • dc7ff90c1e Add support for German datasets Johann-Peter Hartmann 2024-01-30 10:18:01 +01:00
  • 1ace676170 fix #2320 hiyouga 2024-01-24 16:19:18 +08:00
  • 8947a87b95 Merge pull request #2319 from ftgreat/main hoshi-hiyouga 2024-01-24 15:32:26 +08:00
  • 786a2f1103 Add patch_mixtral_replace_moe_impl for full training Mixtral using DeepSpeed Zero3. ldwang 2024-01-24 15:25:31 +08:00
  • 36ac14a566 Add patch_mixtral_replace_moe_impl for full training Mixtral using DeepSpeed Zero3. ldwang 2024-01-24 14:43:16 +08:00
  • 7a048fc91d add hint hiyouga 2024-01-22 23:32:01 +08:00
  • 3f3756b113 Merge pull request #2283 from A-Cepheus/main hoshi-hiyouga 2024-01-22 23:28:45 +08:00
  • b36c4b99cc Update patcher.py hoshi-hiyouga 2024-01-22 23:27:39 +08:00
  • 9856a2276e Update tests.yml hoshi-hiyouga 2024-01-22 23:22:15 +08:00
  • b6dc3ed3ad Create tests.yml hoshi-hiyouga 2024-01-22 23:13:04 +08:00
  • 75be329994 fix #2282 and update tool prompt hiyouga 2024-01-22 22:27:30 +08:00
  • 1fe1ca1c8b add orion models hiyouga 2024-01-22 21:26:53 +08:00
  • 882a6a1d51 🐞 fix: typo A-Cepheus 2024-01-22 16:04:39 +08:00
  • 712ab4ae7a 🐞 fix: typo, move MoE fix to patcher A-Cepheus 2024-01-22 16:01:58 +08:00
  • 18ad259fb3 fix: ZeRO3 does not work with MoE models Former-commit-id: b2844c049a88ea89f8e1812e2d2e8662b4002965 A-Cepheus 2024-01-22 15:21:14 +08:00
  • fe4d93c6db add array param format hiyouga 2024-01-21 22:17:48 +08:00
  • c6ba588e37 update tool test hiyouga 2024-01-21 19:41:46 +08:00
  • 3fda60fca0 fix api hiyouga 2024-01-21 19:15:27 +08:00
  • 96531a0ef8 fix #2268 hiyouga 2024-01-21 14:11:38 +08:00
  • 7abc3065fb tiny fix hiyouga 2024-01-21 13:26:12 +08:00
  • 013ded4bac Merge pull request #2266 from yhyu13/fix_export_model_dtype hoshi-hiyouga 2024-01-21 12:40:39 +08:00
  • 010c3c7348 Merge branch 'main' into fix_export_model_dtype hoshi-hiyouga 2024-01-21 12:40:24 +08:00
  • bf075c075c Update tuner.py hoshi-hiyouga 2024-01-21 12:39:38 +08:00
  • 41b34e5f60 Merge pull request #2262 from fenglui/main hoshi-hiyouga 2024-01-21 12:34:37 +08:00
  • 5a889398e7 format hiyouga 2024-01-21 12:34:17 +08:00
  • 054cae86d8 Merge pull request #2264 from seoeaa/main hoshi-hiyouga 2024-01-21 12:25:24 +08:00
  • cd1cb8b83c Remove manually set use_cache; torch_dtype is not str, save model as bfloat16 used to fail; yhyu13 2024-01-21 11:12:15 +08:00
  • a34779c027 add Russian lang Aleksandr 2024-01-21 04:28:14 +03:00
  • d19cb77d74 fix torch_dtype check of export_model fenglui 2024-01-21 05:01:53 +08:00
  • ab67528e89 release v0.5.0 (real) v0.5.0 hiyouga 2024-01-21 01:54:49 +08:00
  • 27f281480a finish agent hiyouga 2024-01-21 01:47:33 +08:00
  • 50459a39f4 fix api hiyouga 2024-01-21 00:03:09 +08:00
  • 5c9815ef6f fix internlm2 template hiyouga 2024-01-20 23:33:50 +08:00
  • aed00a97b6 fix cli_demo hiyouga 2024-01-20 23:27:10 +08:00
  • 7543dc4a9d fix #2260 hiyouga 2024-01-20 23:22:09 +08:00
  • 841fa0030f release v0.5.0 hiyouga 2024-01-20 20:21:39 +08:00
  • 66e0e651b9 format style hiyouga 2024-01-20 20:15:56 +08:00
  • 1750218057 fix tests hiyouga 2024-01-20 19:58:04 +08:00
  • 80637fc06d support longlora for main branch hiyouga 2024-01-20 19:25:22 +08:00
  • 8efc055511 Merge pull request #2201 from liu-zichen/token_embed_resize hoshi-hiyouga 2024-01-20 17:45:38 +08:00
  • be61bfda93 add upcast_lmhead option hiyouga 2024-01-19 23:54:25 +08:00
  • 1a39f529c0 set use_reentrant=False hiyouga 2024-01-19 23:29:54 +08:00
  • 0868d5c550 fix #2249 hiyouga 2024-01-19 21:44:32 +08:00
  • 384f0e7678 add bf16 lora option hiyouga 2024-01-19 16:29:03 +08:00
  • 9b390c4bea fix function formatter hiyouga 2024-01-18 16:01:07 +08:00
  • 42a13fec46 Update tuner.py hiyouga 2024-01-18 15:06:02 +08:00
  • 790acc4c17 fix templates hiyouga 2024-01-18 14:49:52 +08:00
  • b74cf27538 fix rm dataset hiyouga 2024-01-18 14:45:37 +08:00
  • ffc874ec6f fix pretrain data loader hiyouga 2024-01-18 14:42:52 +08:00
  • 546d6bd0b2 Merge pull request #2226 from hiyouga/dev hoshi-hiyouga 2024-01-18 14:31:28 +08:00
  • 8b68ca029e update readme hiyouga 2024-01-18 14:30:48 +08:00
  • 502f84b30c add tool hint hiyouga 2024-01-18 13:19:09 +08:00
  • b7df920860 fix dataset hiyouga 2024-01-18 12:59:30 +08:00
  • e4a424cb6a enable cutoff len hiyouga 2024-01-18 12:25:42 +08:00
  • d8affd3967 add tool test hiyouga 2024-01-18 10:26:26 +08:00
  • a423274fd9 support function calling hiyouga 2024-01-18 09:54:23 +08:00
  • f7329b1a0e Update llamafy_internlm2.py hiyouga 2024-01-18 01:12:31 +08:00
  • 48eb07c956 Update llamafy_internlm2.py hiyouga 2024-01-18 01:00:16 +08:00
  • 636d8a886c Update llamafy_internlm2.py hiyouga 2024-01-18 00:49:31 +08:00
  • 97b52c7fdf fix llamafy scripts hiyouga 2024-01-18 00:37:37 +08:00
  • 344412e66e fix llamafy_internlm2 hiyouga 2024-01-18 00:26:14 +08:00
  • 5cdea14cdf add llamafy_internlm2 hiyouga 2024-01-18 00:17:41 +08:00
  • 7b1a56b96f support export push_to_hub #2183 hiyouga 2024-01-16 23:59:42 +08:00
  • d1ec884e75 fix #2195 hiyouga 2024-01-16 23:53:50 +08:00
  • aa72a4349e support resize embed for zero3 liuzc 2024-01-16 15:16:20 +08:00
  • 5ab7fd0842 tiny fix hiyouga 2024-01-15 23:34:23 +08:00
  • 86d5e9802a Merge pull request #2194 from junuMoon/patch-1 hoshi-hiyouga 2024-01-15 20:21:28 +08:00
  • 18df39e3a1 fix: typo on README.md Former-commit-id: 372066b559305a1428c88fbd6b01e332bfd5e3e1 Junu Moon(Fran) 2024-01-15 19:50:35 +09:00
  • cfe1e24471 support solar 10.7B #1907 hiyouga 2024-01-14 00:30:30 +08:00
  • 2edbe87a8c Update README_zh.md hiyouga 2024-01-14 00:17:28 +08:00
  • 880055bc90 support deepseek moe hiyouga 2024-01-14 00:14:49 +08:00
  • ad99bd0a14 fix phi modules hiyouga 2024-01-13 23:12:47 +08:00
  • c5f099138d fix #2147 hiyouga 2024-01-12 03:30:56 +08:00
  • 6e64e02f71 fix #2164 hiyouga 2024-01-12 00:27:57 +08:00
  • f95f6ec009 Merge pull request #2163 from JessyTsu1/main hoshi-hiyouga 2024-01-11 23:33:29 +08:00
  • 8aeecc20e1 Update README.md JessyTsu1 2024-01-11 23:18:29 +08:00
  • 38d0f6c63f Update README_zh.md JessyTsu1 2024-01-11 23:17:48 +08:00
  • ac8534a9e7 Update README.md JessyTsu1 2024-01-11 23:17:00 +08:00
  • 73cab9d9d4 fix #2161 hiyouga 2024-01-11 17:04:13 +08:00
  • 64246d42d2 improve web ui hiyouga 2024-01-10 12:37:45 +08:00
  • 6fa6d4532e improve model export hiyouga 2024-01-09 22:26:24 +08:00
  • 92b9956c06 modify weight name hiyouga 2024-01-09 20:22:47 +08:00
  • 4d6669c268 fix #1789 hiyouga 2024-01-09 18:31:27 +08:00
  • 89f4ae51f9 fix #2127 hiyouga 2024-01-09 14:49:13 +08:00
  • af0659f573 fix #2125 hiyouga 2024-01-08 21:42:25 +08:00
  • 45a10d501e Merge pull request #2117 from dasdristanta13/main hoshi-hiyouga 2024-01-07 23:56:53 +08:00
  • e529ff1245 Update requirements.txt With einops dependency Dristanta Das 2024-01-07 21:03:30 +05:30
  • b29371dc87 tiny fix hiyouga 2024-01-07 17:17:18 +08:00
  • 0bef890000 fix api server hiyouga 2024-01-07 17:14:42 +08:00
  • 75fe1404b1 improve model export hiyouga 2024-01-05 18:51:49 +08:00
  • b460c9372f fix #2098 hiyouga 2024-01-05 17:11:26 +08:00
  • c3e574ceaa fix qwen template hiyouga 2024-01-05 16:14:56 +08:00
  • 04ae80a52e fix #2081 hiyouga 2024-01-04 23:19:08 +08:00
  • a7ff095399 fix #2090 hiyouga 2024-01-04 23:05:08 +08:00
  • a655dcebaf fix #2067 hiyouga 2024-01-04 22:53:03 +08:00
  • 8c74851b70 fix dispatch hiyouga 2024-01-03 16:33:16 +08:00
  • 7168392a51 fix valuehead patch hiyouga 2024-01-03 16:19:23 +08:00
  • ccc5b324fe fix rm server hiyouga 2024-01-03 15:30:46 +08:00
  • e85c205a81 fix #2014 hiyouga 2023-12-29 15:17:22 +08:00
  • 7e225be16e add yuan model hiyouga 2023-12-29 13:50:24 +08:00