Xue Yadong
|
d3ebd5678d
|
[model] support GLM-OCR SFT (#10183)
|
2026-02-10 21:41:01 +08:00 |
|
Shanay Mehta
|
ea644d04ec
|
[model] support GLM-4.7-Flash SFT (#10173)
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
|
2026-02-09 10:40:44 +08:00 |
|
Hertz
|
8bedfafa4e
|
[model] support MiniCPM-o-4.5 (#10163)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2026-02-04 23:21:27 +08:00 |
|
Yaowei Zheng
|
1a02717fa8
|
[assets] update readme (#10159)
|
2026-02-03 19:11:15 +08:00 |
|
ゆり
|
e7cb145f5d
|
[logging] Fix race condition in LoggerHandler during multi-GPU training (#10156)
Co-authored-by: yurekami <yurekami@users.noreply.github.com>
|
2026-02-03 11:14:07 +08:00 |
|
Hertz
|
b53d7037c2
|
[model] support youtu-vl model (#10152)
|
2026-02-02 21:42:43 +08:00 |
|
浮梦
|
bf04ca6af8
|
[deps] adapt to transformers v5 (#10147)
Co-authored-by: frozenleaves <frozen@Mac.local>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2026-02-02 12:07:19 +08:00 |
|
xvxuopop
|
762b480131
|
[feature] support using ray.remote to start distributed training. (#10109)
|
2026-01-28 16:05:29 +08:00 |
|
Kingsley
|
db2f794f7b
|
[misc] update mcore related docker and mca supported models (#10114)
|
2026-01-19 14:55:16 +08:00 |
|
Hertz
|
4d3621e3d3
|
[model] fixed&added Hunyuan models (#9750)
|
2026-01-12 01:15:00 +08:00 |
|
Hertz
|
15b87f3125
|
[model] support HY-MT model (#9746)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2026-01-11 16:25:56 +08:00 |
|
Yaowei Zheng
|
5cccaeec82
|
[model] clean obsolete models (#9736)
|
2026-01-09 16:12:07 +08:00 |
|
Jackey
|
5fb5d7ebd3
|
[model] support for microsoft's Phi-4-mini (#9734)
|
2026-01-09 12:24:45 +08:00 |
|
Vo Van Phuc
|
5cfd804b59
|
[refactor] rename lfm template to lfm2 and add LFM 2.5 to README (#9731)
|
2026-01-07 19:25:04 +08:00 |
|
Vo Van Phuc
|
958fb523a2
|
[model] support LiquidAI's LFM2.5-VL vision-language model (#9729)
|
2026-01-07 17:20:29 +08:00 |
|
Vo Van Phuc
|
b4e051bea4
|
[model] support for LiquidAI's LFM2.5 (Liquid Foundation Models) (#9726)
|
2026-01-07 14:14:47 +08:00 |
|
Hertz
|
9ae62c6fc0
|
[model] support Youtu-LLM-2B (#9707)
|
2026-01-04 13:17:57 +08:00 |
|
Yaowei Zheng
|
6fe6bd290b
|
[misc] set dev version (#9703)
|
2025-12-31 23:41:40 +08:00 |
|
Yaowei Zheng
|
95ac3f2373
|
[release] Bye 2025 (#9702)
|
2025-12-31 22:22:40 +08:00 |
|
Username_Full
|
000526908a
|
[core deps] upgrade TRL to be between 0.18 and 0.24 (#9617)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-12-31 20:54:27 +08:00 |
|
Kingsley
|
bb1ba31005
|
[misc] lint mca code (#9692)
|
2025-12-29 11:44:38 +08:00 |
|
Hertz
|
c107cc22d0
|
[model] support MiniMax-M1&M2 series (#9680)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-12-28 19:02:05 +08:00 |
|
Copilot
|
eceec8ab69
|
[deps] goodbye python 3.9 (#9677)
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: hiyouga <16256802+hiyouga@users.noreply.github.com>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-12-27 02:50:44 +08:00 |
|
Yaowei Zheng
|
55590f5ece
|
[misc] fix ci with uv (#9676)
|
2025-12-27 01:39:13 +08:00 |
|
Yaowei Zheng
|
84485406b7
|
[ci] disable pip cache for ci (#9654)
|
2025-12-23 18:37:40 +08:00 |
|
Yaowei Zheng
|
6ef9854713
|
[misc] fix cache & pin transformers to 4.57.1 (#9638)
|
2025-12-22 00:20:55 +08:00 |
|
Hertz
|
4923f52a28
|
[model] support MiMo-V2-Flash model (#9637)
|
2025-12-21 14:38:18 +08:00 |
|
Hertz
|
9fd4b094d4
|
[model] support VibeThinker models (#9616)
|
2025-12-16 21:50:46 +08:00 |
|
Yaowei Zheng
|
aeda079014
|
[v1] model loader (#9613)
|
2025-12-14 11:50:52 +08:00 |
|
tangefly
|
4fd94141a4
|
[model] Add Ministral3 (#9582)
Co-authored-by: kingsley <kingsleydodonow@gmail.com>
|
2025-12-10 15:57:24 +08:00 |
|
Kingsley
|
22d6ac29d5
|
[model] Rename GLMV template (#9595)
|
2025-12-10 13:27:47 +08:00 |
|
Hertz
|
c1f5f8fff6
|
[model] support GLM4.6v (#9586)
|
2025-12-09 11:06:42 +08:00 |
|
Hertz
|
591fc9ed02
|
[model] support ERNIE-4.5-VL Models (#9521)
|
2025-11-24 16:48:06 +08:00 |
|
Yaowei Zheng
|
eaf963f67f
|
[model] update kt code (#9406)
|
2025-11-05 15:27:22 +08:00 |
|
魅影
|
14abb75126
|
[model] enable using FA in npu (#9397)
Co-authored-by: frozenleaves <frozen@Mac.local>
|
2025-11-04 19:32:30 +08:00 |
|
Peilin Li
|
934b3084ee
|
[train] KTransformers SFT as backend engine for LLaMA-Factory (#9400)
Co-authored-by: jimmy128 <jimmy128@noreply.gitcode.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-11-04 15:54:12 +08:00 |
|
Yaowei Zheng
|
3ae15da9c0
|
[misc] lint code (#9395)
|
2025-11-03 22:08:59 +08:00 |
|
Kingsley
|
13170577b2
|
[feat] support megatron-LM training by mcore_adapter (#9237)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-10-26 16:21:30 +08:00 |
|
Yaowei Zheng
|
9c0d033a15
|
[model] add qwen3vl 2b & 32b (#9343)
|
2025-10-24 13:22:36 +08:00 |
|
Yaowei Zheng
|
d9d67ba62d
|
[misc] fix import error (#9299)
|
2025-10-17 17:46:27 +08:00 |
|
wyfdgg
|
8c341cbaae
|
[model] support hunyuan-mt model (#9284)
Co-authored-by: wyfdgg <liwenkun0812@163.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-10-17 10:33:09 +08:00 |
|
Yaowei Zheng
|
1037f63311
|
[model] add qwen3vl 4b + 8b (#9275)
|
2025-10-15 15:00:36 +08:00 |
|
Yaowei Zheng
|
10146029ba
|
[v1] add v1 launcher (#9236)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2025-10-07 22:34:48 +08:00 |
|
Yaowei Zheng
|
af8437095a
|
[ci] Change macOS version (#9229)
|
2025-10-05 02:18:30 +08:00 |
|
codingma
|
2e2f92701f
|
[model] add qwen3-vl-30b (#9227)
|
2025-10-04 14:12:37 +08:00 |
|
Yaowei Zheng
|
1d96c62df2
|
[v1] add v1 folders (#9225)
|
2025-10-02 15:25:57 +08:00 |
|
Yaowei Zheng
|
6ffebe5ff7
|
[data] fix qwen omni plugin (#9204)
Co-authored-by: kingsley <kingsleydodonow@gmail.com>
|
2025-09-28 01:02:29 +08:00 |
|
xvxuopop
|
0761a4448f
|
[model] add qwen3-vl/qwen3-omni (#9196)
Co-authored-by: kingsley <kingsleydodonow@gmail.com>
|
2025-09-27 01:21:47 +08:00 |
|
Hertz
|
344c760cc1
|
[model] supported ERNIE4.5 Text Models (#9165)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-09-22 11:48:26 +08:00 |
|
Yaowei Zheng
|
80fe3a172d
|
[model] add dots ocr (#9176)
|
2025-09-21 23:34:19 +08:00 |
|