This website requires JavaScript.
Explore
Help
Register
Sign In
cunliangkong
/
LlamaFactory
Watch
1
Star
0
Fork
0
You've already forked LlamaFactory
mirror of
https://github.com/hiyouga/LlamaFactory.git
synced
2026-03-23 02:33:24 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
9585699918f91a3af840475a4246f7088ffc8d00
LlamaFactory
/
src
/
llmtuner
/
tuner
History
hiyouga
688e8601ab
web UI integrating RLHF
...
Former-commit-id: 137fd146b90f89a1164b56e6d507b30b1f5c2437
2023-08-14 10:48:47 +08:00
..
core
tiny fix
2023-08-12 22:02:43 +08:00
dpo
fix
#480
2023-08-14 00:23:56 +08:00
ppo
web UI integrating RLHF
2023-08-14 10:48:47 +08:00
pt
support DPO training (2305.18290)
2023-08-11 03:02:53 +08:00
rm
support DPO training (2305.18290)
2023-08-11 03:02:53 +08:00
sft
support DPO training (2305.18290)
2023-08-11 03:02:53 +08:00
__init__.py
modify code structure
2023-08-02 23:17:36 +08:00
tune.py
support rope scaling,
fix
#475
#476
#478
2023-08-12 20:46:27 +08:00