mirror of
https://github.com/hiyouga/LlamaFactory.git
synced 2026-03-23 02:33:24 +08:00
unwrap_model_for_generation(reward_model) is necessary for zero3 training

Former-commit-id: c8d5b21700577cae8d6ca03359bcf1762c8b7cb8
22 KiB