Commit Graph

16 Commits

Author SHA1 Message Date
ViperEkura 92999fa9f6 fix(trainer): 修复训练器中配置引用错误的问题 2025-09-28 22:20:25 +08:00
ViperEkura 0ebf53008e refactor(test): 更新训练配置参数名称并优化测试逻辑 2025-09-28 22:14:39 +08:00
ViperEkura 1c9063fd3d refactor(trainer): 统一参数命名以提升可读性 2025-09-28 22:14:24 +08:00
ViperEkura fa43ed2943 feat(trainer): 重构训练配置与策略工厂引入 2025-09-28 21:39:48 +08:00
ViperEkura 2dc7b5bda8 build(.gitignore): 更新 gitignore 文件忽略规则 2025-09-28 15:39:13 +08:00
ViperEkura 30ac07418c feat(train): 添加多轮对话训练支持 2025-09-28 15:38:53 +08:00
ViperEkura 1169cfad82 fix(trainer): 修复多轮对话中的因果注意力掩码计算逻辑等 2025-09-28 15:15:19 +08:00
ViperEkura 0b96b11a6e test(trainer): 增加训练中断与检查点恢复测试 2025-09-28 14:38:23 +08:00
ViperEkura 25ec56a1f5 fix(trainer): 修复训练器恢复检查点时的学习率初始化问题 2025-09-28 14:38:02 +08:00
ViperEkura c8a38743a4 fix(tests): 更新测试代码以验证优化器状态的保存与加载 2025-09-28 14:00:38 +08:00
ViperEkura f25a249291 feat(khaosz): 优化模型参数保存与加载逻辑 2025-09-28 14:00:21 +08:00
ViperEkura 4fcdc87c95 feat(trainer): 重构数据集与策略模块以支持字典形式的数据返回 2025-09-27 14:11:27 +08:00
ViperEkura 9fbc9481b5 refactor(core): 修改注意力掩码处理函数并重命名参数 2025-09-27 13:37:10 +08:00
ViperEkura 053f4a4dad feat( StrategyFactory): 添加 SFT 策略初始化参数并完善工厂方法调用 2025-09-27 13:24:16 +08:00
ViperEkura 676fdd59d7 feat(strategy): 重构mask构建逻辑并优化策略工厂参数传递 2025-09-27 13:12:57 +08:00
ViperEkura a4443765ee Initial commit 2025-09-27 12:02:22 +08:00