ViperEkura
|
25ec56a1f5
|
fix(trainer): 修复训练器恢复检查点时的学习率初始化问题
|
2025-09-28 14:38:02 +08:00 |
ViperEkura
|
c8a38743a4
|
fix(tests): 更新测试代码以验证优化器状态的保存与加载
|
2025-09-28 14:00:38 +08:00 |
ViperEkura
|
f25a249291
|
feat(khaosz): 优化模型参数保存与加载逻辑
|
2025-09-28 14:00:21 +08:00 |
ViperEkura
|
4fcdc87c95
|
feat(trainer): 重构数据集与策略模块以支持字典形式的数据返回
|
2025-09-27 14:11:27 +08:00 |
ViperEkura
|
9fbc9481b5
|
refactor(core): 修改注意力掩码处理函数并重命名参数
|
2025-09-27 13:37:10 +08:00 |
ViperEkura
|
053f4a4dad
|
feat( StrategyFactory): 添加 SFT 策略初始化参数并完善工厂方法调用
|
2025-09-27 13:24:16 +08:00 |
ViperEkura
|
676fdd59d7
|
feat(strategy): 重构mask构建逻辑并优化策略工厂参数传递
|
2025-09-27 13:12:57 +08:00 |
ViperEkura
|
a4443765ee
|
Initial commit
|
2025-09-27 12:02:22 +08:00 |