ViperEkura
|
0f518473af
|
fix: 修复强化学习算法问题
|
2026-03-19 22:23:51 +08:00 |
ViperEkura
|
e35cb0d84a
|
feat: 增加 label smoothing 设置
|
2026-03-13 22:37:27 +08:00 |
ViperEkura
|
2331713fde
|
refactor: 修改训练脚本
|
2026-03-05 14:40:26 +08:00 |
ViperEkura
|
80e17418b4
|
fix: 修复一些运行时问题
|
2026-03-01 15:47:07 +08:00 |
ViperEkura
|
b17cc6a6fb
|
refactor: 修改参数传递方案
|
2026-02-28 18:09:00 +08:00 |
ViperEkura
|
eba99e1f5e
|
feat(model): 添加QK归一化和门控注意力支持
|
2026-01-05 16:14:44 +08:00 |
ViperEkura
|
fd7ee2895a
|
refactor(paralell): 优化并行设备指定方法
|
2025-12-26 20:54:33 +08:00 |
ViperEkura
|
cfa3cf7daa
|
feat(train): 支持分布式训练的优化器与调度器工厂配置
|
2025-12-22 20:41:03 +08:00 |
ViperEkura
|
573f041c51
|
feat(trainer): 支持分布式训练配置与检查点加载优化
|
2025-12-19 19:34:39 +08:00 |
ViperEkura
|
d882f65579
|
refactor(parallel): 重构parallel模块
|
2025-12-13 22:16:17 +08:00 |
ViperEkura
|
c98b175cd5
|
refactor(trainer): 优化trainer 结构
|
2025-12-07 21:23:05 +08:00 |
ViperEkura
|
82e65ccc21
|
fix(tools/train): 修复参数传递错误
|
2025-12-05 13:53:50 +08:00 |
ViperEkura
|
db53cc5001
|
feat(tools/train): 优化训练参数传递
|
2025-11-30 13:49:24 +08:00 |
ViperEkura
|
3bf2468905
|
fix(tools): 修正训练脚本中的嵌入层参数分组判断条件
|
2025-11-19 17:47:33 +08:00 |
ViperEkura
|
4c289e974a
|
refactor(tools): 将工具脚本移动到tools目录下
|
2025-11-10 21:26:02 +08:00 |