ViperEkura
|
d882f65579
|
refactor(parallel): refactor the parallel module
|
2025-12-13 22:16:17 +08:00 |
ViperEkura
|
c98b175cd5
|
refactor(trainer): improve the trainer structure
|
2025-12-07 21:23:05 +08:00 |
ViperEkura
|
82e65ccc21
|
fix(tools/train): fix argument-passing error
|
2025-12-05 13:53:50 +08:00 |
ViperEkura
|
db53cc5001
|
feat(tools/train): improve training argument passing
|
2025-11-30 13:49:24 +08:00 |
ViperEkura
|
d9ff662e3a
|
fix(model): reorder KV Cache dimensions to match the new indexing logic
|
2025-11-19 18:26:15 +08:00 |
ViperEkura
|
3bf2468905
|
fix(tools): correct the embedding-layer parameter grouping condition in the training script
|
2025-11-19 17:47:33 +08:00 |
ViperEkura
|
4c289e974a
|
refactor(tools): move tool scripts into the tools directory
|
2025-11-10 21:26:02 +08:00 |