ViperEkura
/
AstrAI
AstrAI/khaosz/trainer
ViperEkura
a5574f92e2
feat: initial implementation of the GRPO algorithm logic
2026-03-19 20:56:53 +08:00
__init__.py
fix: resolve inconsistent callback timing
2026-03-06 10:51:22 +08:00
metric_util.py
refactor: revise metric_util.py
2026-03-06 10:33:44 +08:00
schedule.py
fix: resolve several runtime issues
2026-03-01 15:47:07 +08:00
strategy.py
feat: initial implementation of the GRPO algorithm logic
2026-03-19 20:56:53 +08:00
train_callback.py
fix: resolve the metric save timing issue
2026-03-16 20:07:36 +08:00
train_context.py
fix: resolve several runtime issues
2026-03-01 15:47:07 +08:00
trainer.py
fix: resolve the gradient averaging issue
2026-03-13 23:00:26 +08:00