This website requires JavaScript.
Explore
Help
Register
Sign In
leftyer
/
MindSpeed-LLM
Watch
1
Star
0
Fork
0
You've already forked MindSpeed-LLM
mirror of
https://gitee.com/ascend/MindSpeed-LLM.git
synced
2025-12-06 11:28:59 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
2.1.0
Add File
New File
Upload File
Apply Patch
MindSpeed-LLM
/
mindspeed_llm
/
training
History
丁子叉
3a8385c4fa
!3372
[pytorch][bugfix]fix profile step setting and qwen3 scripts
...
Merge pull request
!3372
from 丁子叉/210_profile
2025-09-24 01:39:06 +00:00
..
tokenizer
!3113
[pytorch][bugfix]fix some bug for icsl
2025-08-08 07:13:20 +00:00
__init__.py
!2470
[core-llm][dskv3]mtp loss scaler and fix expert bias dtype
2025-04-16 06:06:45 +00:00
arguments.py
!3127
[pytorch][bugfix] update icsl for weights_only
2025-08-11 15:58:41 +00:00
checkpointing.py
!3127
[pytorch][bugfix] update icsl for weights_only
2025-08-11 15:58:41 +00:00
initialize.py
!2393
add mc2
2025-03-22 10:15:39 +00:00
training.py
!3372
[pytorch][bugfix]fix profile step setting and qwen3 scripts
2025-09-24 01:39:06 +00:00
utils.py
!3288
[pytorch][bugfix]optimize attention mask memory in tuning and dpo.
2025-09-16 11:19:28 +00:00