This website requires JavaScript.
Explore
Help
Register
Sign In
leftyer
/
MindSpeed-LLM
Watch
1
Star
0
Fork
0
You've already forked MindSpeed-LLM
mirror of
https://gitee.com/ascend/MindSpeed-LLM.git
synced
2025-12-06 11:28:59 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
2.1.0
Add File
New File
Upload File
Apply Patch
MindSpeed-LLM
/
examples
/
mcore
/
deepseek_r1_distill_llama
History
qu_yueze
162c2a0dcb
!2956
[pytorch][bugfix]fix srcipt of promot-type overlap
...
Merge pull request
!2956
from qu_yueze/2.1.0
2025-07-03 09:37:20 +00:00
..
ckpt_convert_distill_llama_hf2mcore.sh
!2196
DeepSeek-R1-Distill-Qwen-1.5B and LLaMA-8B、70B
2025-02-09 12:46:23 +00:00
data_convert_distill_llama_instruction.sh
!2673
docs readme modify
2025-05-19 04:04:38 +00:00
generate_distill_llama_8b.sh
!2200
add template deepseek3
2025-02-11 02:06:44 +00:00
generate_distill_llama_70b.sh
!2200
add template deepseek3
2025-02-11 02:06:44 +00:00
tune_distill_llama_8b_full.sh
!2956
[pytorch][bugfix]fix srcipt of promot-type overlap
2025-07-03 09:37:20 +00:00
tune_distill_llama_70b_full.sh
!2779
fix script of deepseek_r1_llama_70b_full
2025-06-06 07:20:10 +00:00