6 Commits

Author SHA1 Message Date
qu_yueze
162c2a0dcb !2956 [pytorch][bugfix]fix srcipt of promot-type overlap
Merge pull request !2956 from qu_yueze/2.1.0
2025-07-03 09:37:20 +00:00
yanzhixiao
d529405fd1 !2779 fix script of deepseek_r1_llama_70b_full
Merge pull request !2779 from yanzhixiao/bugfix-0606
2025-06-06 07:20:10 +00:00
jzh
0ceafd12c5 !2673 docs readme modify
Merge pull request !2673 from jzh/docs_0517
2025-05-19 04:04:38 +00:00
jzh
e6d441a17e !2352 fix some GPUS_PER_NODE to NPUS_PER_NODE
Merge pull request !2352 from jzh/master-v3example
2025-03-08 09:40:40 +00:00
wucong
a7e7284a8b !2200 add template deepseek3
Merge pull request !2200 from wucong/add_distill_qwen
2025-02-11 02:06:44 +00:00
guoxinjie
e89e1b1db2 !2196 DeepSeek-R1-Distill-Qwen-1.5B and LLaMA-8B、70B
Merge pull request !2196 from guoxinjie/distill-qwen
2025-02-09 12:46:23 +00:00