qu_yueze
|
162c2a0dcb
|
!2956 [pytorch][bugfix]fix srcipt of promot-type overlap
Merge pull request !2956 from qu_yueze/2.1.0
|
2025-07-03 09:37:20 +00:00 |
|
yanzhixiao
|
d529405fd1
|
!2779 fix script of deepseek_r1_llama_70b_full
Merge pull request !2779 from yanzhixiao/bugfix-0606
|
2025-06-06 07:20:10 +00:00 |
|
jzh
|
0ceafd12c5
|
!2673 docs readme modify
Merge pull request !2673 from jzh/docs_0517
|
2025-05-19 04:04:38 +00:00 |
|
jzh
|
e6d441a17e
|
!2352 fix some GPUS_PER_NODE to NPUS_PER_NODE
Merge pull request !2352 from jzh/master-v3example
|
2025-03-08 09:40:40 +00:00 |
|
wucong
|
a7e7284a8b
|
!2200 add template deepseek3
Merge pull request !2200 from wucong/add_distill_qwen
|
2025-02-11 02:06:44 +00:00 |
|
guoxinjie
|
e89e1b1db2
|
!2196 DeepSeek-R1-Distill-Qwen-1.5B and LLaMA-8B、70B
Merge pull request !2196 from guoxinjie/distill-qwen
|
2025-02-09 12:46:23 +00:00 |
|