-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
add fused_topk_softmax_without_capacity for topk router fusion
#1632
opened Jun 13, 2025 by
AshOfCat
Loading…
Fix typos: vritual → virtual and decoeder → decoder
#1626
opened Jun 11, 2025 by
EricLabile
Loading…
Fix: Apply q_layernorm consistently in MLA LoRA path
#1624
opened Jun 11, 2025 by
Flink-ddd
Loading…
fix: when using moe parallel folding feature and set etp > 1 && ep == 1, the grad sync is incorrect and the loss curve is bad
#1622
opened Jun 10, 2025 by
Louis-J
Loading…
use a cpu set to cache cuda tensor
finished_request_ids
#1610
opened Jun 5, 2025 by
ladyrick
Loading…
Add DistTrain, Allow Encoder to Have Different DP Size
#1605
opened May 30, 2025 by
zidanehuang001
Loading…
bugfix: cross_entropy inplace operations may cause backward error
#1594
opened May 24, 2025 by
ChangWeiming
Loading…
fix bug: the loss of aux_loss and mtp will be tracked twice
#1585
opened May 18, 2025 by
hyleepp
Loading…
use multiple yaml files to avoid passing annoying model configs from cmd lines
#1579
opened May 14, 2025 by
nrailg
Loading…
The phrase "need to want to" is grammatically incorrect
#1574
opened May 13, 2025 by
A-transformer
Loading…
param_copy_back_gpu_hook should sync to h2d stream
#1543
opened Apr 16, 2025 by
ariverhorse
Loading…
Fix parameter error in text_generation_server.py file
#1542
opened Apr 16, 2025 by
xichengpro
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.