Skip to content

From NVIDIA Megatron-LM for visibility#18

Open
RaymondLi0 wants to merge 6581 commits intobigcode-project:multi-query-attentionfrom
NVIDIA:main
Open

From NVIDIA Megatron-LM for visibility#18
RaymondLi0 wants to merge 6581 commits intobigcode-project:multi-query-attentionfrom
NVIDIA:main

Conversation

@RaymondLi0
Copy link
Collaborator

No description provided.

@RaymondLi0 RaymondLi0 changed the base branch from multi-query-attention to before-merge June 20, 2023 20:12
@RaymondLi0 RaymondLi0 changed the base branch from before-merge to multi-query-attention June 20, 2023 20:12
faradawn and others added 27 commits February 20, 2026 17:48
Signed-off-by: Faradawn Yang <73060648+faradawn@users.noreply.github.com>
…deepcopy bug from PyTorch 26.01. (#3510)

Signed-off-by: Cory Ye <cye@nvidia.com>
Signed-off-by: adithyare <adithyare@nvidia.com>
Signed-off-by: Soumye Singhal <soumyes@cw-dfw-cs-001-dc-01.cm.cluster>
Co-authored-by: Soumye Singhal <soumyes@cw-dfw-cs-001-dc-01.cm.cluster>
Co-authored-by: Rabeeh Karimi Mahabadi <rkarimimahab@nvidia.com>
Co-authored-by: Seonjin Na <sna@nvidia.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
#3537)

Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: Faradawn Yang <73060648+faradawn@users.noreply.github.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
returnL and others added 30 commits March 19, 2026 01:58
Co-authored-by: anthropic-code-agent[bot] <242468646+Claude@users.noreply.github.com>
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Xin Yao <xiny@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Co-authored-by: Xin Yao <xiny@nvidia.com>
…ate tracking (#3748)

Signed-off-by: Robin Zhang <robinz@nvidia.com>
Co-authored-by: Eric Harper <eharper@nvidia.com>
…#3951)

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
…3947)

Signed-off-by: oliver könig <okoenig@nvidia.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
…zed MoE layers (#3941)

Signed-off-by: Keshav Santhanam <ksanthanam@nvidia.com>
Co-authored-by: Siddharth Singh <sidsingh@nvidia.com>
Co-authored-by: Xin Yao <xiny@nvidia.com>
Co-authored-by: oliver könig <okoenig@nvidia.com>
Co-authored-by: William Dykas <wdykas@oci-hsg-cs-001-vscode-03.cm.cluster>
Co-authored-by: root <root@nvl72077-T12.cm.cluster>
Co-authored-by: root <root@nvl72098-T17.cm.cluster>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Co-authored-by: Jorge Albericio <jalbericiola@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Co-authored-by: William Dykas <wdykas@oci-hsg-cs-001-vscode-03.cm.cluster>
Co-authored-by: root <root@nvl72077-T12.cm.cluster>
Co-authored-by: root <root@nvl72098-T17.cm.cluster>
Co-authored-by: root <root@nvl72160-T18.cm.cluster>
Co-authored-by: root <root@nvl72006-T11.cm.cluster>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
…rs (#3979)" (#3982)

Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
Co-authored-by: oliver könig <okoenig@nvidia.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.