[https://nvbugs/5916151][fix] Unwaive test_fused_moe_w4a8_nvfp4_fp8[TRTLLM]#12400
EmmaQiaoCh merged 1 commit into NVIDIA:main from xxi-nv:unwaive-nvbug-5916151
Conversation
…RTLLM] The illegal memory access bug (NVBug 5916151) was fixed by PR NVIDIA#11502, which corrected the .cuda() call ordering in the test. Verified on B300 with 7/7 passes (1 initial + 1 CUTLASS + 5 repeat runs).

Signed-off-by: xxi <xxi@nvidia.com>
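The class of bug described above can be sketched without a GPU: `torch.Tensor.cuda()` is not in-place, so if code keeps using the original host tensor after the move (or captures its data before the move), a kernel ends up reading memory on the wrong device. The classes and names below are hypothetical stand-ins for illustration only; this is not the actual TensorRT-LLM test code.

```python
class FakeTensor:
    """Minimal stand-in for a tensor that tracks which device its data lives on."""
    def __init__(self, data):
        self.data = list(data)
        self.device = "cpu"

    def cuda(self):
        # Like torch.Tensor.cuda(): returns a NEW tensor on the device;
        # the original tensor is left untouched on the CPU.
        moved = FakeTensor(self.data)
        moved.device = "cuda"
        return moved

def launch_kernel(*tensors):
    # A device kernel may only touch device memory; a host tensor here
    # corresponds to the illegal-memory-access failure mode.
    for t in tensors:
        if t.device != "cuda":
            raise RuntimeError("illegal memory access: tensor is on " + t.device)
    return sum(sum(t.data) for t in tensors)

# Buggy ordering: the return value of .cuda() is discarded, so the
# original CPU tensor is what reaches the kernel.
w = FakeTensor([1, 2, 3])
w.cuda()  # result dropped -- w itself stays on the CPU
try:
    launch_kernel(w)
except RuntimeError as e:
    print("buggy order:", e)

# Fixed ordering: move first, then hand the returned device tensor to the kernel.
w_dev = FakeTensor([1, 2, 3]).cuda()
print("fixed order:", launch_kernel(w_dev))
```

The fix is purely a matter of when `.cuda()` runs relative to the code that consumes the tensor, which matches the "call ordering" description in the commit message.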
📝 Walkthrough: One integration test waiver entry is removed from the test waiver list, allowing the previously skipped test case to run again.
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~10 minutes. Pre-merge checks: ✅ 3 passed.
/bot run --disable-fail-fast
PR_Github #39707 [ run ] triggered by Bot.
/bot run --stage-list "DGX_B300-4_GPUs-PyTorch-Post-Merge-2"
PR_Github #39710 [ run ] triggered by Bot.
PR_Github #39710 [ run ] completed with state
/bot run --disable-fail-fast
PR_Github #39734 [ run ] triggered by Bot.
PR_Github #39734 [ run ] completed with state
/bot run --disable-fail-fast
PR_Github #39847 [ run ] triggered by Bot.
PR_Github #39847 [ run ] completed with state
…RTLLM] (NVIDIA#12400)

Signed-off-by: xxi <xxi@nvidia.com>
Summary
- Unwaives test_fused_moe_w4a8_nvfp4_fp8[TRTLLM] (NVBug 5916151)
- The underlying bug was fixed by correcting the .cuda() call ordering in the test's run_fused_moe_nvfp4() function

Test plan