[https://nvbugs/5973801][fix] exclude subproc_worker_timer from thread leak checks#12286

Merged
MrGeva merged 3 commits into NVIDIA:main from nv-auto-deploy:eg/fixthreadleak
Mar 17, 2026


@MrGeva
Collaborator

@MrGeva MrGeva commented Mar 17, 2026

All failures leak a <Thread(subproc_worker_timer, ...)> thread from torch/_inductor/compile_worker/timer.py. The
timer is created when quiesce_async_compile_pool is enabled (via NVIDIA's internal justknob
system in CI). After the compile pool shuts down, timer.quit() sets a flag, but the thread is in
a 30-second sleep and won't check the flag until after the test ends.
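
To illustrate why the thread outlives the test, here is a minimal sketch of a timer thread that only checks its quit flag after each sleep. It is not the PyTorch implementation: the `SleepingTimer` class, its `quit()` method, and the short interval are hypothetical stand-ins chosen so the behavior is easy to observe.

```python
import threading
import time
from typing import Tuple


class SleepingTimer:
    """Hypothetical stand-in for a timer thread that polls a quit flag after each sleep."""

    def __init__(self, interval_s: float) -> None:
        self._interval_s = interval_s
        self._quit = False
        self.thread = threading.Thread(
            target=self._run, name="subproc_worker_timer", daemon=True
        )
        self.thread.start()

    def _run(self) -> None:
        while True:
            # The flag is not observed while the thread is sleeping.
            time.sleep(self._interval_s)
            if self._quit:
                return

    def quit(self) -> None:
        # Only sets the flag; it neither wakes nor joins the thread.
        self._quit = True


def demo(interval_s: float = 0.3) -> Tuple[bool, bool]:
    """Return (alive right after quit, alive after waiting past the sleep)."""
    timer = SleepingTimer(interval_s)
    timer.quit()
    alive_after_quit = timer.thread.is_alive()
    timer.thread.join(timeout=interval_s * 10)
    alive_after_wait = timer.thread.is_alive()
    return alive_after_quit, alive_after_wait
```

With a 30-second interval, a leak check that runs right after `quit()` would still see the thread alive, which matches the failure described above.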

The fix adds subproc_worker_timer to threadleak_exclude in tests/unittest/pytest.ini. It
follows the exact same precedent as InductorSubproc (already excluded, same pool, same
justification: benign daemon thread internal to PyTorch inductor). This single-line change covers
all 7 failing test instances across all 3 GPU environments.
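
As a rough sketch of the effect of the exclusion, the snippet below filters newly appeared thread names against an exclude pattern. The `leaked_thread_names` helper, the regex-alternation form of the pattern, and the `my_worker` thread name are assumptions for illustration; the actual matching logic of the thread leak plugin and the full contents of pytest.ini are not shown in this PR.

```python
import re
from typing import Iterable, List


def leaked_thread_names(
    before: Iterable[str], after: Iterable[str], exclude: str
) -> List[str]:
    """Return names present after the test but not before, minus excluded ones.

    Hypothetical helper: `exclude` is treated as a regular expression
    searched against each thread name.
    """
    known = set(before)
    pattern = re.compile(exclude)
    return [
        name
        for name in after
        if name not in known and not pattern.search(name)
    ]


before = ["MainThread"]
after = ["MainThread", "InductorSubproc", "subproc_worker_timer", "my_worker"]

# Excluding only InductorSubproc still reports the timer thread.
old_report = leaked_thread_names(before, after, r"InductorSubproc")

# Adding subproc_worker_timer to the pattern leaves only the genuine leak.
new_report = leaked_thread_names(
    before, after, r"InductorSubproc|subproc_worker_timer"
)
```

Under these assumptions, the exclusion hides only the named PyTorch-internal threads, so other leaked threads are still reported.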

@coderabbitai summary

Description

Test Coverage

PR Checklist

Please review the following before submitting your PR:

  • PR description clearly explains what and why. If using CodeRabbit's summary, please make sure it makes sense.

  • PR Follows TRT-LLM CODING GUIDELINES to the best of your knowledge.

  • Test cases are provided for new code paths (see test instructions)

  • Any new dependencies have been scanned for license and vulnerabilities

  • CODEOWNERS updated if ownership changes

  • Documentation updated as needed

  • Update tava architecture diagram if there is a significant design change in PR.

  • The reviewers assigned automatically/manually are appropriate for the PR.

  • Please check this after reviewing the above items as appropriate for this PR.

GitHub Bot Help

To see a list of available CI bot commands, please comment /bot help.

MrGeva added 2 commits March 15, 2026 09:37
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
@MrGeva MrGeva requested a review from a team as a code owner March 17, 2026 15:02
@MrGeva MrGeva requested a review from bmarimuthu-nv March 17, 2026 15:02
@MrGeva
Collaborator Author

MrGeva commented Mar 17, 2026

/bot run --extra-stage "DGX_B200-4_GPUs-AutoDeploy-1, DGX_H100-4_GPUs-AutoDeploy-1"

@coderabbitai
Contributor

coderabbitai Bot commented Mar 17, 2026

Caution

Review failed

The head commit changed during the review from 9a0207d to 8fd4c3b.


@tensorrt-cicd
Collaborator

PR_Github #39273 [ run ] triggered by Bot. Commit: 8fd4c3b Link to invocation

@MrGeva MrGeva enabled auto-merge (squash) March 17, 2026 17:12
@tensorrt-cicd
Collaborator

PR_Github #39273 [ run ] completed with state SUCCESS. Commit: 8fd4c3b
/LLM/main/L0_MergeRequest_PR pipeline #30525 completed with status: 'SUCCESS'

CI Report

Link to invocation

@MrGeva MrGeva merged commit a3cae08 into NVIDIA:main Mar 17, 2026
6 of 7 checks passed
@MrGeva MrGeva deleted the eg/fixthreadleak branch March 18, 2026 05:35
limin2021 pushed a commit to limin2021/TensorRT-LLM that referenced this pull request Mar 19, 2026
…d leak checks (NVIDIA#12286)

Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
longcheng-nv pushed a commit to longcheng-nv/TensorRT-LLM that referenced this pull request Mar 31, 2026
…d leak checks (NVIDIA#12286)

Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>