chore: Unify Python NVTX call #3450
Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.
/bot run |
PR_Github #1775 [ run ] triggered by Bot |
PR_Github #1775 [ run ] completed with state |
We mentioned having NVTX levels in an earlier discussion, as a tradeoff between detail and performance with nsys. I noticed this MR introduces two levels, and I noticed this PR moves nvtx_range to |
@Superjomn Thanks for the comment
I can't think of a scenario where we would need more levels; we already have layerwise NVTX markers as a feature.
It should already be unified in this PR. |
Got it; that is clear. Thanks for your reply. |
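For readers following the two-level discussion above, here is a minimal sketch of how an always-on range and a debug-only range could sit side by side. It is not the code from this PR: the environment variable name `EXAMPLE_NVTX_DEBUG` and the `events` list are hypothetical stand-ins, and the push/pop helpers only record events where a real implementation would call an NVTX binding such as `torch.cuda.nvtx.range_push` / `range_pop`.

```python
import os
from contextlib import contextmanager

# Hypothetical switch for the detailed level; not a real TensorRT-LLM setting.
DEBUG_ENV = "EXAMPLE_NVTX_DEBUG"

# Stand-in for the profiler: records what would be pushed/popped.
events = []


@contextmanager
def nvtx_range(name):
    """Always-on range: emitted on every run."""
    events.append(("push", name))  # real code: torch.cuda.nvtx.range_push(name)
    try:
        yield
    finally:
        events.append(("pop", name))  # real code: torch.cuda.nvtx.range_pop()


@contextmanager
def nvtx_range_debug(name):
    """Debug-only range: emitted only when the switch is set to "1"."""
    if os.environ.get(DEBUG_ENV) == "1":
        with nvtx_range(name):
            yield
    else:
        # Detailed level disabled: run the body without any marker.
        yield


if __name__ == "__main__":
    with nvtx_range("forward"):
        with nvtx_range_debug("attention"):
            pass
    print(events)
```

With the switch unset, only the outer range is recorded; setting it to "1" adds the inner range as well, which is the detail-versus-overhead tradeoff raised in the comment above.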
/bot run |
eaa4ea3 to 955ddf7
/bot run |
PR_Github #1851 [ run ] triggered by Bot |
I think two levels is fine -- when we profile we want as much information as possible so as long as it's documented how to do that I think we're happy on our end. |
PR_Github #1851 [ run ] completed with state |
LGTM
Signed-off-by: Kaiyu Xie <[email protected]>
9eb5c50 to 6379c3e
/bot reuse-pipeline |
PR_Github #2347 [ reuse-pipeline ] triggered by Bot |
PR_Github #2347 [ reuse-pipeline ] completed with state |
Signed-off-by: Kaiyu Xie <[email protected]> Signed-off-by: Luis Vega <[email protected]>
tensorrt_llm/_utils.py
nvtx_range_debug
call