-
Notifications
You must be signed in to change notification settings - Fork 1.4k
fix: Fixing issue with first gen token being returned twice in streaming #3427
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
dfaebf0
to
384d556
Compare
/bot run --disable-fail-fast |
PR_Github #1641 [ run ] triggered by Bot |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks Patrice!
LGTM. Thank you! |
PR_Github #1641 [ run ] completed with state |
df06fa2
to
d90b2b3
Compare
/bot run --add-multi-gpu-test |
PR_Github #1773 [ run ] triggered by Bot |
…aming Signed-off-by: Patrice Castonguay <[email protected]>
Signed-off-by: Patrice Castonguay <[email protected]>
d90b2b3
to
45ac435
Compare
/bot run --only-multi-gpu-test --disable-fail-fast |
PR_Github #1827 [ run ] triggered by Bot |
PR_Github #1773 [ run ] completed with state |
PR_Github #1827 [ run ] completed with state |
/bot run --only-multi-gpu-test --disable-fail-fast |
PR_Github #1839 [ run ] triggered by Bot |
/bot run --only-multi-gpu-test --disable-fail-fast |
PR_Github #1839 [ run ] completed with state |
PR_Github #1865 [ run ] triggered by Bot |
PR_Github #1865 [ run ] completed with state |
/bot run --only-multi-gpu-test |
/bot run --add-multi-gpu-test --disable-fail-fast |
PR_Github #2050 [ run ] triggered by Bot |
PR_Github #2050 [ run ] completed with state |
/bot run --stage-list "L40S-TensorRT-3" |
/bot run --only-multi-gpu-test |
PR_Github #2059 [ run ] triggered by Bot |
PR_Github #2060 [ run ] triggered by Bot |
PR_Github #2059 [ run ] completed with state |
/bot run --only-multi-gpu-test --disable-fail-fast |
PR_Github #2061 [ run ] triggered by Bot |
PR_Github #2060 [ run ] completed with state |
PR_Github #2061 [ run ] completed with state |
/bot run --stage-list "L40S-TensorRT-3" |
PR_Github #2070 [ run ] triggered by Bot |
PR_Github #2070 [ run ] completed with state |
/bot reuse-pipeline |
PR_Github #2084 [ reuse-pipeline ] triggered by Bot |
PR_Github #2084 [ reuse-pipeline ] completed with state |
/bot skip --comment "ran all tests previously" |
PR_Github #2087 [ skip ] triggered by Bot |
PR_Github #2087 [ skip ] completed with state |
/bot skip --comment "Ran all tests previously" |
PR_Github #2092 [ skip ] triggered by Bot |
PR_Github #2092 [ skip ] completed with state |
Better fix for first gen token being returned twice in streaming mode.