-
Notifications
You must be signed in to change notification settings - Fork 1.4k
feat: allocate minimal blocks per window size #3028
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: allocate minimal blocks per window size #3028
Conversation
b13fd0b
to
8ee8036
Compare
…o window block managers per window size Signed-off-by: Netanel Haber <[email protected]>
8ee8036
to
efa1996
Compare
…t per window size Signed-off-by: Netanel Haber <[email protected]>
Signed-off-by: Netanel Haber <[email protected]>
Signed-off-by: Netanel Haber <[email protected]>
…on scope input length between window size loop iterations Signed-off-by: Netanel Haber <[email protected]>
…indow block managers Signed-off-by: Netanel Haber <[email protected]>
/bot run |
1 similar comment
/bot run |
Signed-off-by: Netanel Haber <[email protected]>
/bot run |
PR_Github #587 [ run ] triggered by Bot |
PR_Github #587 [ run ] completed with state |
Signed-off-by: Netanel Haber <[email protected]>
/bot run --disable-fail-fast |
PR_Github #601 [ run ] triggered by Bot |
PR_Github #601 [ run ] completed with state |
/bot run |
PR_Github #607 [ run ] triggered by Bot |
PR_Github #607 [ run ] completed with state |
Signed-off-by: Netanel Haber <[email protected]>
Signed-off-by: Netanel Haber <[email protected]>
/bot run |
PR_Github #2183 [ run ] completed with state |
Signed-off-by: Netanel Haber <[email protected]>
/bot run --disable-fail-fast |
PR_Github #2207 [ run ] triggered by Bot |
PR_Github #2199 [ run ] completed with state |
Signed-off-by: Netanel Haber <[email protected]> remove misleading comment Signed-off-by: Netanel Haber <[email protected]> fix test_gemma Signed-off-by: Netanel Haber <[email protected]>
de24687
to
bb4095e
Compare
Signed-off-by: Netanel Haber <[email protected]>
Signed-off-by: Netanel Haber <[email protected]>
PR_Github #2207 [ run ] completed with state |
/bot --help |
GitHub Bot Help
Provide a user friendly way for developers to interact with a Jenkins server. Run See details below for each supported subcommand.
Launch build/test pipelines. All previously running jobs will be killed.
kill
Kill all running builds associated with pull request. skip
Skip testing for latest commit on pull request. reuse-pipeline
Reuse a previous pipeline to validate current commit. This action will also kill all currently running builds associated with the pull request. IMPORTANT NOTE: This is dangerous since lack of user care and validation can cause top of tree to break. |
…s, like in main Signed-off-by: Netanel Haber <[email protected]>
* fix: disable KV cache reuse if using attention sink Signed-off-by: Robin Kobus <[email protected]> * fix: disable KV cache reuse if sink bubble Signed-off-by: Robin Kobus <[email protected]> * add comment Signed-off-by: Robin Kobus <[email protected]> --------- Signed-off-by: Robin Kobus <[email protected]>
/bot run --disable-fail-fast |
PR_Github #2494 [ run ] triggered by Bot |
/bot run --disable-fail-fast |
PR_Github #2501 [ run ] triggered by Bot |
PR_Github #2494 [ run ] completed with state |
PR_Github #2501 [ run ] completed with state |
/bot run |
PR_Github #2525 [ run ] triggered by Bot |
PR_Github #2525 [ run ] completed with state |
/bot reuse-pipeline |
PR_Github #2612 [ reuse-pipeline ] triggered by Bot |
PR_Github #2612 [ reuse-pipeline ] completed with state |
No description provided.