Change priority for scheduling reroute during timeout #16445

Merged
merged 18 commits into from
Mar 18, 2025

@imRishN (Member) commented Oct 23, 2024

Description

This PR lowers the priority of the reroute that is scheduled after a timeout from HIGH to NORMAL. Repeatedly scheduled HIGH-priority reroutes can starve NORMAL-priority tasks, and NORMAL is sufficient for clusters in a healthy state. For clusters in a degraded state, where NORMAL-priority tasks are themselves being starved, this PR adds a new dynamic cluster setting that raises the priority of the reroute task so that shards can still be allocated.
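
The starvation argument above can be illustrated with a toy model. The following is a minimal sketch in Python, not OpenSearch code: the `Priority` enum, the `simulate` function, the task names, and the assumption that every reroute times out and schedules a follow-up are all hypothetical and exist only to show why a self-rescheduling HIGH-priority task can keep a NORMAL-priority task from ever running.

```python
import heapq
from enum import IntEnum
from itertools import count


class Priority(IntEnum):
    # Lower value runs first; the names mirror the levels mentioned in the PR.
    HIGH = 1
    NORMAL = 2


def simulate(reroute_priority, steps=6):
    """Run up to `steps` tasks from a priority queue and return their names
    in execution order. Each reroute re-schedules itself at the same priority,
    which is the (hypothetical) behaviour of a reroute that keeps timing out."""
    seq = count()  # tie-breaker: FIFO order among tasks of equal priority
    queue = []
    heapq.heappush(queue, (reroute_priority, next(seq), "reroute"))
    heapq.heappush(queue, (Priority.NORMAL, next(seq), "other-task"))

    executed = []
    for _ in range(steps):
        if not queue:
            break
        _, _, name = heapq.heappop(queue)
        executed.append(name)
        if name == "reroute":
            # Assumption for illustration: every reroute schedules a follow-up.
            heapq.heappush(queue, (reroute_priority, next(seq), "reroute"))
    return executed
```

In this model, `simulate(Priority.HIGH)` never reaches `other-task` within the simulated steps, while `simulate(Priority.NORMAL)` runs it right after the first reroute. Passing the priority as an argument loosely corresponds to the dynamic setting described above, which lets an operator raise the reroute priority again when the cluster needs it.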

Related Issues

Resolves #[Issue number to be closed when this PR is merged]

Check List

  • [ ] Functionality includes testing.
  • [ ] API changes companion pull request created, if applicable.
  • [ ] Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

❌ Gradle check result for 5e83a92: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@Bukhtawar (Contributor) left a comment

Let's update the PR description.

@imRishN (Member, Author) commented Oct 23, 2024

> Let's update the PR description

Updated

Signed-off-by: Rishab Nahata <[email protected]>

❌ Gradle check result for 6a448d0: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@opensearch-trigger-bot removed the stalled label (Issues that have stalled) Mar 9, 2025

❌ Gradle check result for f29d380: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@imRishN added the backport 2.x label (Backport to 2.x branch) Mar 18, 2025

❌ Gradle check result for 57a8686: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Signed-off-by: Rishab Nahata <[email protected]>

❌ Gradle check result for 4696885: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

✅ Gradle check result for 614f75b: SUCCESS

@linuxpi merged commit 36d798c into opensearch-project:main Mar 18, 2025
31 checks passed

@opensearch-trigger-bot (Contributor) commented

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 128

To backport manually, run these commands in your terminal:

# Navigate to the root of your repository
cd $(git rev-parse --show-toplevel)
# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add ../.worktrees/OpenSearch/backport-2.x 2.x
# Navigate to the new working tree
pushd ../.worktrees/OpenSearch/backport-2.x
# Create a new branch
git switch --create backport/backport-16445-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 36d798cc42c473bdb49f4bf39093dd3dd9191b70
# Push it to GitHub
git push --set-upstream origin backport/backport-16445-to-2.x
# Go back to the original working tree
popd
# Delete the working tree
git worktree remove ../.worktrees/OpenSearch/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-16445-to-2.x.

vinaykpud pushed a commit to vinaykpud/OpenSearch that referenced this pull request Mar 18, 2025
…ject#16445)

* Change priority for scheduling reroute in timeout

Signed-off-by: Rishab Nahata <[email protected]>

* Add setting for ESA

Signed-off-by: Rishab Nahata <[email protected]>

* Fix tests

Signed-off-by: Rishab Nahata <[email protected]>

* Trigger Build

Signed-off-by: Rishab Nahata <[email protected]>

* Trigger Build

Signed-off-by: Rishab Nahata <[email protected]>

* Add test

Signed-off-by: Rishab Nahata <[email protected]>

* Trigger Build

Signed-off-by: Rishab Nahata <[email protected]>

* Add changelog

Signed-off-by: Rishab Nahata <[email protected]>

* Trigger Build

Signed-off-by: Rishab Nahata <[email protected]>

* Trigger Build

Signed-off-by: Rishab Nahata <[email protected]>

* Trigger Build

Signed-off-by: Rishab Nahata <[email protected]>

---------

Signed-off-by: Rishab Nahata <[email protected]>
linuxpi pushed a commit that referenced this pull request Mar 26, 2025
…16445) (#17622)

* Change priority for scheduling reroute during timeout (#16445)

Signed-off-by: Rishab Nahata <[email protected]>

* Trigger Build

Signed-off-by: Rishab Nahata <[email protected]>

* Trigger Build

Signed-off-by: Rishab Nahata <[email protected]>

* Trigger Build

Signed-off-by: Rishab Nahata <[email protected]>

---------

Signed-off-by: Rishab Nahata <[email protected]>
Labels: backport 2.x (Backport to 2.x branch), backport-failed
3 participants