-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix cluster not able to spin up issue when disk usage exceeds threshold #15258
fix cluster not able to spin up issue when disk usage exceeds threshold #15258
Conversation
❌ Gradle check result for d9096b2: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
Sure, this is a draft PR to prove this can fix the issue, but this fix has drawbacks, e.g. it changes the |
7ce9886
to
3e8df68
Compare
❌ Gradle check result for 7ce9886: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
❌ Gradle check result for 3e8df68: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
Signed-off-by: zane-neo <[email protected]>
Signed-off-by: zane-neo <[email protected]>
Signed-off-by: zane-neo <[email protected]>
Signed-off-by: zane-neo <[email protected]>
Signed-off-by: zane-neo <[email protected]>
Signed-off-by: zane-neo <[email protected]>
c81fbc1
to
af9d70d
Compare
❌ Gradle check result for af9d70d: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
…ncy issue caused test failure Signed-off-by: zane-neo <[email protected]>
❌ Gradle check result for 7f24452: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
…ld (#15258) * fix cluster not able to spin up issue when disk usage exceeds threshold Signed-off-by: zane-neo <[email protected]> * Add comment to changes Signed-off-by: zane-neo <[email protected]> * Add UT to ensure the keepAliveThread starts before node starts Signed-off-by: zane-neo <[email protected]> * remove unused imports Signed-off-by: zane-neo <[email protected]> * Fix forbidden API calls check failed issue Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * change setInstance method to static Signed-off-by: zane-neo <[email protected]> * Add countdownlatch in test to coordinate the thread to avoid concureency issue caused test failure Signed-off-by: zane-neo <[email protected]> --------- Signed-off-by: zane-neo <[email protected]> (cherry picked from commit 62081f2) Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
…ld (opensearch-project#15258) * fix cluster not able to spin up issue when disk usage exceeds threshold Signed-off-by: zane-neo <[email protected]> * Add comment to changes Signed-off-by: zane-neo <[email protected]> * Add UT to ensure the keepAliveThread starts before node starts Signed-off-by: zane-neo <[email protected]> * remove unused imports Signed-off-by: zane-neo <[email protected]> * Fix forbidden API calls check failed issue Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * change setInstance method to static Signed-off-by: zane-neo <[email protected]> * Add countdownlatch in test to coordinate the thread to avoid concureency issue caused test failure Signed-off-by: zane-neo <[email protected]> --------- Signed-off-by: zane-neo <[email protected]>
…ld (opensearch-project#15258) * fix cluster not able to spin up issue when disk usage exceeds threshold Signed-off-by: zane-neo <[email protected]> * Add comment to changes Signed-off-by: zane-neo <[email protected]> * Add UT to ensure the keepAliveThread starts before node starts Signed-off-by: zane-neo <[email protected]> * remove unused imports Signed-off-by: zane-neo <[email protected]> * Fix forbidden API calls check failed issue Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * change setInstance method to static Signed-off-by: zane-neo <[email protected]> * Add countdownlatch in test to coordinate the thread to avoid concureency issue caused test failure Signed-off-by: zane-neo <[email protected]> --------- Signed-off-by: zane-neo <[email protected]>
… threshold (opensearch-project#15258)" This reverts commit 62081f2.
… threshold (opensearch-project#15258)" This reverts commit 62081f2.
…x the issue. Signed-off-by: zane-neo <[email protected]>
Signed-off-by: zane-neo <[email protected]>
…ld (opensearch-project#15258) * fix cluster not able to spin up issue when disk usage exceeds threshold Signed-off-by: zane-neo <[email protected]> * Add comment to changes Signed-off-by: zane-neo <[email protected]> * Add UT to ensure the keepAliveThread starts before node starts Signed-off-by: zane-neo <[email protected]> * remove unused imports Signed-off-by: zane-neo <[email protected]> * Fix forbidden API calls check failed issue Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * change setInstance method to static Signed-off-by: zane-neo <[email protected]> * Add countdownlatch in test to coordinate the thread to avoid concureency issue caused test failure Signed-off-by: zane-neo <[email protected]> --------- Signed-off-by: zane-neo <[email protected]>
…x the issue. (opensearch-project#16377) Signed-off-by: zane-neo <[email protected]>
…ld (opensearch-project#15258) * fix cluster not able to spin up issue when disk usage exceeds threshold Signed-off-by: zane-neo <[email protected]> * Add comment to changes Signed-off-by: zane-neo <[email protected]> * Add UT to ensure the keepAliveThread starts before node starts Signed-off-by: zane-neo <[email protected]> * remove unused imports Signed-off-by: zane-neo <[email protected]> * Fix forbidden API calls check failed issue Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * format code Signed-off-by: zane-neo <[email protected]> * change setInstance method to static Signed-off-by: zane-neo <[email protected]> * Add countdownlatch in test to coordinate the thread to avoid concureency issue caused test failure Signed-off-by: zane-neo <[email protected]> --------- Signed-off-by: zane-neo <[email protected]>
…x the issue. (opensearch-project#16377) Signed-off-by: zane-neo <[email protected]>
Description
root cause:
Changing the code of cluster start part to first start the
keepAliveThread
which is a non-daemon thread to make sure at least one non-daemon thread is running thus the JVM won't exit.Related Issues
#14791
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.