alerts: increase MimirBucketIndexNotUpdated pending to 5m #11193

Open: NickAnge wants to merge 2 commits into `main` from `nickange/stale-bucket-index-pending-period`

@NickAnge (Contributor) commented Apr 11, 2025

What this PR does

This PR increases the pending period for `MimirBucketIndexNotUpdated` to 5 minutes to reduce the false positives we've observed over the past few weeks. As I understand it, the pending period determines how long the alert condition must hold continuously before the alert fires, so with this change the alert only triggers if the threshold stays exceeded for at least 5 minutes.
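To make the pending-period behaviour described above concrete, here is a minimal sketch of how a Prometheus-style `for` duration gates an alert. It is a simplified model, not Mimir or Prometheus code: the function name `alert_states`, the one-minute evaluation interval, and the sample data are all assumptions made for illustration, and features such as `keep_firing_for` are ignored.

```python
from datetime import timedelta


def alert_states(condition_samples, eval_interval, pending_period):
    """Return the alert state at each rule evaluation.

    condition_samples: booleans, True when the alert expression holds.
    eval_interval:     time between evaluations (hypothetical value).
    pending_period:    how long the condition must hold before firing.
    """
    states = []
    active_for = None  # how long the condition has been continuously true
    for is_true in condition_samples:
        if not is_true:
            # Any evaluation where the condition is false resets the timer.
            active_for = None
            states.append("inactive")
            continue
        if active_for is None:
            active_for = timedelta(0)
        else:
            active_for += eval_interval
        states.append("firing" if active_for >= pending_period else "pending")
    return states


minute = timedelta(minutes=1)
# A 3-minute blip, one healthy evaluation, then a sustained 7-minute breach.
samples = [True] * 3 + [False] + [True] * 7

with_pending = alert_states(samples, minute, 5 * minute)
without_pending = alert_states(samples, minute, timedelta(0))
print(with_pending)
print(without_pending)
```

In this model the short blip never leaves the pending state when the pending period is 5 minutes, while with a pending period of zero it fires on the first evaluation. That is the kind of false positive the change aims to suppress.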

Which issue(s) this PR fixes or relates to

Fixes #

Checklist

  • Tests updated.
  • Documentation added.
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX].
  • about-versioning.md updated with experimental features.

@NickAnge NickAnge changed the title from "alerts: increase stale bucket index pending period" to "alerts: increase MimirBucketIndexNotUpdated threshold by 5m" on Apr 11, 2025
@NickAnge NickAnge changed the title from "alerts: increase MimirBucketIndexNotUpdated threshold by 5m" to "alerts: increase MimirBucketIndexNotUpdated threshold by 7m" on Apr 14, 2025
@NickAnge NickAnge force-pushed the nickange/stale-bucket-index-pending-period branch from cab98fd to 4ce5a3c Compare April 14, 2025 06:55
@NickAnge NickAnge marked this pull request as ready for review April 14, 2025 06:57
@NickAnge NickAnge requested a review from a team as a code owner April 14, 2025 06:57
@NickAnge NickAnge force-pushed the nickange/stale-bucket-index-pending-period branch from 4ce5a3c to 1e2a3d6 Compare April 14, 2025 08:09
@NickAnge NickAnge changed the title from "alerts: increase MimirBucketIndexNotUpdated threshold by 7m" to "alerts: increase MimirBucketIndexNotUpdated pending to 5m" on Apr 14, 2025
@@ -48,6 +48,7 @@
* [ENHANCEMENT] Dashboards: Add panels to the `Mimir / Tenants` and `Mimir / Top Tenants` dashboards showing the rate of gateway requests. #10978
* [ENHANCEMENT] Alerts: Improve `MimirIngesterFailsToProcessRecordsFromKafka` to not fire during forced TSDB head compaction. #11006
* [BUGFIX] Dashboards: fix "Mimir / Tenants" legends for non-Kubernetes deployments. #10891
* [CHANGE] Alerts: Change pending period for `MimirBucketIndexNotUpdated` alert to fire after being 5m active to avoid false positives. #11193

Suggested change
* [CHANGE] Alerts: Change pending period for `MimirBucketIndexNotUpdated` alert to fire after being 5m active to avoid false positives. #11193
* [CHANGE] Alerts: Change pending period for the `MimirBucketIndexNotUpdated` alert to fire after being active for five minutes to avoid false positives. #11193
