Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Track cortex_query_frontend_enqueue_duration_seconds by query-scheduler #6120

Merged

Conversation

pracucci
Copy link
Collaborator

@pracucci pracucci commented Sep 25, 2023

What this PR does

This PR addresses a feedback received here. In this PR I proposes to track cortex_query_frontend_enqueue_duration_seconds by query-scheduler address, to have better visibility in case the slowdown is caused by a specific scheduler.

I've also took the opportunity to customise the cortex_query_frontend_enqueue_duration_seconds histogram buckets, to track 1ms latency too and removing any bucket bigger than 1s.

Which issue(s) this PR fixes or relates to

N/A

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

Copy link
Contributor

@charleskorn charleskorn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, comment is non-blocking

@@ -68,10 +68,10 @@ func newFrontendSchedulerWorkers(cfg Config, frontendAddress string, requestsCh
Name: "cortex_query_frontend_workers_enqueued_requests_total",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we remove or deprecate this metric? We'll be able to get the same information at with cortex_query_frontend_enqueue_duration_seconds now.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Separate PR, see: #6121. We offer no guarantee on metrics, so no need to deprecate: we can just remove it (moreover it's not used in any of the dashboards / alerts we provide).

Delete the metric when disconnects from a query-scheduler
Customise cortex_query_frontend_enqueue_duration_seconds buckets

Signed-off-by: Marco Pracucci <marco@pracucci.com>
@pracucci pracucci force-pushed the track-query-frontend-enqueue-duration-per-query-scheduler branch from a78e6ce to a1d515a Compare September 26, 2023 14:25
@pracucci pracucci enabled auto-merge (squash) September 26, 2023 14:26
@pracucci pracucci merged commit e04a90f into main Sep 26, 2023
28 checks passed
@pracucci pracucci deleted the track-query-frontend-enqueue-duration-per-query-scheduler branch September 26, 2023 14:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants