Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[META] Search Query Categorization #11596

Open
deshsidd opened this issue Dec 12, 2023 · 3 comments
Open

[META] Search Query Categorization #11596

deshsidd opened this issue Dec 12, 2023 · 3 comments
Assignees
Labels
enhancement Enhancement or improvement to existing feature or request Meta Meta issue, not directly linked to a PR Roadmap:Stability/Availability/Resiliency Project-wide roadmap label Search Search query, autocomplete ...etc

Comments

@deshsidd
Copy link
Contributor

Overview

Today OpenSearch customers have limited very visibility into the query workload running on a cluster. There is also no easy way to identify patterns in the queries being executed upon an index. This imposes a huge gap while debugging performance issues, tracking changes in data access patterns, or when targeting new feature improvements.

The Query Classification feature (part of the Query Visibility Project) in OpenSearch aims to enhance the platform's capabilities by providing a mechanism to identify patterns, latencies and resource utilization breakdown for the queries being executed upon an index. This will empower users and administrators to optimize query performance and identify query types for better resource allocation and index management.

The primary objective of this proposal is to implement a query classification mechanism within OpenSearch that can categorize and analyze the queries being executed on an index.

We intent to use metric counters to record this information using Metric Framework : #10241

This task has the following Phases and tasks:

Phase 1

Phase 2 (#11040)

Integrations

@deshsidd deshsidd added enhancement Enhancement or improvement to existing feature or request Meta Meta issue, not directly linked to a PR untriaged labels Dec 12, 2023
@deshsidd deshsidd self-assigned this Dec 12, 2023
@deshsidd
Copy link
Contributor Author

@deshsidd deshsidd changed the title [Meta] Search Query Categorization [META] Search Query Categorization Dec 12, 2023
@ansjcy
Copy link
Member

ansjcy commented Dec 12, 2023

Appreciate the summary! These categorization metrics would be valuable for identifying potential patterns in queries in the future. Furthermore, insights dashboards can leverage these metrics to provide additional layers of analysis for the users.

I think right now the big question the community has is "how would those metrics be beneficial for me once the metrics are available". It would be good give some examples illustrating the practical application of these metrics, and how can a user use those metrics and also examples of "insights" the user can get with some example metrics.

@ankitkala ankitkala added Search Search query, autocomplete ...etc and removed Other labels Dec 17, 2023
@macohen
Copy link
Contributor

macohen commented Dec 20, 2023

I agree with @ansjcy. If you took an example query, maybe from a benchmark, and then worked through the example here, what would the dataflow look like? Break down the query, categorize the parts. What metrics would help tune a query? What metrics might indicate an unhealthy cluster? Could this tool be used to prevent any issues with queries?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Enhancement or improvement to existing feature or request Meta Meta issue, not directly linked to a PR Roadmap:Stability/Availability/Resiliency Project-wide roadmap label Search Search query, autocomplete ...etc
Projects
Status: New
Status: Todo
Status: Later (6 months plus)
Development

No branches or pull requests

6 participants