-
Notifications
You must be signed in to change notification settings - Fork 579
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Move usage stats reporting into redpanda #2707
Comments
Note that this only happens if the user has |
are you thinking in terms of exposing this through the admin api? another option is to place some admin credentials in |
This would be something for redpanda to emit by default, not something exposed for another process. cc @senior7515 for more details. |
@dotnwat i think the functionality should be moved into the c++ binary, in particular the cluster controller really. |
Ah, got it! Thanks for writing all of the context in the issue I didn't realize how reporting works today. I also didn't read until the end of the issue description, so I missed the important part. My bad 🥇 Yup this should be straight forward since we have @Lazin's HTTP client!
For (1) something as simple as the periodic leadership re-balancing service could be used as a model for implementation. @jcsp @mmaslankaprv wow 3 use cases in one day for the hypothetical service we've been discussing. |
Exactly. |
The version tracking is likely to be rolled into an overall cluster status service, so #2735 will probably depend on this. |
Changes needed, besides porting the metrics reporting stuff to redpanda:
K8s:
Rpk:
|
(was: Enabling scram breaks rpk status, which is run by default in our systemd file )
By default today, we begin rpk status with systemd when a user starts redpanda. rpk status uses the Kafka API to scrape the number of topics and number of partitions created on the broker. These two, along with free memory, free space, and cpu% by redpanda, are sent to Vectorized as metrics.
If a user enables scram on a cluster, rpk status will be unable to connect to redpanda to scrape the # of topics and # of partitions. The franz-go client internally retries issuing a metadata request a few times, and after a bit the process will die with an error. During these retries, redpanda will continuously be logging that connections are failing due to auth errors. This spams logs and creates gigabytes in syslog over a day.
Rather than having an external process gather these metrics (rpk), redpanda itself should roll them up and emit them. This would be easier to enable/disable, would start one fewer process, and would be less surprising and more foolproof.
The text was updated successfully, but these errors were encountered: