Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Node readyz state takes too long to recover #3743

Closed
awly opened this issue May 19, 2020 · 0 comments
Closed

Node readyz state takes too long to recover #3743

awly opened this issue May 19, 2020 · 0 comments
Assignees
Labels
observability Used for metrics and insight into Teleport.
Milestone

Comments

@awly
Copy link
Contributor

awly commented May 19, 2020

Description

What happened:

After a temporary proxy/auth server downtime, node's /readyz endpoint reports "recovering" state for up to 10min.
The problem is that the current readyz state is updated via certificate rotation checks, which happen every 10min. It should instead be tied to heartbeats, that happen way more frequently.

What you expected to happen:

Node should recover in <1m after proxy/auth server come back online.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
observability Used for metrics and insight into Teleport.
Projects
None yet
Development

No branches or pull requests

2 participants