Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[API Server] API Server restart after Pod recreation in Helm deployment #4771

Open
aylei opened this issue Feb 20, 2025 · 0 comments · May be fixed by #4776
Open

[API Server] API Server restart after Pod recreation in Helm deployment #4771

aylei opened this issue Feb 20, 2025 · 0 comments · May be fixed by #4776
Assignees

Comments

@aylei
Copy link
Collaborator

aylei commented Feb 20, 2025

$ kubectl logs -p skypilot-api-service-7696657ff-mw6pc
Defaulted container "skypilot-api" out of: skypilot-api, create-aws-credentials (init), setup-gcp-credentials (init)
Pre-deploy hook
Failed to connect to SkyPilot API server at http://0.0.0.0:46580. Starting a local server.
sky.exceptions.ApiServerConnectionError: Could not connect to SkyPilot API server at http://127.0.0.1:46580. Please ensure that the server is running. Try: curl http://127.0.0.1:46580/api/health

During handling of the above exception, another exception occurred:

RuntimeError: Failed to start SkyPilot API server at http://0.0.0.0:46580
View logs at: ~/.sky/api_server/server.log
  1. the log should be tailed for diagnosis
  2. startup check can be skipped in helm deployment, since the readiness-probe and liveness-probe do the same work and are tunable for operators
@aylei aylei self-assigned this Feb 20, 2025
@aylei aylei linked a pull request Feb 20, 2025 that will close this issue
5 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant