Component-Specific Troubleshooting
This guide provides a structured approach to diagnosing and resolving common issues. Before escalating any concerns, follow the steps outlined below to identify potential causes and solutions. Using these preliminary checks can help resolve issues efficiently and minimize downtime.
If you encounter persistent issues or unexpected behavior that is not resolved using this guide, please reach out to the ScaleOps team for further support.
scaleops-recommender
Ensure the recommender pod is running:
kubectl get pods -n scaleops-system | grep recommenderCheck logs for errors:
kubectl logs <recommender-pod-name> -n scaleops-systemUse kubectl describe on the pod to see any issues or problems, such as events or warnings.
kubectl describe pod <pod-name> -n scaleops-systemscaleops-dashboards
Check the pod status:
kubectl get pods -n scaleops-system | grep dashboardsReview logs for errors such as external authentication configurations errors.
kubectl logs <dashboard-pod-name> -n scaleops-systemUse kubectl describe on the pod to see any issues or problems, such as events or warnings.
kubectl describe pod <pod-name> -n scaleops-systemscaleops-admissions
Verify the mutating webhook configuration:
kubectl get mutatingwebhookconfigurationsCheck pod logs for admission errors:
kubectl logs <admissions-pod-name> -n scaleops-systemConfirm RBAC permissions are correctly set.
Use kubectl describe on the pod to see any issues or problems, such as events or warnings.
kubectl describe pod <pod-name> -n scaleops-systemscaleops-updater
Verify the pod status:
kubectl get pods -n scaleops-system | grep updaterReview logs for update-related issues:
kubectl logs <updater-pod-name> -n scaleops-systemUse kubectl describe on the pod to see any issues or problems, such as events or warnings.
kubectl describe pod <pod-name> -n scaleops-systemscaleops-agent
Verify pod readiness:
kubectl get pods -n scaleops-system | grep agentInspect logs for event tracking issues:
kubectl logs <agent-pod-name> -n scaleops-systemUse kubectl describe on the pod to see any issues or problems, such as events or warnings.
kubectl describe pod <pod-name> -n scaleops-systemPrometheus
Verify that Prometheus is running:
kubectl get pods -n scaleops-system | grep prometheusCheck PVC and storage configurations:
kubectl get pvc -n scaleops-system
kubectl describe pvc <pvc-name> -n scaleops-systemReview logs for storage-related errors:
kubectl logs <prometheus-pod-name> -n scaleops-systemUse kubectl describe on the pod to see any issues or problems, such as events or warnings.
kubectl describe pod <pod-name> -n scaleops-systemscaleops-healthcheck
Verify health check pod status:
kubectl get pods -n scaleops-system | grep healthcheck