
Component-Specific Troubleshooting

This guide provides a structured approach to diagnosing and resolving common issues with individual components. Before escalating, work through the checks below for the affected component to identify likely causes and fixes. These preliminary checks often resolve the issue and minimize downtime.

If you encounter persistent issues or unexpected behavior that is not resolved using this guide, please reach out to the ScaleOps team for further support.

scaleops-recommender

Ensure the recommender pod is running:

kubectl get pods -n scaleops-system | grep recommender

Check logs for errors:

kubectl logs <recommender-pod-name> -n scaleops-system

Run kubectl describe on the pod and review its status, events, and warnings:

kubectl describe pod <pod-name> -n scaleops-system
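The lookup and log steps above can be combined into a small helper. This is a minimal sketch, not part of ScaleOps: the function names scaleops_pod and scaleops_log_errors are hypothetical, and it assumes the component name (for example, recommender) appears in the pod name, as the grep commands in this guide already do. The 200-line tail and the error/warn filter are arbitrary choices.

```shell
# Print the first pod (as pod/<name>) whose name contains the given string.
scaleops_pod() {
  kubectl get pods -n scaleops-system -o name | grep "$1" | head -n 1
}

# Show recent log lines that mention errors or warnings for a component.
scaleops_log_errors() {
  pod=$(scaleops_pod "$1")
  if [ -z "$pod" ]; then
    echo "no pod matching '$1' found in scaleops-system" >&2
    return 1
  fi
  # The pod/<name> form printed by "-o name" is accepted by kubectl logs.
  kubectl logs "$pod" -n scaleops-system --tail=200 | grep -iE 'error|warn'
}

# Example: scaleops_log_errors recommender
```

If the filter prints nothing, the last 200 lines contained no matching text; that does not by itself mean the component is healthy.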

scaleops-dashboards

Check the pod status:

kubectl get pods -n scaleops-system | grep dashboards

Review logs for errors, such as errors in the external authentication configuration:

kubectl logs <dashboard-pod-name> -n scaleops-system

Run kubectl describe on the pod and review its status, events, and warnings:

kubectl describe pod <pod-name> -n scaleops-system
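When the describe output is long, the events for a single pod can also be listed on their own. The sketch below is an assumption-laden convenience, not a ScaleOps tool: pod_events is a hypothetical function name, and it relies only on the standard kubectl get events command with a field selector.

```shell
# List events for one pod, oldest first.
# Usage: pod_events <pod-name>
pod_events() {
  kubectl get events -n scaleops-system \
    --field-selector "involvedObject.name=$1" \
    --sort-by=.lastTimestamp
}

# Example: pod_events <dashboard-pod-name>
```

Events are retained by the cluster only for a limited time, so an empty list may simply mean that nothing happened recently.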

scaleops-admissions

Verify the mutating webhook configuration:

kubectl get mutatingwebhookconfigurations
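On clusters with many webhooks, it can help to narrow the list and then inspect each webhook's failure policy. The following is a sketch under stated assumptions: the function names are hypothetical, and filtering on the string "scaleops" assumes the webhook configuration name contains it, which this guide does not confirm. If nothing matches, fall back to the full list.

```shell
# Print the bare names of mutating webhook configurations that contain
# "scaleops" (assumed naming; adjust the pattern for your install).
scaleops_webhooks() {
  kubectl get mutatingwebhookconfigurations -o name \
    | grep -i scaleops \
    | sed 's|.*/||'
}

# Print each webhook entry and its failurePolicy for one configuration.
# Usage: webhook_failure_policies <configuration-name>
webhook_failure_policies() {
  kubectl get mutatingwebhookconfiguration "$1" \
    -o jsonpath='{range .webhooks[*]}{.name}{"\t"}{.failurePolicy}{"\n"}{end}'
}
```

The failurePolicy value (Fail or Ignore) indicates whether pod creation is rejected or allowed to proceed when the webhook cannot be reached.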

Check pod logs for admission errors:

kubectl logs <admissions-pod-name> -n scaleops-system

Confirm RBAC permissions are correctly set.
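One way to spot-check RBAC is to impersonate the pod's service account with kubectl auth can-i. This is only a sketch: scaleops_can_i is a hypothetical helper, the service account name must be read from your own cluster, and the verb and resource in the example are illustrative rather than a list of what the admissions component actually requires. Impersonation also requires that your own user is allowed to use --as.

```shell
# Print the service account used by a pod.
# Usage: pod_service_account <pod-name>
pod_service_account() {
  kubectl get pod "$1" -n scaleops-system \
    -o jsonpath='{.spec.serviceAccountName}'
}

# Ask the API server whether a service account in scaleops-system may
# perform <verb> on <resource> in any namespace. Prints "yes" or "no".
# Usage: scaleops_can_i <service-account> <verb> <resource>
scaleops_can_i() {
  kubectl auth can-i "$2" "$3" \
    --as="system:serviceaccount:scaleops-system:$1" \
    --all-namespaces
}

# Example (illustrative verb and resource):
#   sa=$(pod_service_account <admissions-pod-name>)
#   scaleops_can_i "$sa" get pods
```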

Run kubectl describe on the pod and review its status, events, and warnings:

kubectl describe pod <pod-name> -n scaleops-system

scaleops-updater

Verify the pod status:

kubectl get pods -n scaleops-system | grep updater

Review logs for update-related issues:

kubectl logs <updater-pod-name> -n scaleops-system

Run kubectl describe on the pod and review its status, events, and warnings:

kubectl describe pod <pod-name> -n scaleops-system

scaleops-agent

Verify pod readiness:

kubectl get pods -n scaleops-system | grep agent

Inspect logs for event tracking issues:

kubectl logs <agent-pod-name> -n scaleops-system

Run kubectl describe on the pod and review its status, events, and warnings:

kubectl describe pod <pod-name> -n scaleops-system

Prometheus

Verify that Prometheus is running:

kubectl get pods -n scaleops-system | grep prometheus

Check PVC and storage configurations:

kubectl get pvc -n scaleops-system
kubectl describe pvc <pvc-name> -n scaleops-system
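A quick way to spot storage problems is to list only the claims that are not in the Bound state. The sketch below uses a hypothetical function name, unbound_pvcs, and assumes the default kubectl get pvc column layout, where STATUS is the second column.

```shell
# Print the name and status of every PVC in scaleops-system that is
# not Bound (for example, Pending or Lost).
unbound_pvcs() {
  kubectl get pvc -n scaleops-system --no-headers \
    | awk '$2 != "Bound" { print $1, $2 }'
}
```

No output means every claim in the namespace is Bound. Any claim that is listed is a good candidate for kubectl describe pvc.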

Review logs for storage-related errors:

kubectl logs <prometheus-pod-name> -n scaleops-system

Run kubectl describe on the pod and review its status, events, and warnings:

kubectl describe pod <pod-name> -n scaleops-system

scaleops-healthcheck

Verify health check pod status:

kubectl get pods -n scaleops-system | grep healthcheck
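The per-component status checks in this guide can be condensed into one pass over the namespace. This is a minimal sketch with a hypothetical function name, unhealthy_pods; it assumes the default kubectl get pods column layout (NAME, READY, STATUS) and treats a pod as unhealthy when it is not Running or when not all of its containers are ready.

```shell
# Print name, readiness, and status for every pod in scaleops-system
# that is not Running or has containers that are not ready.
unhealthy_pods() {
  kubectl get pods -n scaleops-system --no-headers \
    | awk '{
        split($2, ready, "/")   # READY column looks like "1/1"
        if ($3 != "Running" || ready[1] != ready[2]) print $1, $2, $3
      }'
}
```

Pods that finished normally (status Completed) are also listed by this filter, so check each line before treating it as a failure.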