Troubleshooting
Streamtime provides tools and processes to help you troubleshoot issues in your Kubernetes fleets and Kafka clusters. This guide outlines common troubleshooting steps and resources.
Troubleshooting Features
-
Audit Logs: Access detailed audit logs for all operations and events across your fleets and clusters.
-
Alerts: Receive real-time alerts for critical issues, failures, or performance bottlenecks.
-
AI Health Agent: Streamtime includes an AI-powered health agent that analyzes cluster metrics and provides prompt, actionable resolutions.
- You can configure your own AI model for the health agent to tailor troubleshooting to your environment.
- The health agent continuously monitors metrics and suggests solutions based on detected anomalies.
—
Use these tools in the Streamtime UI to quickly identify, analyze, and resolve issues in your infrastructure and Kafka workloads.