Troubleshooting

Streamtime provides tools and processes to help you troubleshoot issues in your Kubernetes fleets and Kafka clusters. This guide outlines common troubleshooting steps and resources.

Troubleshooting Features

  • Audit Logs: Access detailed audit logs for all operations and events across your fleets and clusters. Audit logs

  • Alerts: Receive real-time alerts for critical issues, failures, or performance bottlenecks. Alerts

  • AI Health Agent: Streamtime includes an AI-powered health agent that analyzes cluster metrics and provides prompt, actionable resolutions.
    AI Health Agent Metrics

    • You can configure your own AI model for the health agent to tailor troubleshooting to your environment.
    • The health agent continuously monitors metrics and suggests solutions based on detected anomalies. AI Health Agent logs

Use these tools in the Streamtime UI to quickly identify, analyze, and resolve issues in your infrastructure and Kafka workloads.