Kubernetes Command Probe Templates

Pre-built Command Probe templates for validating Kubernetes resource health and status during chaos experiments. These templates help you quickly set up probes to monitor pods, nodes, containers, and other Kubernetes resources.

The following Kubernetes probe templates are available for your chaos experiments.

Container Restart Check

Container restart check validates the restart count of a container.

Required Environment Variables:

  • TARGET_LABELS: Comma-separated list of target labels to filter pods
  • TARGET_NAMES: Comma-separated list of target pod names
  • TARGET_NAMESPACE: Namespace of the target pods
  • TARGET_CONTAINER: Name of the container to check restart count
  • CONTAINER_RESTART: Maximum allowed restart count

Use cases:

  • Verify containers don't restart excessively during chaos experiments
  • Monitor container stability during resource stress
  • Validate application resilience to failures
  • Ensure pods maintain healthy restart counts
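The comparison this check performs can be sketched in a few lines of Python. This is a minimal illustration, not the template's actual implementation: it assumes pod data shaped like the `items` of `kubectl get pods -o json`, treats CONTAINER_RESTART as an inclusive upper bound, and uses a hypothetical helper name (`restarts_within_limit`), sample data, and default values.

```python
import os


def restarts_within_limit(pods, container, max_restarts):
    """Return True if `container` restarted at most `max_restarts` times in every pod."""
    for pod in pods:
        for status in pod.get("status", {}).get("containerStatuses", []):
            if status["name"] == container and status["restartCount"] > max_restarts:
                return False
    return True


# Hypothetical sample shaped like the `items` of `kubectl get pods -o json`.
sample_pods = [
    {"metadata": {"name": "web-1"},
     "status": {"containerStatuses": [{"name": "app", "restartCount": 1}]}},
    {"metadata": {"name": "web-2"},
     "status": {"containerStatuses": [{"name": "app", "restartCount": 4}]}},
]

# The fallback values are illustrative only; they are not template defaults.
container = os.environ.get("TARGET_CONTAINER", "app")
max_restarts = int(os.environ.get("CONTAINER_RESTART", "3"))
print(restarts_within_limit(sample_pods, container, max_restarts))
```

Note that a pod without a matching container passes silently in this sketch; a stricter variant could treat that as a failure.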

Node Status Check

Node status check validates the current state of Kubernetes nodes.

Required Environment Variables:

  • TARGET_NODE: Comma-separated list of nodes to be checked
  • TARGET_NODES: Comma-separated list of nodes to be checked
  • NODE_LABEL: Node label to filter nodes (e.g., node-role.kubernetes.io/worker=)

Use cases:

  • Verify nodes remain healthy during chaos experiments
  • Validate node recovery after failures
  • Monitor cluster health during node-level chaos
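One plausible reading of "node state" is the node's `Ready` condition. The sketch below assumes that interpretation and node data shaped like the `items` of `kubectl get nodes -o json`; the helper name `not_ready_nodes` and the sample values are hypothetical, and the real template may check more than this.

```python
def not_ready_nodes(nodes, target_names):
    """Return targeted node names that are missing or lack a Ready=True condition."""
    by_name = {node["metadata"]["name"]: node for node in nodes}
    failing = []
    for name in target_names:
        conditions = by_name.get(name, {}).get("status", {}).get("conditions", [])
        if not any(c["type"] == "Ready" and c["status"] == "True" for c in conditions):
            failing.append(name)
    return failing


# Hypothetical sample shaped like the `items` of `kubectl get nodes -o json`.
sample_nodes = [
    {"metadata": {"name": "node-a"},
     "status": {"conditions": [{"type": "Ready", "status": "True"}]}},
    {"metadata": {"name": "node-b"},
     "status": {"conditions": [{"type": "Ready", "status": "False"}]}},
]

# TARGET_NODES is comma-separated, so split it and drop stray whitespace.
target_nodes = [n.strip() for n in "node-a, node-b,node-c".split(",") if n.strip()]
print(not_ready_nodes(sample_nodes, target_nodes))  # ['node-b', 'node-c']
```

Reporting a missing node as failing is a design choice of this sketch: a node that disappears from the cluster during an experiment is usually worth flagging.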

Pod Replica Count Check

Pod replica count check validates the current replica count of Kubernetes pods.

Required Environment Variables:

  • TARGET_LABELS: Comma-separated list of target labels to filter resources
  • TARGET_NAMES: Comma-separated list of target resource names
  • TARGET_NAMESPACE: Namespace of the target resources
  • TARGET_KIND: Kind of the target resource (e.g., deployment, statefulset)
  • MINIMUM_HEALTHY_REPLICA_COUNT: Minimum healthy replica count for the target

Use cases:

  • Verify deployments maintain desired replica count
  • Validate auto-scaling behavior during load chaos
  • Monitor application availability during pod failures
  • Ensure high availability during chaos experiments
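As a rough sketch of the threshold comparison, the snippet below treats "healthy" as the workload's `status.readyReplicas` value, which both Deployments and StatefulSets report. That mapping is an assumption, as are the helper name `has_minimum_healthy_replicas` and the sample object.

```python
def has_minimum_healthy_replicas(workload, minimum):
    """Return True if the workload reports at least `minimum` ready replicas."""
    # Kubernetes omits readyReplicas from the status when it is zero.
    ready = workload.get("status", {}).get("readyReplicas", 0)
    return ready >= minimum


# Hypothetical sample shaped like `kubectl get deployment web -o json`.
sample_deployment = {
    "kind": "Deployment",
    "metadata": {"name": "web"},
    "spec": {"replicas": 3},
    "status": {"replicas": 3, "readyReplicas": 2},
}

# The value 2 stands in for MINIMUM_HEALTHY_REPLICA_COUNT.
print(has_minimum_healthy_replicas(sample_deployment, 2))  # True
```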

Pod Resource Utilisation Check

Pod resource utilisation check validates the current resource utilisation metrics of Kubernetes pods.

Required Environment Variables:

  • TARGET_LABELS: Comma-separated list of target labels to filter pods
  • TARGET_NAMES: Comma-separated list of target pod names
  • TARGET_NAMESPACE: Namespace of the target pods
  • METRIC_TYPE: Metric type to check (cpu or memory)
  • CPU_LIMIT: CPU usage limit in millicores
  • MEMORY_LIMIT: Memory usage limit in MB

Use cases:

  • Monitor resource usage during stress chaos experiments
  • Verify resource limits are respected
  • Validate application performance under load
  • Ensure pods don't exceed resource thresholds
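The way METRIC_TYPE selects which limit applies can be illustrated as follows. This is a sketch under stated assumptions: usage figures are taken as already converted to millicores (CPU) and MB (memory), and the helper name `pods_over_limit` and the sample numbers are hypothetical. How the template actually collects metrics is not covered here.

```python
def pods_over_limit(usage_by_pod, metric_type, limit):
    """Return the sorted names of pods whose `metric_type` usage exceeds `limit`."""
    if metric_type not in ("cpu", "memory"):
        raise ValueError(f"unsupported METRIC_TYPE: {metric_type!r}")
    return sorted(name for name, usage in usage_by_pod.items()
                  if usage[metric_type] > limit)


# Hypothetical usage: cpu in millicores, memory in MB.
sample_usage = {
    "web-1": {"cpu": 180, "memory": 210},
    "web-2": {"cpu": 640, "memory": 190},
}

# METRIC_TYPE=cpu compares against CPU_LIMIT; METRIC_TYPE=memory against MEMORY_LIMIT.
print(pods_over_limit(sample_usage, "cpu", 500))     # ['web-2']
print(pods_over_limit(sample_usage, "memory", 256))  # []
```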

Pod Startup Time Check

Pod startup time check validates the startup time of Kubernetes pods.

Required Environment Variables:

  • TARGET_LABELS: Comma-separated list of target labels to filter pods
  • TARGET_NAMES: Comma-separated list of target pod names
  • TARGET_NAMESPACE: Namespace of the target pods
  • STARTUP_DURATION_CUTOFF: All pods should start within this duration (in seconds)

Use cases:

  • Validate pods start within acceptable timeframes
  • Monitor deployment performance during rollouts
  • Detect slow startup issues during chaos experiments
  • Ensure application readiness times are optimal
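The source does not say how startup time is measured. One reasonable definition, assumed in this sketch, is the gap between a pod's `metadata.creationTimestamp` and the `lastTransitionTime` of its `Ready` condition. The helper names and sample pods are hypothetical.

```python
from datetime import datetime

TIME_FORMAT = "%Y-%m-%dT%H:%M:%SZ"  # timestamp layout used in Kubernetes object JSON


def startup_seconds(pod):
    """Seconds from pod creation to Ready=True, or None if the pod is not Ready."""
    created = datetime.strptime(pod["metadata"]["creationTimestamp"], TIME_FORMAT)
    for cond in pod.get("status", {}).get("conditions", []):
        if cond["type"] == "Ready" and cond["status"] == "True":
            ready = datetime.strptime(cond["lastTransitionTime"], TIME_FORMAT)
            return (ready - created).total_seconds()
    return None


def all_started_within(pods, cutoff_seconds):
    """Return True only if every pod became Ready within `cutoff_seconds`."""
    times = [startup_seconds(pod) for pod in pods]
    return all(t is not None and t <= cutoff_seconds for t in times)


def make_pod(name, created, ready_at):
    """Build a hypothetical pod object for the example."""
    return {"metadata": {"name": name, "creationTimestamp": created},
            "status": {"conditions": [
                {"type": "Ready", "status": "True", "lastTransitionTime": ready_at}]}}


fast = make_pod("web-1", "2024-01-01T10:00:00Z", "2024-01-01T10:00:12Z")
slow = make_pod("web-2", "2024-01-01T10:00:00Z", "2024-01-01T10:00:45Z")

# The value 30 stands in for STARTUP_DURATION_CUTOFF.
print(all_started_within([fast, slow], 30))  # False, because web-2 took 45 seconds
```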

Pod Status Check

Pod status check validates the current state of Kubernetes pods.

Required Environment Variables:

  • TARGET_LABELS: Comma-separated list of target labels to filter pods
  • TARGET_NAMES: Comma-separated list of target pod names
  • TARGET_NAMESPACE: Namespace of the target pods

Use cases:

  • Verify pods remain in Running state during chaos experiments
  • Validate pod health after failures and restarts
  • Monitor application availability continuously
  • Ensure pods recover to healthy state after disruptions
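A simple way to picture this check is a filter on each pod's `status.phase`. The sketch below assumes that "healthy" means the `Running` phase, as the first use case suggests, and that TARGET_NAMES narrows the set of pods by name. The helper name `pods_not_running` and the sample pods are hypothetical.

```python
def pods_not_running(pods, target_names=None):
    """Return names of pods not in the Running phase, optionally limited to target_names."""
    failing = []
    for pod in pods:
        name = pod["metadata"]["name"]
        if target_names and name not in target_names:
            continue
        if pod.get("status", {}).get("phase") != "Running":
            failing.append(name)
    return failing


# Hypothetical sample shaped like the `items` of `kubectl get pods -o json`.
sample_pods = [
    {"metadata": {"name": "web-1"}, "status": {"phase": "Running"}},
    {"metadata": {"name": "web-2"}, "status": {"phase": "Pending"}},
    {"metadata": {"name": "job-1"}, "status": {"phase": "Succeeded"}},
]

# TARGET_NAMES is comma-separated, so split it before filtering.
target_names = "web-1,web-2".split(",")
print(pods_not_running(sample_pods, target_names))  # ['web-2']
```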

Pod Warnings Check

Pod warnings check scans the events of the target pods for warnings.

Required Environment Variables:

  • TARGET_LABELS: Comma-separated list of target labels to filter pods
  • TARGET_NAMES: Comma-separated list of target pod names
  • TARGET_NAMESPACE: Namespace of the target pods

Use cases:

  • Monitor pod health indicators during chaos experiments
  • Detect configuration issues during experiments
  • Validate application behavior under stress
  • Identify potential problems before they become critical
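Kubernetes events carry a `type` field that is either `Normal` or `Warning`, plus an `involvedObject` reference. Assuming the check looks for `Warning` events attached to the target pods, the filtering could be sketched as below. The helper name `warning_events` and the sample events are hypothetical.

```python
def warning_events(events, pod_names):
    """Return Warning-type events whose involved object is one of the named pods."""
    return [
        event for event in events
        if event.get("type") == "Warning"
        and event["involvedObject"]["kind"] == "Pod"
        and event["involvedObject"]["name"] in pod_names
    ]


# Hypothetical sample shaped like the `items` of `kubectl get events -o json`.
sample_events = [
    {"type": "Normal", "reason": "Pulled",
     "involvedObject": {"kind": "Pod", "name": "web-1"}},
    {"type": "Warning", "reason": "BackOff",
     "involvedObject": {"kind": "Pod", "name": "web-1"}},
    {"type": "Warning", "reason": "FailedScheduling",
     "involvedObject": {"kind": "Pod", "name": "other"}},
]

found = warning_events(sample_events, {"web-1"})
print([event["reason"] for event in found])  # ['BackOff']
```

An empty result would correspond to a passing check under this interpretation.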