Chaos faults for AWS

Last updated on Jun 10, 2026

Introduction

AWS faults disrupt resources that run on various AWS services, and you launch them from the EKS cluster. To run these AWS chaos experiments, you need to authenticate CE with AWS. You can do this in two ways:

Using secrets: You can use secrets to authenticate CE with AWS regardless of which Kubernetes cluster hosts the deployment (EKS or otherwise). This is the Kubernetes-native way of authenticating CE with AWS.
IAM integration: You can authenticate CE with AWS using IAM when you have deployed chaos on an EKS cluster. Associate an IAM role with a Kubernetes service account; any experiment pod that uses that service account then receives the role's AWS permissions.
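
As a rough illustration of the two options, the following sketch builds both Kubernetes manifests as Python dictionaries and prints them as JSON, which `kubectl apply -f` accepts. The secret name `cloud-secret`, the key `cloud_config.yml`, the namespace `hce`, and the service account name `chaos-experiment-sa` are assumptions made for this example, so check them against your own installation. The `eks.amazonaws.com/role-arn` annotation is the standard way to link an IAM role to a service account on EKS.

```python
import json


def aws_credentials_secret(access_key_id, secret_access_key,
                           name="cloud-secret", namespace="hce"):
    """Option 1: a Secret holding AWS credentials in credentials-file format.

    The secret name, data key, and namespace are hypothetical defaults.
    """
    credentials_file = (
        "[default]\n"
        f"aws_access_key_id = {access_key_id}\n"
        f"aws_secret_access_key = {secret_access_key}\n"
    )
    return {
        "apiVersion": "v1",
        "kind": "Secret",
        "metadata": {"name": name, "namespace": namespace},
        "type": "Opaque",
        # stringData lets you supply the value without base64-encoding it.
        "stringData": {"cloud_config.yml": credentials_file},
    }


def irsa_service_account(role_arn, name="chaos-experiment-sa", namespace="hce"):
    """Option 2: a ServiceAccount annotated with the IAM role to assume on EKS.

    The service account name and namespace are hypothetical defaults.
    """
    return {
        "apiVersion": "v1",
        "kind": "ServiceAccount",
        "metadata": {
            "name": name,
            "namespace": namespace,
            "annotations": {"eks.amazonaws.com/role-arn": role_arn},
        },
    }


if __name__ == "__main__":
    # Placeholder values only; never commit real credentials.
    secret = aws_credentials_secret("AKIAEXAMPLE", "example-secret-key")
    account = irsa_service_account("arn:aws:iam::111122223333:role/chaos-role")
    print(json.dumps(secret, indent=2))
    print(json.dumps(account, indent=2))
```

With the secret option, long-lived keys are stored in the cluster. With the IAM option, the experiment pod obtains temporary credentials through its service account, which is usually easier to rotate and audit.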

Here are AWS faults that you can execute and validate.

ALB AZ down

ALB AZ down detaches one or more availability zones from an Application Load Balancer for a configurable duration so you can test how clients, target groups, and AZ-aware routing behave when a zone is taken out of the load balancer rotation.

availability, load balancer

CLB AZ down

CLB AZ down disables one or more availability zones on a Classic Load Balancer for a configurable duration so you can test how clients and back-end instances behave when an AZ is removed from the load balancer rotation.

availability, load balancer

AZ blackhole

AZ blackhole isolates network traffic in one or more AWS Availability Zones (optionally scoped to specific VPCs or subnets) for a configurable duration and restores connectivity afterwards, so you can test how multi-AZ workloads handle a zone-level outage.

zone, blackhole

VPC route misconfiguration

VPC route misconfiguration temporarily removes specified CIDR routes from one or more VPC route tables for a configurable duration and restores them afterwards, so you can test how the workload behaves when egress to a Transit Gateway, NAT Gateway, VPC peer, or internet gateway disappears.

vpc, route tables

DynamoDB replication pause

DynamoDB replication pause pauses cross-region replication on one or more Amazon DynamoDB global tables for a configurable duration using an AWS Fault Injection Service (FIS) experiment, so you can test how your application handles a brief stop in multi-region consistency.

replication, pause, dynamodb

EBS loss by ID

EBS loss by ID detaches an EBS volume by volume ID for a configurable duration and reattaches it afterwards, so you can test how a workload behaves when its storage disappears.

loss, id


ALB AZ down

ALB AZ down detaches one or more availability zones from an Application Load Balancer for a configurable duration, then reattaches them, so you can test how multi-AZ workloads behave when a single AZ disappears from the load balancer rotation.

Use cases

Validate AZ-level resilience and DNS-based client failover within the TTL budget.
Confirm remaining AZs absorb redirected traffic without breaching latency SLOs.
Verify cross-zone load balancing behavior and target group re-registration after recovery.
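
Before running the fault, it can help to estimate what the remaining zones will have to absorb. The sketch below is a planning aid only, not the fault's implementation: `plan_az_detach` is a hypothetical helper, the subnet IDs are placeholders, and the load factor assumes traffic is spread evenly across zones. The default floor of two remaining zones is an assumption based on the usual requirement that an Application Load Balancer spans at least two availability zones.

```python
def plan_az_detach(subnet_zones, zones_to_detach, min_remaining_zones=2):
    """Return (subnets_to_keep, load_factor) for a planned AZ detach.

    subnet_zones maps each subnet ID attached to the load balancer to its
    availability zone. load_factor is how much more traffic each remaining
    zone carries, assuming an even spread across zones.
    """
    detach = set(zones_to_detach)
    all_zones = set(subnet_zones.values())

    unknown = detach - all_zones
    if unknown:
        raise ValueError(f"zones not attached to the load balancer: {sorted(unknown)}")

    remaining = all_zones - detach
    if len(remaining) < min_remaining_zones:
        raise ValueError(
            f"only {len(remaining)} zone(s) would remain; "
            f"need at least {min_remaining_zones}"
        )

    subnets_to_keep = sorted(s for s, z in subnet_zones.items() if z not in detach)
    load_factor = len(all_zones) / len(remaining)
    return subnets_to_keep, load_factor


if __name__ == "__main__":
    attached = {
        "subnet-a": "us-east-1a",
        "subnet-b": "us-east-1b",
        "subnet-c": "us-east-1c",
    }
    keep, factor = plan_az_detach(attached, ["us-east-1a"])
    print(keep)    # ['subnet-b', 'subnet-c']
    print(factor)  # 1.5
```

In this example, detaching one of three zones means each remaining zone carries 1.5 times its usual share, which is the headroom to compare against your latency SLOs.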

ALB AZ down

CLB AZ down

AZ blackhole

VPC route misconfiguration

DynamoDB replication pause

EBS loss by ID

EBS loss by tag

EC2 CPU hog

EC2 DNS chaos

EC2 HTTP latency

EC2 HTTP modify body

EC2 HTTP modify header

EC2 HTTP reset peer

EC2 HTTP status code

EC2 IO stress

EC2 memory hog

EC2 network latency

EC2 network loss

EC2 process kill

EC2 stop by ID

EC2 stop by tag

ECS agent stop

ECS container CPU hog

ECS container HTTP latency

ECS container HTTP modify body

ECS container HTTP reset peer

ECS container HTTP status code

ECS container IO stress

ECS container memory hog

ECS container network latency

ECS container network loss

ECS container volume detach

ECS Fargate CPU hog

ECS Fargate memory hog

ECS instance stop

ECS invalid container image

ECS network restrict

ECS task scale

ECS task stop

ECS update container resource limit

ECS update container timeout

ECS update task role

Generic experiment template

Lambda block TCP connection

Lambda delete event source mapping

Lambda function layer detach

Lambda delete function concurrency

Lambda toggle event mapping state

Lambda update function memory

Lambda update function timeout

Lambda inject latency

Lambda inject status code

Lambda update role permission

Lambda modify response body

NLB AZ down

RDS instance delete

RDS instance reboot

Resource access restrict

SSM chaos by ID

SSM chaos by tag

Windows EC2 blackhole chaos

Windows EC2 CPU hog

Windows EC2 memory hog

Windows EC2 network latency

Windows EC2 network loss

Windows EC2 process kill