Chaos faults for AWS

Last updated on Sep 22, 2025

Introduction

AWS faults, injected from the EKS cluster, disrupt resources running on different AWS services. To run these AWS chaos experiments, you need to authenticate CE with AWS. You can do this in two ways:

Using secrets: You can use secrets to authenticate CE with AWS regardless of the Kubernetes cluster used for the deployment. This is the Kubernetes-native way of authenticating CE with AWS.
IAM integration: You can authenticate CE with AWS using IAM when chaos is deployed on an EKS cluster. Associate an IAM role with a Kubernetes service account; experiment pods that use that service account then receive the AWS permissions of the role.
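
The two options above can be sketched as Kubernetes manifests. The following Python snippet is a minimal illustration, not the product's own tooling: it builds a Secret that holds an AWS shared-credentials file and a service account annotated for IAM roles on EKS. The secret name, namespace, the `cloud_config.yml` key, and the role ARN are placeholder assumptions; check the CE documentation for the names your installation expects. The `eks.amazonaws.com/role-arn` annotation is the standard EKS mechanism for binding an IAM role to a service account.

```python
import json


def aws_credentials_secret(name, namespace, access_key_id, secret_access_key):
    """Build a Kubernetes Secret manifest holding an AWS credentials file."""
    credentials_file = (
        "[default]\n"
        f"aws_access_key_id = {access_key_id}\n"
        f"aws_secret_access_key = {secret_access_key}\n"
    )
    return {
        "apiVersion": "v1",
        "kind": "Secret",
        "metadata": {"name": name, "namespace": namespace},
        "type": "Opaque",
        # stringData lets the API server handle base64 encoding.
        # The key name below is an assumption, not confirmed by this page.
        "stringData": {"cloud_config.yml": credentials_file},
    }


def irsa_service_account(name, namespace, role_arn):
    """Build a ServiceAccount manifest bound to an IAM role on EKS."""
    return {
        "apiVersion": "v1",
        "kind": "ServiceAccount",
        "metadata": {
            "name": name,
            "namespace": namespace,
            # Standard EKS annotation for IAM roles for service accounts.
            "annotations": {"eks.amazonaws.com/role-arn": role_arn},
        },
    }


if __name__ == "__main__":
    # All values here are placeholders for illustration only.
    secret = aws_credentials_secret(
        "cloud-secret", "chaos", "AKIAEXAMPLE", "example-secret-key"
    )
    account = irsa_service_account(
        "chaos-sa", "chaos", "arn:aws:iam::111122223333:role/chaos-role"
    )
    print(json.dumps(secret, indent=2))
    print(json.dumps(account, indent=2))
```

Because `kubectl apply -f` accepts JSON as well as YAML, the printed manifests can be saved to files and applied directly. Avoid committing real credentials; the IAM option removes the need to store long-lived keys in the cluster at all.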

Here are AWS faults that you can execute and validate.

ALB AZ down

ALB AZ down takes down availability zones (AZs) on a target application load balancer (ALB) for a specific duration.

availability, load balancer

CLB AZ down

CLB AZ down takes down availability zones (AZs) on a target classic load balancer (CLB) for a specific duration.

availability, load balancer

AZ blackhole

AZ blackhole causes a network blackhole by isolating traffic in specific availability zones across an entire region.

zone, blackhole

VPC route misconfiguration

VPC route misconfiguration causes network issues by misconfiguring the route table associated with the target VPC.

vpc, route tables

DynamoDB replication pause

DynamoDB replication pause pauses data replication for DynamoDB tables across multiple locations for the chaos duration.

replication, pause, dynamodb

EBS loss by ID

EBS loss by ID disrupts the state of an EBS volume, identified by its volume ID, by detaching it from the node (EC2 instance) for a certain duration.

loss, id


ALB AZ down

ALB AZ down takes down availability zones (AZs) on a target application load balancer (ALB) for a specific duration, restricting access to those zones while the fault is active.

Use cases

Tests the sanity, availability, and recovery workflows of the application pods attached to the load balancer.
Breaks the connectivity between the ALB and the given zones, which impacts traffic delivery to those zones.
Detaches AZs from the ALB, which disrupts the application's performance.
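
This page does not describe how the fault is implemented, so the following Python sketch only illustrates one plausible way to emulate it: remove the subnets of the target zones from the load balancer, wait for the chaos duration, then restore the original subnets. The function names are hypothetical. The `elbv2` argument is assumed to behave like the client returned by boto3's `client("elbv2")`, of which only `describe_load_balancers` and `set_subnets` are used.

```python
import time

# A regular ALB must keep subnets in at least two availability zones.
MIN_ALB_ZONES = 2


def split_subnets(availability_zones, target_zones):
    """Return (kept, removed) subnet IDs for the given target zone names."""
    targets = set(target_zones)
    kept = [az["SubnetId"] for az in availability_zones
            if az["ZoneName"] not in targets]
    removed = [az["SubnetId"] for az in availability_zones
               if az["ZoneName"] in targets]
    return kept, removed


def alb_az_down(elbv2, load_balancer_arn, target_zones, duration_seconds,
                sleep=time.sleep):
    """Detach the target zones from an ALB for a while, then restore them."""
    described = elbv2.describe_load_balancers(
        LoadBalancerArns=[load_balancer_arn]
    )
    zones = described["LoadBalancers"][0]["AvailabilityZones"]
    original = [az["SubnetId"] for az in zones]
    kept, removed = split_subnets(zones, target_zones)

    if not removed:
        raise ValueError("no target zone is attached to the load balancer")
    if len(kept) < MIN_ALB_ZONES:
        raise ValueError("fault would leave the ALB with fewer than two zones")

    # Inject the fault: keep only the subnets outside the target zones.
    elbv2.set_subnets(LoadBalancerArn=load_balancer_arn, Subnets=kept)
    try:
        sleep(duration_seconds)
    finally:
        # Always restore the original subnets, even if the wait is interrupted.
        elbv2.set_subnets(LoadBalancerArn=load_balancer_arn, Subnets=original)
    return removed
```

Restoring in a `finally` block matters for a chaos tool: an interrupted experiment should not leave the load balancer permanently degraded. The two-zone guard reflects the ALB requirement of subnets in at least two availability zones, so the sketch refuses to run against a load balancer that would fall below that minimum.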
