GCP VM disk loss by label disrupts the state of GCP persistent disk volume filtered using a label by detaching it from its VM instance for a specific duration.
- GCP VM disk loss by label fault can be used to determine the resilience of the GKE infrastructure. It helps determine how quickly a node can recover when a persistent disk volume is detached from the VM instance associated with it.
- Kubernetes > 1.16 is required to execute this fault.
- Service account should have editor access (or owner access) to the GCP project.
- Target disk volume should not be a boot disk of any VM instance.
- Disk volumes with the target label should be attached to their respective instances.
- Kubernetes secret should have the GCP service account credentials in the default namespace. Refer generate the necessary credentials in order to authenticate your identity with the Google Cloud Platform (GCP) docs for more information.
|GCP_PROJECT_ID||Id of the GCP project containing the disk volumes.||All the target disk volumes should belong to a single GCP project. For more information, go to GCP project ID.|
|DISK_VOLUME_LABEL||Label of the target non-boot persistent disk volume.|| This value is provided as a |
|ZONES||The zone of the target disk volumes.||Only one zone is provided, which indicates that all target disks reside in the same zone. For more information, go to zones.|
|TOTAL_CHAOS_DURATION||Duration that you specify, through which chaos is injected into the target resource (in seconds).||Defaults to 30s. For more information, go to duration of the chaos.|
|CHAOS_INTERVAL||Time interval between two successive chaos iterations (in seconds).||Defaults to 30s. For more information, go to chaos interval.|
|DISK_AFFECTED_PERC||Percentage of total disks that are filtered using the target label (specify numeric values only).||Defaults to 0 (that corresponds to 1 disk). For more information, go to disk affected percentage.|
|SEQUENCE||Sequence of chaos execution for multiple target disks.||Defaults to parallel. It supports serial sequence as well. For more information, go to sequence of chaos execution.|
|RAMP_TIME||Period to wait before and after injecting chaos (in seconds).||For example, 30s. For more information, go to ramp time.|
GCP project ID
It specifies the project ID which is a unique identifier for a GCP project. Tune it by using the
GCP_PROJECT_ID environment variable.
It specifies the zone of the disk volumes subject to the fault. Tune it by using the
ZONES environment variable.
Detach volumes by label
It specifies the label of disk volumes that are subject to disk loss. It detaches all the disks with the
DISK_VOLUME_LABEL label in the
ZONES zone within the
GCP_PROJECT_ID project. It re-attaches the disk volume after waiting for the duration specified by
TOTAL_CHAOS_DURATION environment variable.
DISK_VOLUME_LABEL accepts only one label and
ZONES accepts only one zone name. Therefore, all the disks must reside in the same zone.
Use the following example to tune it:
- name: gcp-vm-disk-loss-by-label
- name: DISK_VOLUME_LABEL
- name: ZONES
- name: GCP_PROJECT_ID
- name: TOTAL_CHAOS_DURATION