d6cfd78c4d
The checkPGs script is implemented to check the Object Storage Daemons (OSDs) in Placement Groups (PGs) of ceph pools to make sure OSDs were not allocated from the same failure domain. This script is intended to run from any one of the ceph-mon pods. Invoke the checkPGs script with --help to get the details on how to run it. A Kubernetes cron job is created to schedule the execution of this script at a regular interval. The execution frequency is defined in the ceph-mon/values.yaml file. Change-Id: I5d46bc824e88545cde1cc448ae714d7d3c243817
22 lines
611 B
ReStructuredText
22 lines
611 B
ReStructuredText
==============================================
|
|
Resiliency Tests for OpenStack-Helm-Infra/Ceph
|
|
==============================================
|
|
|
|
Mission
|
|
=======
|
|
|
|
The goal of our resiliency tests for `OpenStack-Helm-Infra/Ceph
|
|
<https://github.com/openstack/openstack-helm-infra/tree/master/ceph>`_ is to
|
|
show symptoms of software/hardware failure and provide the solutions.
|
|
|
|
Caveats:
|
|
- Our focus lies on resiliency for various failure scenarios but
|
|
not on performance or stress testing.
|
|
|
|
Software Failure
|
|
================
|
|
* `CRUSH Failure Domain <./failure-domain.html>`_
|
|
|
|
Hardware Failure
|
|
================
|