openstack-helm-infra/doc/source/testing/ceph-resiliency/README.rst
Al Lau d6cfd78c4d A script to check the failure domains of OSDs in PGs
The checkPGs script is implemented to check the Object Storage
Daemons (OSDs) in Placement Groups (PGs) of ceph pools to make
sure OSDs were not allocated from the same failure domain.  This
script is intended to run from any one of the ceph-mon pods.

Invoke the checkPGs script with --help to get the details on how
to run it.

A Kubernetes cron job is created to schedule the execution of
this script at a regular interval.  The execution frequency is
defined in the ceph-mon/values.yaml file.

Change-Id: I5d46bc824e88545cde1cc448ae714d7d3c243817
2018-09-06 06:06:28 -07:00

22 lines
611 B
ReStructuredText

==============================================
Resiliency Tests for OpenStack-Helm-Infra/Ceph
==============================================
Mission
=======
The goal of our resiliency tests for `OpenStack-Helm-Infra/Ceph
<https://github.com/openstack/openstack-helm-infra/tree/master/ceph>`_ is to
show symptoms of software/hardware failure and provide the solutions.
Caveats:
- Our focus lies on resiliency for various failure scenarios but
not on performance or stress testing.
Software Failure
================
* `CRUSH Failure Domain <./failure-domain.html>`_
Hardware Failure
================