Currently resource usage of subprocesses is added
in to their parent's usage. And this will provide
option to override.
Change-Id: I22c36e5a6f354f6318e72798ce9865011a85b2af
An entire rack's OSDs are not being marked out after
down_out interval. This manifested itself during
resiliency testing when all interfaces were brought
down on a control plan host and the down_interval
was surpassed.
Change-Id: I6f4a69ec442c3e768feb7bd74c7d610aa9d4aa67
This repo is now testing docs only with Python 3, so let's make
a few cleanups:
- Remove obsolete sections from setup.cfg
- Switch to using sphinx-build
- Cleanup doc/source/conf.py to remove now obsolete content.
- Use newer openstackdocstheme version
- Remove install_command from tox.ini, the default is fine
Change-Id: Iffd00261e6a15e2dd7c98e96bbcc9db3e86c0c65
In the scenario where grastate values cannot be found, we will set the
configmap to 'None' and log a warning.. This should also prevent a possible
type incompatibility issue in error scenario.
Change-Id: I0fb08b329a3fb05c65bead5781c84a592ae4c263
Signed-off-by: Tin Lam <tin@irrational.io>
subprocess.Popen() returns byte object by defect which has issue with
operations that treats thing as str. This ensure Popen() encodes the
return as utf-8 before we do anything.
Change-Id: I321771f69cfcb492be1308c61313a0598b1e766a
Signed-off-by: Tin Lam <tin@irrational.io>
This patch set handles an unexpected IndexError stacktrace when the
galera cluster's data file does not return with an expected key with a
colon (:) in the string.
Change-Id: I4f58e97753a0f68468a02b98676e031176145e44
Signed-off-by: Tin Lam <tin@irrational.io>
This PS updates the bind mounts for ceph logs directorys to be
emptydirs. This ensures we do not polute the hosts permanantly
with ceph logs, which should be directed to stdout.
Change-Id: I6d72c0864b9ecc493cd62564e0e0450d90cfcf00
Signed-off-by: Pete Birley <pete@port.direct>
This change refactors the apparmor job to utilize the feature
gates system instead of relying on separate scripts.
Change-Id: I51b36c1972ff3ee8d4366bf2d5027e433721d740
This is to fix the issue with osd-id file creation since current
logic failing to create /tmp/osd-id file when we restart the pod/container.
Change-Id: Ie9f810fa3a705eccaf625a3cbd91d3d8ee4e05d1
Since apparmor configs are moved to value overrides, removing this.
Change-Id: Ia23c34c2ed76fceb78f68e609066139b69e09e61
Signed-off-by: diwakar thyagaraj <diwakar.chitoor.thyagaraj@att.com>
There are scenarios where the wsrep_rec_pos variable is being returned
without it being first initialized when the .communicate() method
returns a blank. This patchset sets up a default initialization, so
the readiness check does not error out with an exception.
Change-Id: Ifea922f446bf3cbc9220f39a41dffc2763e6a5f3
Signed-off-by: Tin Lam <tin@irrational.io>
This change updates the fluentd chart to use the native
fluent-plugin-prometheus for metric production. This plugin
provides more detailed metrics about fluentd's operations,
specifically regarding input and output statistics.
https://github.com/fluent/fluent-plugin-prometheus
Using the plugin, each fluentd pod produces metrics, so
scape annotations have been added to the pod spec. The
zuul check on metric producers has been updated to account
for this.
Depends-On: https://review.opendev.org/714167
Change-Id: I809356d92b0cff1e31cb2062102bbedefd4843fd
This is to redirect all the logs from daemons to stdout to avoid
accumulating large sized log files on filesystem.
NOTE: The ceph-osd daemon won't work this way and is addressed
separately in https://review.opendev.org/715295. All other Ceph
daemons are included here.
Change-Id: I3045d6e941791aba14979472fac1bca09776d3bf
This is to make enabling autosclaer feature optional from values since
its new feature and few deployments may not required it.
Change-Id: Ie7cbdb71f6a1bf636db2d61ae0f3a4c19af2ca34
This change adds in a log tailer script in a pod that tails the ceph
logs, periodicly truncating them.
Change-Id: Idbe03d4123f86b1b88e277fea6d13f58104f94b0
Signed-off-by: Pete Birley <pete@port.direct>
This Patch Set enables whitelisting of proper field names
of different types(i.e string, integer)in configMap volume.
It makes the pauseImage for daemonjob configurable.
Change-Id: Ia2062c5bc9ba1d8783e9573d7f9ea315c34f7fe7
This is to workaround a pg merging bug in ceph. The number of PGs
is set to a small value. The reason this works is because the
pg autoscaler is set to work unconditionally. This will need
to change once the autoscaler is optional.
Change-Id: I6ec404f281e201023fa974601d7083f4ce3cfd30
This change updates how the gather-prom-metrics playbook role
chooses which services and ports to scrape when gathering metrics
at the end of a zuul build.
We can hit more metric endpoints by finding services with a "metrics"
port, as not all applications have a service labeled "component=exporter"
Change-Id: Ib8db7ea2e7034063eefadad74828d0396407275b
The current podsecuritypolicy job has not been passing for quite
a while. Disable it for now until it's fixed to avoid wasting
infra resources.
Change-Id: I14b184cf03e625cbbaa829a4de101dc2142a7e27