This change removes "ceph-volume inventory" and "ceph-volume lvm
list" commands from the ceph-volume OSD initialization script and
Bluestore start script and replaces them with "pvdisplay" and
"lvs" to retrieve lvm tags directly from lvm volumes instead.
Ceph-volume makes repeated calls to blkid, which is very slow in
some cases and deadlocks in others when there are RBDs mapped on
the host.
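
A minimal sketch of reading lvm tags without ceph-volume, assuming the
tags arrive as the comma-separated string that "lvs --noheadings -o
lv_tags" prints. The helper name parse_lv_tags and the sample tag names
are illustrative assumptions, not taken from the scripts changed here.

```python
def parse_lv_tags(raw):
    """Turn a 'k1=v1,k2=v2' tag string into a dict."""
    tags = {}
    for item in raw.strip().split(","):
        key, sep, value = item.partition("=")
        # Skip empty items and tags that are not key=value pairs.
        if sep and key:
            tags[key] = value
    return tags


# Hypothetical sample output for one logical volume.
sample = "  ceph.osd_id=3,ceph.type=block,ceph.cluster_name=ceph\n"
tags = parse_lv_tags(sample)
print(tags["ceph.osd_id"])
```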
Change-Id: Ia999770d4a59729e38dbb494b34c30e5a1b36a8b
This PS moves the release of the lock out of the if-else statement.
Otherwise the lock is not released in two of the three cases; it is
released only when the last "else" branch is taken. This PS is
supposed to fix that.
It also increases the timeout value.
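
The pattern can be sketched in Python as follows. This is an
illustration only: the real script is not shown in this message, and
the function name, branch contents and the 60 second timeout are
assumptions.

```python
import threading

lock = threading.Lock()


def handle(case, timeout=60):
    """Acquire the lock, run one of three branches, and always release."""
    if not lock.acquire(timeout=timeout):
        raise TimeoutError("could not acquire lock")
    try:
        if case == "a":
            result = "first branch"
        elif case == "b":
            result = "second branch"
        else:
            result = "last branch"
    finally:
        # Released outside the if/elif/else, so every branch frees the lock.
        lock.release()
    return result
```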
Change-Id: I284cff06a34ddb864484d2a63780b4fdb9075a0a
This patch fixes 2 problems with MariaDB backup:
1) If a user with grants to a database has a hyphenated name, the backup
script errors out and the grants for this user won't be saved in the backup.
2) While restoring databases from a backup, if connections are allowed
during the restore operation, there is potential for deadlock. Table
level locks are added to the backup sql file in order to try to prevent
these deadlock situations.
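
For the first problem, one plausible approach is to quote the account
name when building the grants query, so a hyphen is not parsed as an
operator. The sketch below is an assumption about the technique, not
the code in this patch; show_grants_statement is a hypothetical helper.

```python
def show_grants_statement(user, host="%"):
    """Build a SHOW GRANTS statement with the account name quoted."""
    def quote(value):
        # Escape backslashes and single quotes, then wrap in single quotes.
        escaped = value.replace("\\", "\\\\").replace("'", "''")
        return "'" + escaped + "'"

    return "SHOW GRANTS FOR {}@{};".format(quote(user), quote(host))


print(show_grants_statement("backup-user"))
```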
Change-Id: If612e7b9f3f4d75fc67018eea17609f07a0c0b0f
1) Added a new backup container for accessing RGW via OpenStack Swift API.
2) Modified the backup script so that tarballed databases can be sent to the RGW.
3) Added new script to send the database backup to the RGW.
4) Modified the restore script so that databases can be retrieved from the RGW.
5) Added new script to retrieve the database backups from the RGW.
Change-Id: Id17a8fcb63f5614ea038c58acdc256fb4e05f434
Cephfs tests were disabled in order to merge
https://review.opendev.org/695568 due to gate failures that were
blocking it. CephFS isn't used in openstack-helm-infra, so it
wasn't required for that work. This change re-enables the cephfs
tests so we can work through any issues that are causing further
failures.
The issue was fixed in 14.2.8, so this change also upgrades all
daemons to 14.2.8.
(https://tracker.ceph.com/issues/43770)
Change-Id: I376d39b7ee00ccb1ab8046b58f92b19a822272e1
There was a change[0] in zuul that made the docs job fail
if the build/html dir was empty. Our docs job was putting the
built docs in the source file by mistake. This PS fixes
that.
[0] https://review.opendev.org/#/c/721221/
Change-Id: I9ef336cf32dad96ff4a4fcc2743363a8f40f4718
Currently the resource usage of subprocesses is added
to their parent's usage. This change provides an
option to override that behavior.
Change-Id: I22c36e5a6f354f6318e72798ce9865011a85b2af
An entire rack's OSDs are not being marked out after
the down_out interval. This manifested itself during
resiliency testing when all interfaces were brought
down on a control plane host and the down_interval
was surpassed.
Change-Id: I6f4a69ec442c3e768feb7bd74c7d610aa9d4aa67
This repo is now testing docs only with Python 3, so let's make
a few cleanups:
- Remove obsolete sections from setup.cfg
- Switch to using sphinx-build
- Clean up doc/source/conf.py to remove now-obsolete content.
- Use newer openstackdocstheme version
- Remove install_command from tox.ini, the default is fine
Change-Id: Iffd00261e6a15e2dd7c98e96bbcc9db3e86c0c65
In the scenario where grastate values cannot be found, we will set the
configmap to 'None' and log a warning. This should also prevent a possible
type incompatibility issue in the error scenario.
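
A minimal sketch of the fallback, assuming the parsed grastate values
are held in a dict and the configmap data must contain only strings.
The function name configmap_data is hypothetical; the keys are the
usual grastate.dat fields.

```python
import logging

logger = logging.getLogger(__name__)


def configmap_data(grastate):
    """Build string-only configmap data from a dict of grastate values."""
    data = {}
    for key in ("uuid", "seqno", "safe_to_bootstrap"):
        value = grastate.get(key)
        if value is None:
            logger.warning("grastate value %r not found, using 'None'", key)
            # The string 'None', not the None object, keeps every value a str.
            value = "None"
        data[key] = str(value)
    return data
```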
Change-Id: I0fb08b329a3fb05c65bead5781c84a592ae4c263
Signed-off-by: Tin Lam <tin@irrational.io>
subprocess.Popen() returns bytes objects by default, which causes issues
with operations that treat the output as str. This ensures the output of
Popen() is decoded as utf-8 before we do anything with it.
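
The difference can be seen in a short sketch. Passing encoding="utf-8"
is one way to get str output from Popen(); whether this patch uses that
parameter or decodes explicitly is not stated here, and the child
command below is a stand-in.

```python
import subprocess
import sys

# Run a trivial child process; sys.executable stands in for the real command.
proc = subprocess.Popen(
    [sys.executable, "-c", "print('seqno: 5')"],
    stdout=subprocess.PIPE,
    encoding="utf-8",  # without this, communicate() returns bytes
)
out, _ = proc.communicate()

# str operations such as split() with a str separator now work directly.
print(out.strip().split(": ")[1])
```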
Change-Id: I321771f69cfcb492be1308c61313a0598b1e766a
Signed-off-by: Tin Lam <tin@irrational.io>
This patch set handles an unexpected IndexError stack trace raised when
a line in the galera cluster's data file does not contain an expected
key followed by a colon (:).
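
As an illustration of the failure mode, indexing the result of
split(":") raises IndexError when the line has no colon. A guarded
version might look like the sketch below; value_after_colon is a
hypothetical name and not the function touched by this patch set.

```python
def value_after_colon(line):
    """Return the text after the first colon, or None if there is no colon."""
    parts = line.split(":", 1)
    if len(parts) < 2:
        # No colon: line.split(":")[1] would raise IndexError here.
        return None
    return parts[1].strip()
```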
Change-Id: I4f58e97753a0f68468a02b98676e031176145e44
Signed-off-by: Tin Lam <tin@irrational.io>
This PS updates the bind mounts for the ceph log directories to be
emptydirs. This ensures we do not pollute the hosts permanently
with ceph logs, which should be directed to stdout.
Change-Id: I6d72c0864b9ecc493cd62564e0e0450d90cfcf00
Signed-off-by: Pete Birley <pete@port.direct>
This change refactors the apparmor job to utilize the feature
gates system instead of relying on separate scripts.
Change-Id: I51b36c1972ff3ee8d4366bf2d5027e433721d740
This fixes the issue with osd-id file creation: the current logic
fails to create the /tmp/osd-id file when the pod/container is restarted.
Change-Id: Ie9f810fa3a705eccaf625a3cbd91d3d8ee4e05d1
The apparmor configs have moved to value overrides, so this is removed.
Change-Id: Ia23c34c2ed76fceb78f68e609066139b69e09e61
Signed-off-by: diwakar thyagaraj <diwakar.chitoor.thyagaraj@att.com>
There are scenarios where the wsrep_rec_pos variable is being returned
without it being first initialized when the .communicate() method
returns a blank. This patchset sets up a default initialization, so
the readiness check does not error out with an exception.
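
A sketch of the default initialization, assuming the position is
parsed from recovery output containing a "WSREP: Recovered position:"
line. The function name and the regular expression are illustrative
assumptions.

```python
import re

# Log line format assumed from MariaDB/Galera recovery output.
_REC_POS = re.compile(r"WSREP: Recovered position:\s*(\S+)")


def recovered_position(output):
    """Return the recovered position from recovery output, or None."""
    wsrep_rec_pos = None  # default, so the name is bound even for blank output
    for line in output.splitlines():
        match = _REC_POS.search(line)
        if match:
            wsrep_rec_pos = match.group(1)
    return wsrep_rec_pos
```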
Change-Id: Ifea922f446bf3cbc9220f39a41dffc2763e6a5f3
Signed-off-by: Tin Lam <tin@irrational.io>
This change updates the fluentd chart to use the native
fluent-plugin-prometheus for metric production. This plugin
provides more detailed metrics about fluentd's operations,
specifically regarding input and output statistics.
https://github.com/fluent/fluent-plugin-prometheus
Using the plugin, each fluentd pod produces metrics, so
scrape annotations have been added to the pod spec. The
zuul check on metric producers has been updated to account
for this.
Depends-On: https://review.opendev.org/714167
Change-Id: I809356d92b0cff1e31cb2062102bbedefd4843fd
This redirects all daemon logs to stdout to avoid accumulating
large log files on the filesystem.
NOTE: The ceph-osd daemon won't work this way and is addressed
separately in https://review.opendev.org/715295. All other Ceph
daemons are included here.
Change-Id: I3045d6e941791aba14979472fac1bca09776d3bf
This makes enabling the autoscaler feature optional via values, since
it is a new feature and a few deployments may not require it.
Change-Id: Ie7cbdb71f6a1bf636db2d61ae0f3a4c19af2ca34
This change adds a log tailer script in a pod that tails the ceph
logs, periodically truncating them.
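
The idea can be sketched in Python as below. The actual script is not
shown in this message, so the function name and approach are
assumptions. Note that lines written between the read and the truncate
would be lost in this simple form.

```python
import sys


def drain_log(path, out=sys.stdout):
    """Copy the current contents of path to out, then truncate the file."""
    with open(path, "r+", encoding="utf-8", errors="replace") as handle:
        data = handle.read()
        out.write(data)
        # Truncate in place rather than deleting, so the path stays valid
        # for whichever daemon is writing to it.
        handle.seek(0)
        handle.truncate()
    return len(data)
```

A tailer would call drain_log() for each log file in a loop with a
sleep between passes.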
Change-Id: Idbe03d4123f86b1b88e277fea6d13f58104f94b0
Signed-off-by: Pete Birley <pete@port.direct>
This is to work around a pg merging bug in ceph. The number of PGs
is set to a small value. The reason this works is because the
pg autoscaler is set to work unconditionally. This will need
to change once the autoscaler is optional.
Change-Id: I6ec404f281e201023fa974601d7083f4ce3cfd30