3496 Commits

Author SHA1 Message Date
Ritchie, Frank (fr801x)
4f0f5155e7 Set default python
Python needs to be set to python2 before checking for python3 to prevent
certain test framework errors.

Change-Id: Ifd1ed35160338688d3c723c055ca75cd999e46e0
2022-06-27 17:58:20 +00:00
Zuul
3e1572c959 Merge "[MariaDB] Add liveness probe to restart a pod that got stuck in a transfer wsrep_local_state_comment" 2022-06-22 17:26:49 +00:00
Zuul
5e1a6db7cb Merge "Add missing CRDs for volume snapshots (classes, contents)" 2022-06-22 02:26:32 +00:00
Markin, Sergiy (sm515x)
931ba39e87 [MariaDB] Add liveness probe to restart a pod that got stuck in a transfer wsrep_local_state_comment
The readiness probe that we currently have does not restart a
pod that gets stuck in the transfer state reported by
wsrep_local_state_comment.

root@mariadb-server-2:/# mysql_status_query wsrep_ready
OFF
root@mariadb-server-2:/# mysql_status_query wsrep_connected
ON
root@mariadb-server-2:/# mysql_status_query wsrep_cluster_status
non-Primary
root@mariadb-server-2:/# mysql_status_query wsrep_local_state_comment
Transfer

This change adds a liveness probe that restarts a pod stuck in this state.

Change-Id: I2ccecc75349667fe19c6f7f9dccc2dbbd17d0a5e
2022-06-21 20:32:52 +00:00
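As a rough illustration of the liveness check described in 931ba39e87, the sketch below decides whether a pod looks stuck, using the status values quoted in that commit message. The function name, the dictionary input, and the exact rule (fail only when wsrep_ready is OFF while wsrep_local_state_comment reports a transfer state) are assumptions made for illustration, not the chart's actual probe script.

```python
def should_restart(status):
    """Return True when the pod looks stuck in a transfer state.

    Assumed rule (not taken from the chart): the node is not ready and
    wsrep_local_state_comment starts with "Transfer".
    """
    ready = status.get("wsrep_ready", "").upper() == "ON"
    state = status.get("wsrep_local_state_comment", "")
    return (not ready) and state.startswith("Transfer")


# Values copied from the commit message above.
stuck = {
    "wsrep_ready": "OFF",
    "wsrep_connected": "ON",
    "wsrep_cluster_status": "non-Primary",
    "wsrep_local_state_comment": "Transfer",
}
print(should_restart(stuck))
```

A real probe would presumably also rely on a failure threshold, so that a legitimate state transfer is not interrupted on the first failed check.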
Ruslan Aliev
6addeaf050 Add missing CRDs for volume snapshots (classes, contents)
Also bumps image versions.

Signed-off-by: Ruslan Aliev <raliev@mirantis.com>
Change-Id: I0d9814bd3427055c530f4b2e5de8bd17f36694dc
2022-06-17 14:31:23 -05:00
Zuul
9be2ea22a0 Merge "Add pods watch and list permissions" 2022-06-15 21:16:51 +00:00
Zuul
e92b91829c Merge "[ceph-mon] Allow for unconditional mon restart" 2022-06-15 21:13:32 +00:00
Walter Wahlstedt
9dad642c84 Add pods watch and list permissions
Change-Id: I050bc8df976032b094154a4c6612dd80eb4d54f8
2022-06-15 13:38:56 -05:00
Zuul
1fbdb5baa2 Merge "Add run migrator job prior running grafana pods" 2022-06-15 18:19:45 +00:00
Ruslan Aliev
e99dfc1c84 Add run migrator job prior running grafana pods
During the first run, grafana runs a migrator job, which populates
the necessary fields in the database. Previously, if two or more
grafana pods started up simultaneously, they raced for database
access and one of the pods (in some cases both) would fail,
leaving the grafana database in an incomplete state.

Signed-off-by: Ruslan Aliev <raliev@mirantis.com>
Change-Id: I5a7993b3cad2d48af3f73218d6c61c216520e1c5
2022-06-14 21:31:33 -05:00
Stephen Taylor
d2c8de85c9 [ceph-client] Handle multiple mon versions in the pool job
The mon version check in the rbd-pool job can cause the script to
error and abort if there are multiple mon versions present in the
Ceph cluster. This change chooses the lowest-numbered major version
from the available mon versions when performing the version check
since the check is performed in order to determine the right way to
parse JSON output from a mon query.

Change-Id: I51cc6d1de0034affdc0cc616298c2d2cd3476dbb
2022-06-14 13:35:36 -06:00
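The version selection described in d2c8de85c9 can be sketched as follows. This is a minimal Python illustration, not the shell logic in the rbd-pool job; it assumes the mon versions arrive as a mapping from version string to daemon count, similar to the "mon" section of `ceph versions` output. The function name and the build-id placeholders are hypothetical.

```python
import re


def lowest_mon_major(mon_versions):
    """Return the lowest major version among the given mon version strings."""
    majors = []
    for version_string in mon_versions:
        match = re.search(r"ceph version (\d+)\.", version_string)
        if match:
            majors.append(int(match.group(1)))
    if not majors:
        raise ValueError("no parsable mon version found")
    return min(majors)


# Hypothetical cluster mid-upgrade, with mons on two major versions.
sample = {
    "ceph version 15.2.13 (hypothetical-build-id) octopus (stable)": 1,
    "ceph version 16.2.7 (hypothetical-build-id) pacific (stable)": 2,
}
print(lowest_mon_major(sample))  # 15
```

Choosing the lowest major version means the JSON parsing path matches the oldest mon that might answer the query.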
Stephen Taylor
13fd81b8d9 [ceph-mon] Allow for unconditional mon restart
This change allows mons to be restarted unconditionally by the
ceph-mon chart. This can be useful in upgrade scenarios where
ceph-mon pods need to be forcibly restarted for any reason.

Change-Id: I93a1426c2ca02b060f7a606495893feb2813c142
2022-06-13 14:52:54 -06:00
Schubert Anselme
fce7ca38ae Uplift Mariadb-ingress image to v1.2.0
Change-Id: Ic368517f893c5016793ce5d65b882a43fb2381ec
2022-06-06 09:14:56 -04:00
Dustin Specker
6aa283d60d follow redirects when downloading calico manifests
When attempting to use Calico v3.23, docs.projectcalico.org is
redirected to projectcalico.docs.tigera.io.

Calico v3.20 does not exist at the new URL, so follow redirects
to handle both cases.

Change-Id: I251ced47224f2c1d369cb6acf68784cf3351ade0
2022-06-03 12:04:40 -05:00
Zuul
ee331fd0d3 Merge "Remove kubeadm-aio" 2022-05-20 16:03:29 +00:00
Zuul
44eea70109 Merge "[Devtools] Add .vscode folder to .gitignore" 2022-05-20 04:49:38 +00:00
Zuul
3be1004a3a Merge "Remove gnocchi override in memcache chart" 2022-05-20 04:49:36 +00:00
Markin, Sergiy (sm515x)
8bac49aca5 [Devtools] Add .vscode folder to .gitignore
Change-Id: I177f9bf11fae86576b32249018fae1366f579ddf
2022-05-20 02:34:07 +00:00
Gage Hugo
c1f51b7149 Remove gnocchi override in memcache chart
This change updates the memcache chart to remove an override for
the gnocchi chart that has since been removed.

Change-Id: I73ce7859941fd87cbf0bc734195924e870aac81d
2022-05-19 17:37:26 +00:00
Gage Hugo
ad133069aa Remove kubeadm-aio
As part of the move to helm v3, we migrated our Kubernetes
deployment from a series of playbooks to a bash script. This
has the advantage of being easier to follow and deploy locally,
as well as easier to debug when an issue arises. While the
kubeadm-aio was very useful in the past for lowering run-times,
optimizations in our jobs now appear to mitigate many of the
issues that the AIO lessened.

Change-Id: If0c6a97aed4083307a9b6f5beb4ec525e8103e01
2022-05-19 16:13:12 +00:00
Markin, Sergiy (sm515x)
1f1a2ff527 [MariaDB] Fix backup/restore scripts for MariaDB 10.6
This patch adds the sys database to the list of databases
ignored by the backup/restore scripts in the mariadb chart.

Change-Id: Ida7965bc583ada2c7ca4800c8ff5d6761fb3913a
2022-05-19 00:26:49 +00:00
Zuul
88f5713100 Merge "Unpin ansible version in start script" 2022-05-17 21:09:42 +00:00
Gage Hugo
b412d729fd Unpin ansible version in start script
We previously pinned the version of ansible we deployed in order
to avoid issues when upgrading which would cause gate failures.

This change removes the pinned version since it appears to be
more stable now.

Change-Id: Iabe516273bb68444340f06ad652d007d707cf888
2022-05-16 20:48:26 +00:00
Markin, Sergiy (sm515x)
322e5b8ccb [MariaDB] Fix ingress cluster role privileges
This patchset adds the update privilege to the ingress cluster role so that it can update the mariadb state configmap. The problem appeared after upgrading the nginx controller to v1.1.3 in https://review.opendev.org/c/openstack/openstack-helm-infra/+/840691

Change-Id: I962ac336bf6b3588db88b04e2259de1aa20b1221
2022-05-13 17:42:32 -05:00
Markin, Sergiy (sm515x)
9d9edbded5 [MariaDB] Fix privileges for mysql-exporter user used by prometheus exporter
Change-Id: I1a2ba8d2525d28d1179a64d5c815e2f32ef56744
2022-05-12 17:35:55 -05:00
Zuul
1e56dd59ea Merge "Add libvirt exporter as a sidecar in the libvirt chart" 2022-05-12 20:49:25 +00:00
Zuul
40847ca4a6 Merge "Migrate CronJob resources to batch/v1 and PodDisruptionBudget resources to policy/v1" 2022-05-12 20:43:31 +00:00
Zuul
5706d6e960 Merge "fix helm-release-status role to use helm3 commands" 2022-05-12 04:08:01 +00:00
Zuul
eb0a997388 Merge "Update ubuntu nodesets to focal" 2022-05-11 17:42:29 +00:00
Zuul
8d27996cc9 Merge "Add clear-firewall role to bandit playbook" 2022-05-11 17:42:26 +00:00
Zuul
f4405d4f1b Merge "[ceph-osd] Remove ceph-mon dependency in ceph-osd liveness probe" 2022-05-11 16:23:15 +00:00
Schubert Anselme
753a32c33d Migrate CronJob resources to batch/v1 and PodDisruptionBudget resources to policy/v1
This change updates the following charts to migrate CronJob resources to the batch/v1 API version, available since v1.21 [0],
and to migrate PodDisruptionBudget resources to the policy/v1 API version, also available since v1.21 [1].

This also uplifts the ingress controller to 1.1.3.

- ceph-client (CronJob)
- cert-rotation (CronJob)
- elasticsearch (CronJob)
- mariadb (CronJob & PodDisruptionBudget)
- postgresql (CronJob)

0: https://kubernetes.io/docs/reference/using-api/deprecation-guide/#cronjob-v125
1: https://kubernetes.io/docs/reference/using-api/deprecation-guide/#poddisruptionbudget-v125

Change-Id: Ia6189b98a86b3f7575dc4678bb3a0cce69562c93
2022-05-10 15:12:53 -04:00
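The API version migration in 753a32c33d amounts to the mapping below. This is only a sketch over manifests loaded as Python dictionaries; the charts themselves are Helm templates, and the function and constant names here are hypothetical.

```python
# Beta API versions and the stable versions that replace them.
MIGRATIONS = {
    ("CronJob", "batch/v1beta1"): "batch/v1",
    ("PodDisruptionBudget", "policy/v1beta1"): "policy/v1",
}


def migrate_api_version(manifest):
    """Return a copy of the manifest with its apiVersion migrated if needed."""
    key = (manifest.get("kind"), manifest.get("apiVersion"))
    new_version = MIGRATIONS.get(key)
    if new_version is None:
        return manifest  # nothing to migrate
    return {**manifest, "apiVersion": new_version}


cronjob = {"apiVersion": "batch/v1beta1", "kind": "CronJob"}
print(migrate_api_version(cronjob)["apiVersion"])  # batch/v1
```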
Zuul
1eeb05397e Merge "Remove gnocchi chart" 2022-05-10 18:20:25 +00:00
root
58becf2886 Add libvirt exporter as a sidecar in the libvirt chart
Users can enable the libvirt exporter sidecar by setting
values.sidecars.libvirt_exporter to true.

Change-Id: I7451aa278982bced3178ac3e001eaad3b63d005d
2022-05-11 02:51:45 +10:00
Dustin Specker
367bed952f fix helm-release-status role to use helm3 commands
Helm3 requires specifying a namespace to retrieve status and values for
a given release.

Before this change, the role would not retrieve any values or statuses.
Now values and statuses are retrieved and saved in the Zuul archive correctly.

Change-Id: If8cf28349b0c14ba6e1de74d49e46ab4a61b93c7
2022-05-10 11:17:53 -05:00
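To make the namespace requirement in 367bed952f concrete, the sketch below builds the Helm 3 commands for one release. The helper name and the example release and namespace are hypothetical; the role itself is an Ansible role, so this only illustrates the command shape.

```python
def helm3_commands(release, namespace):
    """Build the Helm 3 status and values commands for a release."""
    return [
        ["helm", "status", release, "--namespace", namespace],
        ["helm", "get", "values", release, "--namespace", namespace],
    ]


# Hypothetical release and namespace names.
for command in helm3_commands("mariadb", "osh-infra"):
    print(" ".join(command))
```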
Dustin Specker
980d92e6b4 pin containerd to 1.5.11-1
containerd 1.6.4-1 changes something about the cgroup.procs file
(needs investigation) and this causes libvirt's
readiness probes to fail with an error like:

Readiness probe failed: OCI runtime exec failed: exec failed: unable to
start container process: error adding pid 7366 to cgroups: failed to
write 7366: open
/sys/fs/cgroup/unified/kubepods.slice/kubepods-besteffort.slice/kubepods-besteffort-podc2658f11_e579_435c_aab8_d7359f88968f.slice/docker-8cb51ac90cc54e6333028634c4a16592c093ceb0944964e344dcfce9e93f3ed3.scope/cgroup.procs:
no such file or directory: unknown

Change-Id: Icd860b4360d8547480780b271954a20e79d5a4cd
2022-05-06 16:14:42 -05:00
Stephen Taylor
9a37183b26 [ceph-osd] Remove ceph-mon dependency in ceph-osd liveness probe
It is possible for misbehaving ceph-mon pods to cause the ceph-osd
liveness probe to fail for healthy ceph-osd pods, which can cause
healthy pods to get restarted unnecessarily. This change removes
the ceph-mon query from the ceph-osd liveness probe so the probe
is only dependent on ceph-osd state.

Change-Id: I9e1846cfdc5783dbb261583e04ea19df81d143f4
2022-05-06 10:15:45 -06:00
Zuul
8e27278d68 Merge "[ceph-osd] Remove udev interactions from osd-init" 2022-05-03 23:01:52 +00:00
Gage Hugo
6be6d638b3 Update ubuntu nodesets to focal
The openstack-single-node nodeset is still using ubuntu-bionic,
which is nearly 4 years old now. This change updates it to use
the newer ubuntu focal release.

Depends-on: https://review.opendev.org/c/openstack/openstack-helm-infra/+/839996
Depends-on: https://review.opendev.org/c/openstack/openstack-helm-infra/+/840370

Change-Id: Ia43cb31e13bc059541116064aa2092526186b831
2022-05-03 19:57:15 +00:00
Gage Hugo
690258b221 Add clear-firewall role to bandit playbook
This change adds the clear-firewall role to the osh-infra-bandit
playbook to resolve an issue with coredns not coming up when
this job is run on an ubuntu focal node.

Change-Id: I189ceff30271f3a478aff697b84709b19d0b09fc
2022-05-03 14:55:46 -05:00
Gage Hugo
4dafe7e254 Remove gnocchi chart
As part of the discussions from the last several PTGs, the topic
of removing un-maintained charts was brought up as an action item.

Since gnocchi hasn't seen much maintenance outside of general
infrastructure maintenance and no one has stepped up with any
interest in maintaining it, this change removes the gnocchi
chart from openstack-helm-infra.

Change-Id: I9c68457df1243370ef606924b4f776870baedd07
2022-05-03 14:51:59 -05:00
Vladimir Sigunov (vs422h)
89d290d3da [Database] Remote backup should keep given number of backup days
Fixes a minor variable-naming issue that prevented the script from
complying with the backup retention policy.

Change-Id: Ic241310a66af92ee423f5c762c413af7d6d53f0b
2022-05-03 13:51:41 -04:00
Zuul
e568acf53d Merge "Update default image value in shaker" 2022-05-03 15:26:28 +00:00
Zuul
a4b3a63253 Merge "Update default image values for mariadb" 2022-05-03 15:26:01 +00:00
Zuul
437d79878f Merge "Update default image value for prometheus" 2022-05-03 15:23:32 +00:00
Stephen Taylor
e02dc3da44 [ceph-osd] Remove udev interactions from osd-init
There are bugs with containerizing certain udev operations in some
udev versions. The osd-init container can hang in these
circumstances, so the osd-init scripts are modified not to use
these problematic operations.

Change-Id: I6b39321b849f5fbf1b6f2097c6c57ffaebe68121
2022-04-29 14:44:32 -06:00
Gage Hugo
d99955ccff Update default image value in shaker
This change updates the default image value in the shaker chart
from newton to wallaby.

Change-Id: Icf638a0d896fc77aaf43f4d82ed2bd82aef13328
2022-04-28 17:23:26 +00:00
Gage Hugo
78c3c80c25 Update image default value for fluentd chart
This change updates the default image value in the fluentd chart,
where an openstack image is used, from newton to wallaby.

Change-Id: I794062bee9e5d316e1c754544c0970da93e46dbf
2022-04-28 17:23:10 +00:00
Gage Hugo
711d5706dd Update default image value for prometheus
This change updates the default image value in the prometheus
chart from newton to wallaby for the helm_test image.

Change-Id: I0f70734a8455661f7705baeed3cafbaf529c56a8
2022-04-28 17:23:04 +00:00
Gage Hugo
20d7d55f33 Update default image values for mariadb
This change updates the default image values in the mariadb chart
to wallaby for the ones that use openstack images.

Change-Id: Id28da22932362c0400766a564b382ddbcada8c61
2022-04-28 17:22:09 +00:00