4030 Commits

Author SHA1 Message Date
Vladimir Kozhukalov
7ef00681be Add release note template
Change-Id: Ied6af6bf7521a92c70170a62d6ad8b29c731eac0
2025-01-15 03:09:52 -06:00
Vladimir Kozhukalov
628a320c60 Update values_overrides to use images from buildset registry
Recently we moved all overrides to a separate directory and
if we want to test images published to buildset registry
we have to update those overrides before deployment.

Change-Id: I9a515b5ba98be7ee0225fc1c95a35828055383f6
2025-01-13 15:09:24 -06:00
Vasyl Saienko
197c5bed6c [helm-toolkit] Allow to pass raw network policy
Allow to pass raw network policy via values, labels
without spec are ingnored in this case.

values: |
  network_policy:
    myLabel:
      spec:
        <RAW SPEC>
Change-Id: I87fce44f143fbdf9771ad043133dee22daced3f3
2025-01-13 13:28:00 +00:00
Zuul
b2e6f27af4 Merge "[memcached] Allign with security best practices" 2025-01-07 11:47:39 +00:00
Zuul
7544151284 Merge "[memcached] Unhardcode port in exporter" 2025-01-07 00:05:07 +00:00
Zuul
58c4b6d0f5 Merge "[memcached] Enasure liveness probe is enabled" 2025-01-06 23:05:27 +00:00
Zuul
c84f4dbbbd Merge "Delete setup.py to avoid validate_build_sdist" 2025-01-06 21:20:51 +00:00
Zuul
d5fe24c0dd Merge "[memcached] Drop max_surge option" 2025-01-06 21:20:01 +00:00
Zuul
8bf0443c26 Merge "Ensure memcached pods antiaffinity" 2025-01-06 19:34:18 +00:00
Vladimir Kozhukalov
41a2250c07 Delete setup.py to avoid validate_build_sdist
To create git tags we have to submit PRs to
the openstack/releases which checks if a project
contains setup.py file. If it does then the validation
test tries to build sdist package. For openstack-helm
this is not needed.

Change-Id: I3030dcf21d58d54d37b03e2db20004d086dbfaa9
2025-01-06 13:25:54 -06:00
Vasyl Saienko
725dc5518d [memcached] Allign with security best practices
* Add runAsNonRoot directive
* Drop all capabilities
* Mount bianries with 550 and 65534 fsgroup

Change-Id: I0636088b40ce8ebaef84dad017ddbcaaecfc8221
2025-01-06 13:07:53 +00:00
Vasyl Saienko
11915a30a7 [memcached] Unhardcode port in exporter
* Pick up port for exporter from endpoints
* Drop exporter port from service as we should not use
  service that do loadbalancing among pods which are independent

Change-Id: I0408039ba87aca5b8b3c9333644fa0c92f0ca01a
2025-01-01 11:06:45 +00:00
Sergiy Markin
282b3b98df [ceph-osd] Remove wait_for_degraded_objects
This PS removes the wait_for_degraded_objects
function from ceph-osd helm-test script because
not all pgs may be in good condition even if all
osds are up and running. The pgs will get healthy
after complete osd charts set upgrade is complete.

Change-Id: Ia8da3d96e01b765c5cb691dd0af15f36a7292e89
2024-12-31 16:14:58 +00:00
Zuul
367621ad05 Merge "Append metadata suffix when building charts" 2024-12-24 21:52:27 +00:00
Vasyl Saienko
8c6fb7afec [memcached] Enasure liveness probe is enabled
Change-Id: I4980d2e9ec4fbfc8e57bd643b703d37c12b32dfa
2024-12-24 19:12:58 +02:00
Vasyl Saienko
3f230251b4 [memcached] Drop max_surge option
We do not use service proxy to comminicate to memcached.
All services has exact number of endpoints to communicate.
Having max_surge is useless as clients will never use it.

Change-Id: I74a665c96cfc99cbb8d31c4a17700c467c746c9e
2024-12-24 19:12:18 +02:00
Vasyl Saienko
5eb63ac2c1 Ensure memcached pods antiaffinity
Use required* antiaffinity to make sure we do not have
two pods sitting on same node as it does not make any
sense.

Change-Id: I6c0c55733b75eb1bd53eee855907533d672dbf22
2024-12-24 19:11:10 +02:00
Vladimir Kozhukalov
3a4fb2185d Append metadata suffix when building charts
Change-Id: Ic9af11193f097c3bad99b63c63abc5e8dd93de53
2024-12-21 11:24:15 -06:00
Vladimir Kozhukalov
3edbef16f3 [deploy-env] Fix fetching images
Even with the docker proxy cache we often get
jobs failed due to Docker Hub rate limits.
As per recommendation from the Opendev Infra team
let's pull as many as possible images from other
registires.

This PR updates the dnsmasq and nginx images used
for auxiliary purposes during deployments.

Change-Id: I58946e6fc63d726e08d83ea7f96e7fef140ddf21
2024-12-20 14:44:27 -06:00
Vladimir Kozhukalov
672e488519 Update versions of all charts to 2024.2.0
As per agreement with
https://docs.openstack.org/openstack-helm/latest/specs/2025.1/chart_versioning.html

Change-Id: Ia064d83881626452dc3c0cf888128e152692ae77
2024.2.0
2024-12-18 11:05:10 -06:00
Vladimir Kozhukalov
693d3a2686 Update Chart.yaml apiVersion to v2
Change-Id: I66dcaedefd0640f8a7b5343363354ba539d70627
2024-12-16 16:48:03 -06:00
Vladimir Kozhukalov
b600c06628 Enable temporarily disabled jobs
Here I7bfdef3ea2128bbb4e26e3a00161fe30ce29b8e7
we disabled some jobs that involve scripts from
OSH git repo because these scripts had to be
aligned with the new values_overrides location and
directory structure.

Change-Id: I7d0509051c8cd563a3269e21fe09eb56dcdb8f37
2024-12-16 10:29:56 -06:00
Vladimir Kozhukalov
8b29037cec Move values overrides to a separate directory
This is the action item to implement the spec:
doc/source/specs/2025.1/chart_versioning.rst

Also add overrides env variables

- OSH_VALUES_OVERRIDES_PATH
- OSH_INFRA_VALUES_OVERRIDES_PATH

This commit temporarily disables all jobs that involve scripts
in the OSH git repo because they need to be updated to work
with the new values_overrides structure in the OSH-infra repo.
Once this is merged I4974785c904cf7c8730279854e3ad9b6b7c35498
all these disabled test jobs must be enabled.

Depends-On: I327103c18fc0e10e989a17f69b3bff9995c45eb4
Change-Id: I7bfdef3ea2128bbb4e26e3a00161fe30ce29b8e7
2024-12-13 12:04:44 -06:00
Sergiy Markin
2b500465da [ceph] Fix for ceph-osd pods restart
This PS updates ceph-osd pod containers making
sure that osd pods are not stuck at deletion. In
this PS we are taking care of another background
process that has to be terminated by preStop
script.

Change-Id: Icebb6119225b4b88fb213932cc3bcf78d650848f
2024-12-12 01:01:28 +00:00
Sergiy Markin
c98ea9ca61 [ceph] Fix for ceph-osd pods restart
This PS updates ceph-osd pod containers making sure
that osd pods are not stuck at deletion.

It adds missed lifecycle preStop action for log0runner container.

Change-Id: I8d6853a457d3142c33ca6b5449351d9b05ffacda
2024-12-11 19:35:20 +00:00
Sergiy Markin
7811e90f4e [ceph] Fix for ceph-osd pods restart
This PS updates ceph-osd pod containers making sure
that osd pods are not stuck at deletion. Also
added similar approach to add lifecycle ondelete
hook to kill log-runner container process before pod restart.

And added wait_for_degraded_object function to
helm-test pod making sure that newly deployed pod
are joined the ceph cluster and it is safe to go
on with next ceph-osd chart releade upgrade.

Change-Id: Ib31a5e1a82526906bff8c64ce1b199e3495b44b2
2024-12-10 16:36:24 +00:00
Zuul
cd1ee943f9 Merge "Remove tini from ceph-osd chart" 2024-12-10 01:22:01 +00:00
SPEARS, DUSTIN (ds443n)
9c55663c87 Remove tini from ceph-osd chart
Removing tini from ceph daemon as this didn't resolve
an issue with log runner process as will be resolved in
another change in post-apply job.

Change-Id: I4ebb1d12e736d387e6e34354619a532dd50dfeae
2024-12-09 14:42:56 -05:00
Vladimir Kozhukalov
7fddc1a8d3 Bump K8s to v1.31
Change-Id: I384b10ef7b2da42d2227b4134e4ece4c5f9aa6d1
2024-12-02 22:00:51 -06:00
Zuul
daff3cbdea Merge "Remove 2023.1 build jobs" 2024-11-26 23:52:05 +00:00
Zuul
a2910463ac Merge "[mariadb] Add probes for exporter" 2024-11-22 10:20:46 +00:00
Zuul
36efabdba8 Merge "Allow to use default storage class" 2024-11-22 00:53:28 +00:00
Zuul
71bdab2ceb Merge "[mariadb] Add terminationGracePeriodSeconds" 2024-11-21 16:52:35 +00:00
Zuul
d3dd74e9a0 Merge "[mariadb] Use service IP to discover endpoints" 2024-11-21 00:18:46 +00:00
Zuul
b490e77dc2 Merge "[mariadb] Implement mariadb upgrade on start" 2024-11-20 14:52:49 +00:00
Zuul
67ce775ca1 Merge "[mariadb] Avoid using deprecated isAlive" 2024-11-20 04:44:35 +00:00
Vasyl Saienko
10e8b39a91 [mariadb] Add probes for exporter
Implement readiness/liveness probes for exporter

Change-Id: I7e73872dd35b8e6adf67d585e7d4d9250eca70c3
2024-11-19 14:28:01 +00:00
Vasyl Saienko
cae49d7455 Allow to use default storage class
When name of storage class is specified as default, do not add
storageClassName option to let kubernetes pick a default

Change-Id: I25c60e49ba770ce10ea2ec68c3555ffea49848fe
2024-11-19 07:49:37 +00:00
Vasyl Saienko
bb236e0a98 [mariadb] Add terminationGracePeriodSeconds
Allow to set terminationGracePeriodSeconds for server instace to let
more time to shutdown all clients gracefully.
Increase timeout to 600 seconds by default.

Change-Id: I1f4ba7d5ca50d1282cedfacffbe818af7ccc60f2
2024-11-19 07:49:33 +00:00
Vasyl Saienko
c2269d70a2 [mariadb] Use service IP to discover endpoints
It was observed that under certain circumstances
galera instances can use old IP address of the node
after pod restart. This patch changes the value of
wsrep_cluster_address variable - instead of listing
all dns names of the cluster nodes the discovery service
IP address is used. In this case cluster_node_address is set to IP
address instead of DNS name - otherwise SST method will fail.

Co-Authored-By: Oleksii Grudev <ogrudev@mirantis.com>

Change-Id: I8059f28943150785abd48316514c0ffde56dfde5
2024-11-19 07:49:29 +00:00
Vasyl Saienko
65ec71b939 [mariadb] Implement mariadb upgrade on start
Call mysql_upgrade during start to check and upgrade if needed

Change-Id: I9c4ac1a5ea5f492282bb6bb1ee9923b036faa998
2024-11-19 06:17:24 +00:00
Vasyl Saienko
f0ad9daa5a [mariadb] Avoid using deprecated isAlive
The method was deprecated and later dropped, switch to is_alive()

Co-Authored-By: dbiletskiy <dbiletskiy@mirantis.com>

Change-Id: Ie259d0e59c68c9884e85025b1e44bcd347f45eff
2024-11-18 07:41:35 +00:00
Vladimir Kozhukalov
5d943fd72d Remove 2023.1 build jobs
The 2023.1 release is unmaintained since 2024-10-30.
See https://releases.openstack.org/

Change-Id: I8375b16338b172a5875b7a379df085020490305c
2024-11-15 14:40:51 -06:00
Zuul
c77bdd82d4 Merge "Update ceph-osd to be able to use tini" 2024-11-15 18:15:55 +00:00
Zuul
dcddce5bb7 Merge "ovn: fix resources" 2024-11-15 17:59:35 +00:00
Vasyl Saienko
174f6f5bd5 [mariadb] Refactor liveness/readiness probes
* Move all probes into single script to reduce code duplication
* Check free disk percent, fail when we consume 99% to avoid
  data corruption
* Do not restart container when SST is in progress

Change-Id: I6efc7596753dc988aa9edd7ade4d57107db98bdd
2024-11-13 18:37:52 +00:00
Vasyl Saienko
842f0f11dc [mariadb] Give more time on resolving configmap update conflicts
Make 'data too old' timeout dependent on state report interval. Increase
timeout to 5 times of report interval.

Change-Id: I0c350f9e64b65546965002d0d6a1082fd91f6f58
2024-11-13 18:37:48 +00:00
Oleksii Grudev
cb3afe6f85 Prevent TypeError in get_active_endpoint function
Sometimes "endpoints_dict" var can be evaluated to None
resulting in "TypeError: 'NoneType' object is not iterable"
error. This patch catches the exception while getting
list of endpoints and checks the value of
endpoints_dict.  Also the amount of active endpoints is being logged
for debug purposes.

Change-Id: I79f6b0b5ced8129b9a28c120b61e3ee050af4336
2024-11-13 18:37:44 +00:00
Vasyl Saienko
13a683b9c2 [mariadb] Remove useless retries on conflics during cm update
The retries were originally added at [0] but they were never working.
We pass fixed revision that we would like to see during patch to avoid
race condition, into the safe_update_configmap. We can't organize retries
inside function as it will require change of the original revision which
may happen only at upper layer. Revert patch partially.

[0] https://review.opendev.org/c/openstack/openstack-helm-infra/+/788886

Change-Id: I81850d5e534a3cfb3c4993275757c244caec8be9
2024-11-13 18:37:39 +00:00
Vasyl Saienko
ef707fa3f3 [mariadb] Stop running threads on sigkill
Stop monitor cluster and leader election threads on sigkill.
This allows to terminate all threads from start.py and actually
exit earlier than terminationGracePeriod in statefulset.
Drop preStop hook which is redundant with stop_mysqld() function call.

Change-Id: Ibc4b7604f00b1c5b3a398370dafed4d19929fd7d
2024-11-13 18:36:49 +00:00