13774 Commits

Author SHA1 Message Date
Piotr Parczewski
03a1b9925d Adjust Ceph metrics scrape interval in Prometheus
Enables modifying the interval and sets the recommended default value.

[1] https://docs.ceph.com/en/latest/mgr/prometheus/#configuration

Change-Id: I4b91d184485aa52b3c06011f9dbb6b34bcad3ca8
2024-01-17 21:40:19 +01:00
Zuul
77c18fa615 Merge "CI: Use 2023.2 as previous_release" 2024-01-15 22:00:18 +00:00
Zuul
3490b0f14e Merge "Test haproxy single external frontend" 2024-01-12 21:06:10 +00:00
Zuul
aac86a9248 Merge "CI: Rework docker config vars" 2024-01-12 14:50:39 +00:00
Zuul
3ed60961bb Merge "Fix trove failed to discover swift endpoint" 2024-01-12 11:41:01 +00:00
Zuul
1832eee361 Merge "Fix trove failed to connect rabbitmq - quorum queues support" 2024-01-11 21:48:52 +00:00
Zuul
781e3949f4 Merge "Fix trove failed to connect rabbitmq - durable queues support" 2024-01-11 14:13:45 +00:00
wu.chunyang
9eff43809f Fix trove failed to discover swift endpoint
This change fixes the trove failed to discover swift endpoint
by adding service_credentials in guest-agent.conf

Closes-Bug: #2048829

Change-Id: I185484d2a0d0a2d4016df6acf8a6b0a7f934c237
2024-01-11 10:15:12 +00:00
wu.chunyang
57b24f01f3 Fix trove failed to connect rabbitmq - quorum queues support
This change fixes the trove guest instance failed to connect to
RabbitMQ by adding quorum queues support to oslo_messaging_rabbit
section in guest-agent.conf.

Closes-Bug: #2048822
Change-Id: I94908f8e20981f20fbe4dc18e2091d3798f8b801
2024-01-11 10:14:18 +00:00
wu.chunyang
6b96d098bf Fix trove failed to connect rabbitmq - durable queues support
This change fixes the trove guest instance failed to connect to
RabbitMQ by adding durable queues support to oslo_messaging_rabbit
section in guest-agent.conf.

Partial-Bug: #2048822

Change-Id: I8efc3c92e861816385e6cda3b231a950a06bf57d
2024-01-11 10:11:29 +00:00
Zuul
357db52433 Merge "Enable the Fluentd Plugin Systemd" 2024-01-10 16:00:36 +00:00
Zuul
e30ef79d63 Merge "CI: Test Nova server resize functionality" 2024-01-09 13:53:36 +00:00
Zuul
c78cedfa75 Merge "Fix Nova scp failures on Debian Bookworm" 2024-01-09 13:53:33 +00:00
Zuul
03ec17982c Merge "Update python classifier in setup.cfg" 2024-01-09 08:55:59 +00:00
Zuul
6bbe0987dc Merge "Enable glance proxying behaviour" 2024-01-09 08:45:43 +00:00
Ghanshyam Mann
27f162cfc7 Update python classifier in setup.cfg
As per the current release tested runtime, we test
till python 3.11 so updating the same in python
classifier in setup.cfg

Change-Id: I241e77dbf6bb2085a5bf5d54f9e5b0d2af96fbf3
2024-01-08 20:21:45 -08:00
Pierre Riteau
f86ed0270f CI: Test Nova server resize functionality
This adds an extra resize operation to core OpenStack tests. This should
be fast since we are only increasing the number of cores of the VM and
could help catch additional errors in CI tests.

Change-Id: Ia61b995dbffcda4f1e6494548df457231cb67bd7
2024-01-08 22:15:04 +00:00
Pierre Riteau
bfa9dd97a9 Fix Nova scp failures on Debian Bookworm
The addition of an instance resize operation [1] to CI testing is
triggering a failure in kolla-ansible-debian-ovn jobs, which are using a
nodeset with multiple nodes:

    oslo_concurrency.processutils.ProcessExecutionError: Unexpected error while running command.
    Command: scp -r /var/lib/nova/instances/8ca2c7e8-acae-404c-af7d-6cac38e354b8_resize/disk 192.0.2.2:/var/lib/nova/instances/8ca2c7e8-acae-404c-af7d-6cac38e354b8/disk
    Exit code: 255
    Stdout: ''
    Stderr: "Warning: Permanently added '[192.0.2.2]:8022' (ED25519) to the list of known hosts.\r\nsubsystem request failed on channel 0\r\nscp: Connection closed\r\n"

This is not seen on Ubuntu Jammy, which uses OpenSSH 8.9, while Debian
Bookworm uses OpenSSH 9.2. This is likely related to this change in
OpenSSH 9.0 [2]:

    This release switches scp(1) from using the legacy scp/rcp protocol
    to using the SFTP protocol by default.

Configure sftp subsystem like on RHEL9 derivatives. Even though it is
not yet required for Ubuntu, we also configure it so we are ready for
the Noble release.

[1] https://review.opendev.org/c/openstack/kolla-ansible/+/904249
[2] https://www.openssh.com/txt/release-9.0

Closes-Bug: #2048700
Change-Id: I9f1129136d7664d5cc3b57ae5f7e8d05c499a2a5
2024-01-08 23:12:38 +01:00
Michal Arbet
9ecfcf5a17 Enable glance proxying behaviour
This patch sets URL to glance worker.
If this is set, other glance workers will know how to contact this one
directly if needed. For image import, a single worker stages the image
and other workers need to be able to proxy the import request to the
right one.

With current setup glance image import just not working.

Closes-Bug: #2048525

Change-Id: I4246dc8a80038358cd5b6e44e991b3e2ed72be0e
2024-01-08 16:30:29 +01:00
Zuul
1538092522 Merge "CI: Use ControlPersist and ControlMaster" 2024-01-08 11:49:02 +00:00
Zuul
205fd639b8 Merge "cadvisor: Set housekeeping interval to Prometheus scrape interval" 2024-01-06 08:53:43 +00:00
Mark Goddard
97e5c0e9b1 cadvisor: Set housekeeping interval to Prometheus scrape interval
The prometheus_cadvisor container has high CPU usage. On various
production systems I checked it sits around 13-16% on controllers,
averaged over the prometheus 1m scrape interval. When viewed with top we
can see it is a bit spikey and can jump over 100%.

There are various bugs about this, but I found
https://github.com/google/cadvisor/issues/2523 which suggests reducing
the per-container housekeeping interval. This defaults to 1s, which
provides far greater granularity than we need with the default
prometheus scrape interval of 60s.

Reducing the housekeeping interval to 60s on a production controller
reduced the CPU usage from 13% to 3.5% average. This still seems high,
but is more reasonable.

Change-Id: I89c62a45b1f358aafadcc0317ce882f4609543e7
Closes-Bug: #2048223
2024-01-05 11:02:41 +00:00
Michal Arbet
b1fd2b40f7 Fix long service restarts while using systemd
Some containers exiting with 143 instead of 0, but
this is still OK. This patch just allows
ExitCode 143 (SIGTERM) as fix. Details in
bugreport.

Services which exited with 143 (SIGTERM):

kolla-cron-container.service
kolla-designate_producer-container.service
kolla-keystone_fernet-container.service
kolla-letsencrypt_lego-container.service
kolla-magnum_api-container.service
kolla-mariadb_clustercheck-container.service
kolla-neutron_l3_agent-container.service
kolla-openvswitch_db-container.service
kolla-openvswitch_vswitchd-container.service
kolla-proxysql-container.service

Partial-Bug: #2048130
Change-Id: Ia8c85d03404cfb368e4013066c67acd2a2f68deb
2024-01-05 10:06:56 +01:00
Zuul
39db9a04fe Merge "post-deploy: add public-openrc.sh" 2024-01-04 17:11:59 +00:00
Zuul
288d2f0834 Merge "ironic: Remove enable_ironic_pxe_uefi bits" 2024-01-04 14:29:45 +00:00
Michal Nasiadka
d8700ad013 ironic: Remove enable_ironic_pxe_uefi bits
These were missed in I081aa1345603fa27c390e4e09231a5ff226bcb39

Change-Id: I2884bca3c06ff98004e318757a20b60c12375924
2024-01-04 10:21:35 +00:00
Michal Nasiadka
6daadfdbcb CI: Use 2023.2 as previous_release
Change-Id: I30e9e8c6f59bf2b2f912d70178484ddcd657436e
2024-01-04 10:56:40 +01:00
Mark Goddard
498d324357 Use service-images-pull role for letsencrypt and venus
This reduces code duplication.

Change-Id: Ie529875aaa42435835417468868250bbe4fcf649
2024-01-03 16:51:25 +00:00
Zuul
16928ceddf Merge "Remove nova cell sync comment" 2024-01-03 11:42:52 +00:00
Zuul
2712a7a6a6 Merge "haproxy: Fix single frontend after LE cert path change" 2024-01-03 10:11:26 +00:00
Zuul
3681427b31 Merge "Persist Neutron agent state files in volume" 2024-01-03 09:51:57 +00:00
Zuul
dd78473195 Merge "magnum: Disable CAPI driver when kubeconfig missing" 2024-01-03 09:39:05 +00:00
Michal Nasiadka
9bc99b9434 Test haproxy single external frontend
Change-Id: Id25b4407a8170f69e4cd7278e0aff64c609ace7d
2024-01-03 08:31:14 +00:00
Michal Nasiadka
21e5b21f59 haproxy: Fix single frontend after LE cert path change
I35317ea0343f0db74ddc0e587862e95408e9e106 changed certificate path but omitted
single frontend template.

Change-Id: I638ba32e97234900745df62056710dcc37e7db77
2024-01-02 18:30:42 +01:00
Michal Nasiadka
4879656058 magnum: Disable CAPI driver when kubeconfig missing
Closes-Bug: #2047360
Change-Id: I73490d84da39a74ea7ac493c7dd41fe7bfe2f578
2024-01-02 17:22:39 +00:00
Zuul
65886c1d4e Merge "Fix wsrep sync status task while switched to TCP/IP" 2024-01-02 14:07:22 +00:00
Zuul
eb0e5bac9f Merge "Remove after-Zed TODOs" 2024-01-02 14:07:20 +00:00
Zuul
910ec9272c Merge "post-2023.1: Remove keystone admin endpoint bits" 2023-12-28 17:47:41 +00:00
Zuul
b854a0f9db Merge "Make designate bind9 cmdline configurable" 2023-12-28 15:17:16 +00:00
Michal Nasiadka
982c4d5e5f post-2023.1: Remove keystone admin endpoint bits
Change-Id: I27028ffae26a57d510e1a78c38ead2f925396e81
2023-12-28 09:39:18 +01:00
Michal Nasiadka
65a0cee749 Remove after-Zed TODOs
Change-Id: I081aa1345603fa27c390e4e09231a5ff226bcb39
2023-12-28 08:35:54 +00:00
Zuul
00fc2f85b3 Merge "Set a log retention policy for OpenSearch" 2023-12-21 15:17:32 +00:00
Zuul
418aa9b02a Merge "fluentd: Fix LE pos_file path after version bump" 2023-12-21 11:41:13 +00:00
Zuul
9d1d8448bd Merge "CI: test job definitions when zuul files change" 2023-12-21 11:01:23 +00:00
Doug Szumski
5e5a2dca09 Set a log retention policy for OpenSearch
We previously used ElasticSearch Curator for managing log
retention. Now that we have moved to OpenSearch, we can use
the Index State Management (ISM) plugin which is bundled with
OpenSearch.

This change adds support for automating the configuration of
the ISM plugin via the OpenSearch API. By default, it has
similar behaviour to the previous ElasticSearch Curator
default policy.

Closes-Bug: #2047037

Change-Id: I5c6d938f2bc380f1575ee4f16fe17c6dca37dcba
2023-12-21 10:51:17 +01:00
Alex-Welsh
e9e7362f54 Remove nova cell sync comment
Removed a comment suggesting we use nova-manage db sync --local_cell
when bootstrapping the nova service, since that suggestion has now been
implemented in Kolla. See [1] for more details.

[1]: https://review.opendev.org/c/openstack/kolla/+/902057

Related-Bug: #2045558
Depends-On: Ic64eb51325b3503a14ebab9b9ff2f4d9caec734a
Change-Id: I591f83c4886f5718e36011982c77c0ece6c4cbd7
2023-12-21 09:29:02 +00:00
Michal Nasiadka
85e6432630 CI: Rework docker config vars
Change-Id: I552fea9f9b461e57611f1d2aa5c767a1f4043ff8
2023-12-20 15:40:10 +00:00
Zuul
93725528f8 Merge "Drop redundant note in globals-default.j2" 2023-12-20 11:17:24 +00:00
Michal Nasiadka
bf22f3dd09 fluentd: Fix LE pos_file path after version bump
Change-Id: Ia6db7d6a41ddbda8fcbf563dc55a0c65ef8db9be
2023-12-20 08:18:27 +00:00
Zuul
8fe6be8f4c Merge "Add nova conf for cyborg" 2023-12-20 08:08:16 +00:00