10034 Commits

Author SHA1 Message Date
Mark Goddard
e6d2b92200 Restart all nova services after upgrade
During an upgrade, nova pins the version of RPC calls to the minimum
seen across all services. This ensures that old services do not receive
data they cannot handle. After the upgrade is complete, all nova
services are supposed to be reloaded via SIGHUP so that they recheck
the RPC versions of services and switch to the latest version, which
should now be supported by all running services.

Due to a bug [1] in oslo.service, sending services SIGHUP is currently
broken. We replaced the HUP with a restart for the nova_compute
container for bug 1821362, but not other nova services. It seems we need
to restart all nova services to allow the RPC version pin to be removed.

Testing in a Queens to Rocky upgrade, we find the following in the logs:

Automatically selected compute RPC version 5.0 from minimum service
version 30

However, the service version in Rocky is 35.

There is a second issue in that it takes some time for the upgraded
services to update the nova services database table with their new
version. We need to wait until all nova-compute services have done this
before the restart is performed, otherwise the RPC version cap will
remain in place. There is currently no interface in nova available for
checking these versions [2], so as a workaround we use a configurable
delay with a default duration of 30 seconds. Testing showed it takes
about 10 seconds for the version to be updated, so this gives us some
headroom.

This change restarts all nova services after an upgrade, following a
configurable delay that defaults to 30 seconds.
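
As a rough illustration of the reasoning above, the following Python sketch
models the version pin as the minimum service version seen across services,
and the workaround as a fixed delay before the restart. All names here
(pinned_service_version, all_upgraded, restart_after_delay, the host names)
are hypothetical and are not nova or kolla-ansible code; only the service
versions 30 and 35 come from the testing described in this message.

```python
import time


def pinned_service_version(service_versions):
    """Return the minimum service version, which determines the RPC pin."""
    return min(service_versions.values())


def all_upgraded(service_versions, target):
    """True once every service reports at least the target version."""
    return pinned_service_version(service_versions) >= target


def restart_after_delay(restart, delay_seconds=30, sleep=time.sleep):
    """Wait a fixed delay, then call restart() (the workaround sketched here)."""
    sleep(delay_seconds)
    restart()


# Hypothetical state: one compute service has not yet updated its
# row in the services table, so the pin stays at the old version.
versions = {"compute01": 35, "compute02": 30}
print(pinned_service_version(versions))  # 30
versions["compute02"] = 35
print(all_upgraded(versions, target=35))  # True
```

Restarting while any service still reports the old version would leave the
pin at the old value, which is why the delay comes before the restart.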

[1] https://bugs.launchpad.net/oslo.service/+bug/1715374
[2] https://bugs.launchpad.net/nova/+bug/1833542

Change-Id: Ia6fc9011ee6f5461f40a1307b72709d769814a79
Closes-Bug: #1833069
Related-Bug: #1833542
2019-06-27 09:36:20 +00:00
Zuul
860f4ea605 Merge "Include cron_generator script in coverage report" 2019-06-11 08:05:40 +00:00
Zuul
a7d7838689 Merge "fluentd: Configure td-agent only for x86_64" 2019-06-07 22:30:43 +00:00
Doug Szumski
9a402018fc Include cron_generator script in coverage report
Change-Id: Ic011c25c491d7d2e1d1d656d8d1653e14bb2d1f1
2019-06-07 15:01:03 +00:00
Zuul
1151e5802b Merge "Fix the deploy guide build failed" 2019-06-07 13:16:10 +00:00
Zuul
ef9e85470b Merge "Check for 'become' in tasks that use Docker in pep8" 2019-06-07 10:47:24 +00:00
Zuul
888e50f01b Merge "Use become for all docker tasks" 2019-06-07 10:47:23 +00:00
Zuul
629edf0a1c Merge "Fix the document of external ceph for gnocchi" 2019-06-07 09:08:14 +00:00
Zuul
4eb141e160 Merge "Fix monasca grafana organisation check" 2019-06-07 09:08:12 +00:00
Zuul
cbc09a9131 Merge "Add support for elasticsearch TLS and authentication in fluentd" 2019-06-07 09:08:10 +00:00
Zuul
796980aa3f Merge "Add ansible_nodename (system hostname) to /etc/hosts" 2019-06-07 09:08:08 +00:00
Zuul
a4431930c6 Merge "Remove /%(tenant_id)s suffix from v2.1 endpoints" 2019-06-07 09:08:07 +00:00
Zuul
20e1e639bb Merge "[cloudkitty] Remove unused option from configuration file template" 2019-06-07 09:08:06 +00:00
Zuul
0a1ad98105 Merge "Support multi-region discovery of Nova cells" 2019-06-07 09:08:04 +00:00
Zuul
069bdd0415 Merge "Fix docs - simply run the script instead of sourcing it" 2019-06-07 08:53:44 +00:00
Zuul
68a81d48ec Merge "Fix docs - do not mention the old, deprecated CLI clients" 2019-06-07 08:53:42 +00:00
Zuul
01f0f2387d Merge "Hide logs when looping over passwords" 2019-06-07 08:53:40 +00:00
Zuul
fc5a1b4d71 Merge "[heat] Multi-region support for bootstrap" 2019-06-07 08:53:38 +00:00
Zuul
c9bc1f3ad1 Merge "Add blazar to fluentd aggregation" 2019-06-07 08:53:37 +00:00
Zuul
7d8569354e Merge "Update contributor guide" 2019-06-07 08:53:35 +00:00
Zuul
d71f15412a Merge "Add deploy guide stuff to irrelevant file list of zuul" 2019-06-07 08:19:25 +00:00
Mark Goddard
e474c6ca19 Check for 'become' in tasks that use Docker in pep8
We should use become: true for all tasks that use Docker.

Change-Id: I5ce06cc6f2c7403a1c36aadf9e55068c780f05ac
2019-06-06 19:05:01 +01:00
Mark Goddard
b123bf6621 Use become for all docker tasks
Many tasks that use Docker have become specified already, but
not all. This change ensures all tasks that use the following
modules have become:

* kolla_docker
* kolla_ceph_keyring
* kolla_toolbox
* kolla_container_facts

It also adds become for 'command' tasks that use docker CLI.
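
A check of this kind could look roughly like the Python sketch below. It is
not the check added to the pep8 job; it assumes tasks have already been
parsed into dictionaries, and the function names are hypothetical. Only the
four module names are taken from the list above.

```python
DOCKER_MODULES = (
    "kolla_docker",
    "kolla_ceph_keyring",
    "kolla_toolbox",
    "kolla_container_facts",
)


def uses_docker(task):
    """True if the task calls a Docker module or runs the docker CLI."""
    if any(module in task for module in DOCKER_MODULES):
        return True
    command = task.get("command")
    # Assumption: the command is a plain string such as "docker ps".
    return isinstance(command, str) and command.split()[:1] == ["docker"]


def tasks_missing_become(tasks):
    """Return the names of Docker tasks that do not set become: true."""
    return [
        task.get("name", "<unnamed>")
        for task in tasks
        if uses_docker(task) and task.get("become") is not True
    ]


tasks = [
    {"name": "Start container", "kolla_docker": {"action": "start_container"}, "become": True},
    {"name": "List containers", "command": "docker ps"},
    {"name": "Create directory", "file": {"path": "/tmp/example", "state": "directory"}},
]
print(tasks_missing_become(tasks))  # ['List containers']
```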

Change-Id: I4a5ebcedaccb9261dbc958ec67e8077d7980e496
2019-06-06 19:04:58 +01:00
Zuul
ff2b2f44ba Merge "Fix keystone fernet key rotation scheduling" 2019-06-06 16:33:03 +00:00
Zuul
7ab67590f7 Merge "Add unit test for keystone fernet cron generator" 2019-06-06 16:29:19 +00:00
Zuul
2d3344e8cd Merge "Fix Blazar Nova aggregate in multi-region setup" 2019-06-06 15:46:30 +00:00
Zuul
4c4a9dc5fb Merge "Make bootstrap more idempotent" 2019-06-06 14:04:12 +00:00
Radosław Piliszek
c8a57d0c19 Make bootstrap more idempotent
There is no need to touch the sudoers.d file each time.
Creation and mode setting are handled by lineinfile itself.

Change-Id: Ia36e21b04d3a08fab3c748f6298f142c1d73ee6d
Signed-off-by: Radosław Piliszek <radoslaw.piliszek@gmail.com>
2019-06-06 12:34:17 +02:00
Radosław Piliszek
a4d7378009 Fix docs - do not mention the old, deprecated CLI clients
Change-Id: If4c78c5ccefc6ff987b70509379dd91588eeea70
Signed-off-by: Radosław Piliszek <radoslaw.piliszek@gmail.com>
2019-06-06 11:54:08 +02:00
Radosław Piliszek
824378b8e2 Fix docs - simply run the script instead of sourcing it
The script looks like it is meant to be run, and the docs mention
running it rather than sourcing it, yet the examples sourced it.

Change-Id: Ib4492ae01bee11b562022099cee8b06b4e3ee3c1
Signed-off-by: Radosław Piliszek <radoslaw.piliszek@gmail.com>
2019-06-06 11:38:17 +02:00
Mark Goddard
68796d79fa Update contributor guide
... or "what I wish existed when I first became PTL"

Some general improvements to the contributor guide, plus new sections
for PTL duties and release management.

Change-Id: If2f3b7c18de2e6c8d9bac131a16c28c2eeb348f2
2019-06-06 08:59:22 +00:00
Zuul
048e8f80c6 Merge "runonce: do not set hw_firmware_type=uefi" 2019-06-06 02:37:50 +00:00
Zuul
fbad0d9cce Merge "Spelling mistake in manila.conf.j2" 2019-06-05 14:50:43 +00:00
Zuul
f9d40d2d77 Merge "Remove unused [service_credentials] config sections" 2019-06-05 14:19:03 +00:00
Cody Hammock
2c343562ee Add blazar to fluentd aggregation
If Blazar is enabled, ensure that fluentd processes its logs.

Change-Id: If71d5c056c042667388dae8e4ee6d51a5ecab46e
2019-06-05 08:30:14 -05:00
Jason
44da1963ba [heat] Multi-region support for bootstrap
When bootstrapping, Heat was not setting a region explicitly, so it
could default to a region other than the one being deployed.

Change-Id: I0a0596a020fbff91ccc5b9f44f271eab220c88cd
2019-06-05 08:24:07 -05:00
Jason
f20cbf4958 Fix Blazar Nova aggregate in multi-region setup
When registering the Nova aggregate for Blazar, the region always
defaulted to some region (usually the first in the Keystone endpoint
list). Add a region override to ensure we always write to the region
being deployed.

Change-Id: I3f921ac51acab1b1020a459c07c755af7023e026
2019-06-05 08:18:42 -05:00
Jason
30c619d1bc Hide logs when looping over passwords
When Ansible goes into a loop, by default it prints all the keys for
the item it is looping over. Some roles, when setting up the databases,
iterate over an object that includes the database password.

Override the loop label to hide everything but the database name.
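
The idea can be sketched in Python as follows. This is only an analogy for
the loop label behaviour, not Ansible code; the function name and the keys
database_name and database_password are hypothetical.

```python
def loop_label(item):
    """Return the only part of the loop item that is safe to log."""
    return item["database_name"]


# Hypothetical loop items that carry a secret alongside the name.
databases = [
    {"database_name": "nova", "database_password": "not-a-real-password"},
    {"database_name": "heat", "database_password": "not-a-real-password"},
]

for item in databases:
    # Log the label rather than the whole item, so the password stays hidden.
    print(f"ok: (item={loop_label(item)})")
```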

Change-Id: I336a81a5ecd824ace7d40e9a35942a1c853554cd
2019-06-05 08:09:51 -05:00
Jason
328e14253d Support multi-region discovery of Nova cells
In a multi-region environment, each region is deployed separately.
Cell discovery, however, would sometimes fail because it picked a region
different from the one being deployed. Most likely, an internal endpoint
for region A will not be visible from region B. Furthermore, it is not
very useful to discover hosts in a region you're not modifying.

This changes the check to only run against nova compute services located
in the region being deployed.

Change-Id: I21eb1164c2f67098b81edbd5cc106472663b92cb
2019-06-05 08:07:13 -05:00
Gaetan Trellu
557193a7ec Improve Qinling documentation
- Remove trusted_cidrs, which has just been removed from the
Qinling code.
- Remove use_api_certificate because it's true by default
- Improve list syntax
- Add etcd section

Change-Id: I0426a9d61fbeaa23a1affbc7e981a78283e88263
2019-06-05 07:56:57 -04:00
Luka Peschke
288b9e06f4 [cloudkitty] Remove unused option from configuration file template
The unused "services" option has been removed from the [collect] section.

Change-Id: Id99fc99e1dff2b8f953b242abda2174cbbecd11d
2019-06-04 11:47:12 +02:00
chenxing
a02de6732e Fix the deploy guide build failed
Depends-On: https://review.opendev.org/662209
Co-authored-by: Stephen Finucane <stephenfin@redhat.com>
Closes-Bug: #1830586
Change-Id: I48d3f4c5f7c04b1ab41b3bad2a6c19e1e9a218fd
2019-06-04 17:06:31 +08:00
Marcin Juszkiewicz
df5a5b6578 runonce: do not set hw_firmware_type=uefi
We made it default in Queens.

Change-Id: I52de6e041bbc4c316d332cbedeca924a7f3c7346
2019-06-04 08:56:57 +02:00
Zuul
9d5b405328 Merge "nova: Fix DBNotAllowed during compute startup" 2019-06-04 03:58:15 +00:00
Zuul
2208b0214e Merge "Adds Qinling Ansible role" 2019-06-03 20:29:41 +00:00
Zuul
0f6cd86f3a Merge "Add support for ceilometer custom pipeline.yaml" 2019-06-03 19:35:55 +00:00
Zuul
3d1b53f301 Merge "Make kolla-ansible support extra volumes" 2019-06-03 18:51:24 +00:00
Zuul
300d9ffcc1 Merge "Add support for ceilometer custom event yaml" 2019-06-03 18:29:39 +00:00
Zuul
c3f011f128 Merge "Enable kolla-ansible to use a custom metrics files with Cloudkitty" 2019-06-03 18:06:57 +00:00
Zuul
d43ca094de Merge "Test Ceph upgrade in CI" 2019-06-03 15:28:02 +00:00