kolla-ansible

Author	SHA1	Message	Date
zhangmeng	8620a5e4fc	Add [taskflow] section for masakari.conf.j2 Closes-bug: 1966536 Change-Id: I66a0189511e4c937299442207459cf72165649dd	2022-07-20 15:22:23 +08:00
Radosław Piliszek	72b63dfee7	Further Keystone-related cleanups Per comments on [1]. [1] https://review.opendev.org/c/openstack/kolla-ansible/+/843727 Change-Id: I60162b54bc06e158534d29311d4474b34750c64d	2022-06-20 08:40:03 +00:00
Will Szumski	49006e56d9	Add keystone_authtoken.service_type Fixes an issue where access rules failed to validate: Cannot validate request with restricted access rules. Set service_type in [keystone_authtoken] to allow access rule validation I've used the values from the endpoint. This was mostly a straight forward copy and paste, except: - versioned endpoints e.g cinderv3 where I stripped the version - monasca has multiple endpoints associated with a single service. For this, I concatenated logging and monitoring to be logging-monitoring. Closes-Bug: #1965111 Change-Id: Ic4b3ab60abad8c3dd96cd4923a67f2a8f9d195d7	2022-06-09 22:49:38 +02:00
Zuul	b42cc19b57	Merge "Do not use keystone_admin_url et al"	2022-06-01 13:30:18 +00:00
Radosław Piliszek	7ca9349b09	Do not use keystone_admin_url et al Following up on [1]. The 3 variables are only introducing noise after we removed the reliance on Keystone's admin port. [1] I5099b08953789b280c915a6b7a22bdd4e3404076 Change-Id: I3f9dab93042799eda9174257e604fd1844684c1c	2022-05-28 18:19:01 +02:00
Maksim Malchuk	d3dbd812c5	Control Masakari monitors deploy Add a switches to enable/disable deploy of the Masakari monitors. Change-Id: I3ab603f7cab7946ea8f2e063fe91190d6592066a Signed-off-by: Maksim Malchuk <maksim.malchuk@gmail.com>	2022-05-25 15:19:32 +03:00
Radosław Piliszek	3e75a33ad4	Use the new image naming scheme Change-Id: Ib4b15ed4feac82d8492b1c0f0238a752eac668e6	2022-05-23 06:37:25 +00:00
Mark Goddard	5d28a7c89b	masakari: support libvirt SASL in instance monitor Since enabling libvirt SASL authentication, the masakari instance monitor fails to connect to libvirt. We see the following error in logs: libvirt.libvirtError: authentication failed: Failed to start SASL negotiation: -4 (SASL(-4): no mechanism available: No worthy mechs found) This change adds support for SASL authentication in Masakari instance monitor. Depends-On: https://review.opendev.org/c/openstack/kolla/+/834456 Closes-Bug: #1965754 Change-Id: I974046662b383a12ac6281b725523760a96657bd	2022-05-21 13:27:27 +00:00
Marcin Juszkiewicz	1620ab5be9	drop install_type from image names We have only one value for install_type now and it gets removed from image names. Change-Id: I8bf95fd7aa9dd26b80d618ca0fcb097003b4cb0a	2022-04-20 12:29:12 +02:00
Marcin Juszkiewicz	463f10014e	drop binary install type from templates and config As we have only source image type then we do not need to handle other option. Change-Id: I753aa0182cfc975bb8b5cd1476ab2c336a7691fa	2022-04-05 15:31:21 +02:00
Pierre Riteau	56fc74f231	Move project_name and kolla_role_name to role vars Role vars have a higher precedence than role defaults. This allows to import default vars from another role via vars_files without overriding project_name (see related bug for details). Change-Id: I3d919736e53d6f3e1a70d1267cf42c8d2c0ad221 Related-Bug: #1951785	2021-12-31 09:26:25 +00:00
Zuul	42fd0a795e	Merge "Stop creating non-keystone admin endpoints"	2021-12-27 15:06:12 +00:00
Dr. Jens Harbott	479a78706a	Stop creating non-keystone admin endpoints The admin interface for endpoints never had any real use, the functionality was the same as for the public or internal endpoints, except for Keystone. Even for Keystone with API v3 it would no longer really be needed, but it is still being required by some libraries that cannot be changed in order to stay backwards compatible. Signed-off-by: Dr. Jens Harbott <harbott@osism.tech> Change-Id: Icf3bf08deab2c445361f0a0124d87ad8b0e4e9d9	2021-12-21 13:09:36 +01:00
Radosław Piliszek	4e5e9abcd2	Fix wrong distro assumptions It seems some cases were missed in reviews and not fixed by the previous iterations: Ifc252ae793e6974356fcdca810b373f362d24ba5 I838e526b930d5276d3ce24f5188262af7eb33280 Change-Id: Id57da1c5024e1efc5810baca8fbe18967cf95a68	2021-10-22 17:06:10 +00:00
Radosław Piliszek	3c68e82585	Fix Masakari in multi-region deploys to behave like it is most commonly expected - query Nova in the same region. Closes-Bug: #1939291 Change-Id: I584a83d352c747a799b5dab1d3b8159ba3805454	2021-08-20 18:53:46 +00:00
Radosław Piliszek	9ff2ecb031	Refactor and optimise image pulling We get a nice optimisation by using a filtered loop instead of task skipping per service with 'when'. Partially-Implements: blueprint performance-improvements Change-Id: I8f68100870ab90cb2d6b68a66a4c97df9ea4ff52	2021-08-10 11:57:54 +00:00
Zuul	d5b7af30e8	Merge "Fix deployment failure when kolla_dev_mod is enabled"	2021-08-04 13:00:58 +00:00
wu.chunyang	200e36da7d	Fix deployment failure when kolla_dev_mod is enabled trivial fix Change-Id: I43bc11183c2fa9773811a74a93c37cecceed7454	2021-07-21 21:31:52 +08:00
Radosław Piliszek	f71646da18	Fix Masakari host monitor default config Closes-Bug: #1933209 Change-Id: I644ad475ca88aac0c22b14163d33a30193fe706a	2021-07-01 18:22:10 +00:00
Mark Goddard	ade5bfa302	Use ansible_facts to reference facts By default, Ansible injects a variable for every fact, prefixed with ansible_. This can result in a large number of variables for each host, which at scale can incur a performance penalty. Ansible provides a configuration option [0] that can be set to False to prevent this injection of facts. In this case, facts should be referenced via ansible_facts.<fact>. This change updates all references to Ansible facts within Kolla Ansible from using individual fact variables to using the items in the ansible_facts dictionary. This allows users to disable fact variable injection in their Ansible configuration, which may provide some performance improvement. This change disables fact variable injection in the ansible configuration used in CI, to catch any attempts to use the injected variables. [0] https://docs.ansible.com/ansible/latest/reference_appendices/config.html#inject-facts-as-vars Change-Id: I7e9d5c9b8b9164d4aee3abb4e37c8f28d98ff5d1 Partially-Implements: blueprint performance-improvements	2021-06-23 10:38:06 +01:00
Mark Goddard	db517a44e4	masakari: support host monitor Change-Id: I3f43df7766c57622ab8d01a759fbeeef0a0c2b93 Implements: blueprint masakari-hostmonitor Co-Authored-By: Radosław Piliszek <radoslaw.piliszek@gmail.com>	2021-04-08 16:39:47 +00:00
Mark Goddard	0b0dd35837	masakari: fix minor issues with instance monitor * Don't generate masakari.conf for instance monitor * Don't generate masakari-monitors.conf for API or engine * Use a consistent name for dimensions - masakari_instancemonitor_dimensions * Fix source code paths in dev mode Change-Id: I551f93c9bf1ad6712b53c316074ae1df84e4352b	2021-04-07 13:28:01 +00:00
Doug Szumski	647ff667e6	Add variable for changing Apache HTTP timeout In services which use the Apache HTTP server to service HTTP requests, there exists a TimeOut directive [1] which defaults to 60 seconds. APIs which come under heavy load, such as Cinder, can sometimes exceed this which results in a HTTP 504 Gateway timeout, or similar. However, the request can still be serviced without error. For example, if Nova calls the Cinder API to detach a volume, and this operation takes longer than the shortest of the two timeouts, Nova will emit a stack trace with a 504 Gateway timeout. At some time later, the request to detach the volume will succeed. The Nova and Cinder DBs then become out-of-sync with each other, and frequently DB surgery is required. Although strictly this category of bugs should be fixed in OpenStack services, it is not realistic to expect this to happen in the short term. Therefore, this change makes it easier to set the Apache HTTP timeout via a new variable. An example of a related bug is here: https://bugs.launchpad.net/nova/+bug/1888665 Whilst this timeout can currently be set by overriding the WSGI config for individual services, this change makes it much easier. Change-Id: Ie452516655cbd40d63bdad3635fd66693e40ce34 Closes-Bug: #1917648	2021-03-04 11:25:06 +00:00
Zuul	860c32de76	Merge "Revert "Performance: Use import_tasks in the main plays""	2020-12-15 19:52:24 +00:00
Mark Goddard	db4fc85c33	Revert "Performance: Use import_tasks in the main plays" This reverts commit 9cae59be51e8d2d798830042a5fd448a4aa5e7dc. Reason for revert: This patch was found to introduce issues with fluentd customisation. The underlying issue is not currently fully understood, but could be a sign of other obscure issues. Change-Id: Ia4859c23d85699621a3b734d6cedb70225576dfc Closes-Bug: #1906288	2020-12-14 10:36:55 +00:00
Radosław Piliszek	71e9c603b8	Do not set 'always' tag where unnecessary Makes 'import_tasks' not change behaviour compared to 'include_tasks'. Change-Id: I600be7c3bd763b3b924bd4a45b4e7b4dca7a33e3	2020-10-27 19:51:46 +01:00
Radosław Piliszek	9cae59be51	Performance: Use import_tasks in the main plays Main plays are action-redirect-stubs, ideal for import_tasks. This avoids 'include' penalty and makes logs/ara look nicer. Fixes haproxy and rabbitmq not to check the host group as well. Change-Id: I46136fc40b815e341befff80b54a91ef431eabc0 Partially-Implements: blueprint performance-improvements	2020-10-27 19:09:32 +01:00
Radosław Piliszek	3411b9e420	Performance: optimize genconfig Config plays do not need to check containers. This avoids skipping tasks during the genconfig action. Ironic and Glance rolling upgrades are handled specially. Swift and Bifrost do not use the handlers at all. Partially-Implements: blueprint performance-improvements Change-Id: I140bf71d62e8f0932c96270d1f08940a5ba4542a	2020-10-12 19:30:06 +02:00
Zuul	ba933f16e9	Merge "Support TLS encryption of RabbitMQ client-server traffic"	2020-09-29 11:31:03 +00:00
Pierre Riteau	c81772024c	Reduce the use of SQLAlchemy connection pooling When the internal VIP is moved in the event of a failure of the active controller, OpenStack services can become unresponsive as they try to talk with MariaDB using connections from the SQLAlchemy pool. It has been argued that OpenStack doesn't really need to use connection pooling with MariaDB [1]. This commit reduces the use of connection pooling via two configuration options: - max_pool_size is set to 1 to allow only a single connection in the pool (it is not possible to disable connection pooling entirely via oslo.db, and max_pool_size = 0 means unlimited pool size) - lower connection_recycle_time from the default of one hour to 10 seconds, which means the single connection in the pool will be recreated regularly These settings have shown better reactivity of the system in the event of a failover. [1] http://lists.openstack.org/pipermail/openstack-dev/2015-April/061808.html Change-Id: Ib6a62d4428db9b95569314084090472870417f3d Closes-Bug: #1896635	2020-09-22 17:54:45 +02:00
Mark Goddard	761ea9a333	Support TLS encryption of RabbitMQ client-server traffic This change adds support for encryption of communication between OpenStack services and RabbitMQ. Server certificates are supported, but currently client certificates are not. The kolla-ansible certificates command has been updated to support generating certificates for RabbitMQ for development and testing. RabbitMQ TLS is enabled in the all-in-one source CI jobs, or when The Zuul 'tls_enabled' variable is true. Change-Id: I4f1d04150fb2b5af085b762890092f87ae6076b5 Implements: blueprint message-queue-ssl-support	2020-09-17 12:05:44 +01:00
Mark Goddard	496904d650	Performance: use import_tasks for register and bootstrap Including tasks has a performance penalty when compared with importing tasks. If the include has a condition associated with it, then the overhead of the include may be lower than the overhead of skipping all imported tasks. In the case of the register.yml and bootstrap.yml includes, all of the tasks in the included file use run_once: True. The run_once flag improves performance at scale drastically, so importing these tasks unconditionally will have a lower overhead than a conditional include task. It therefore makes sense to switch to use import_tasks there. See [1] for benchmarks of run_once. [1] https://github.com/stackhpc/ansible-scaling/blob/master/doc/run-once.md Change-Id: Ic67631ca3ea3fb2081a6f8978e85b1522522d40d Partially-Implements: blueprint performance-improvements	2020-08-28 16:31:04 +00:00
Mark Goddard	b685ac44e0	Performance: replace unconditional include_tasks with import_tasks Including tasks has a performance penalty when compared with importing tasks. If the include has a condition associated with it, then the overhead of the include may be lower than the overhead of skipping all imported tasks. For unconditionally included tasks, switching to import_tasks provides a clear benefit. Benchmarking of include vs. import is available at [1]. This change switches from include_tasks to import_tasks where there is no condition applied to the include. [1] https://github.com/stackhpc/ansible-scaling/blob/master/doc/include-and-import.md#task-include-and-import Partially-Implements: blueprint performance-improvements Change-Id: Ia45af4a198e422773d9f009c7f7b2e32ce9e3b97	2020-08-28 16:12:03 +00:00
Radosław Piliszek	9c38a0c77b	Drop python-path It was found to be useless in [1]. It is one of distro_python_version usages. Note Freezer and Horizon still use python_path (and hence distro_python_version) for different purposes. [1] https://review.opendev.org/675822 Change-Id: I6d6d9fdf4c28cb2b686d548955108c994b685bb1 Partially-Implements: blueprint drop-distro-python-version	2020-08-24 07:38:21 +00:00
Zuul	d1e5de2120	Merge "Add Keep Alive Timeout for httpd"	2020-08-13 15:27:39 +00:00
James Kirsch	19b028e660	Add Keep Alive Timeout for httpd This patch introduces a global keep alive timeout value for services that leverage httpd + wsgi to handle http/https requests. The default value is one minute. Change-Id: Icf7cb0baf86b428a60a7e9bbed642999711865cd Partially-Implements: blueprint add-ssl-internal-network	2020-08-13 09:52:40 +00:00
Mark Goddard	146b00efa7	Mount /etc/timezone based on host OS Previously we mounted /etc/timezone if the kolla_base_distro is debian or ubuntu. This would fail prechecks if debian or ubuntu images were deployed on CentOS. While this is not a supported combination, for correctness we should fix the condition to reference the host OS rather than the container OS, since that is where the /etc/timezone file is located. Change-Id: Ifc252ae793e6974356fcdca810b373f362d24ba5 Closes-Bug: #1882553	2020-08-10 10:14:18 +01:00
Radosław Piliszek	5d3ca8b09e	Fix Masakari role missing deploy-containers Masakari was introduced parallelly to deploy-containers action and so we missed to add this functionality to it. Change-Id: Ibef198d20d481bc92b38af786cdf0292b246bb12 Closes-Bug: #1889611	2020-07-30 15:41:37 +02:00
Zuul	98f773d0be	Merge "Masakari: copy TLS certificates into containers"	2020-07-24 07:53:48 +00:00
Zuul	39909a600c	Merge "Performance: remove unnecessary conditions from includes"	2020-07-24 07:52:37 +00:00
Mark Goddard	0b4c8a3c3d	Masakari: copy TLS certificates into containers From Ussuri, if CA certificates are copied into /etc/kolla/certificates/ca/, these should be copied into all containers. This is not being done for masakari currently. Additionally, we are not setting the [DEFAULT] nova_ca_certificates_file option in masakari.conf. This depends on masakari bug 1873736 being fixed to work. This change fixes these issues. Change-Id: I9a3633f58e5eb734fa32edc03a3022a500761bbb Closes-Bug: #1888655	2020-07-23 12:06:24 +01:00
Mark Goddard	56ae2db7ac	Performance: Run common role in a separate play The common role was previously added as a dependency to all other roles. It would set a fact after running on a host to avoid running twice. This had the nice effect that deploying any service would automatically pull in the common services for that host. When using tags, any services with matching tags would also run the common role. This could be both surprising and sometimes useful. When using Ansible at large scale, there is a penalty associated with executing a task against a large number of hosts, even if it is skipped. The common role introduces some overhead, just in determining that it has already run. This change extracts the common role into a separate play, and removes the dependency on it from all other roles. New groups have been added for cron, fluentd, and kolla-toolbox, similar to other services. This changes the behaviour in the following ways: * The common role is now run for all hosts at the beginning, rather than prior to their first enabled service * Hosts must be in the necessary group for each of the common services in order to have that service deployed. This is mostly to avoid deploying on localhost or the deployment host * If tags are specified for another service e.g. nova, the common role will not automatically run for matching hosts. The common tag must be specified explicitly The last of these is probably the largest behaviour change. While it would be possible to determine which hosts should automatically run the common role, it would be quite complex, and would introduce some overhead that would probably negate the benefit of splitting out the common role. Partially-Implements: blueprint performance-improvements Change-Id: I6a4676bf6efeebc61383ec7a406db07c7a868b2a	2020-07-07 15:00:47 +00:00
Mark Goddard	7ff27de7ac	Performance: remove unnecessary conditions from includes There are a number of tasks where we conditionally use include_tasks with a condition, and the condition is always true. This change removes these conditions, in preparation for switching unconditional task includes to task imports. Partially-Implements: blueprint performance-improvements Change-Id: I3804c440fe3552950d9d434ef5409f685c39bbcf	2020-07-07 15:50:58 +01:00
wu.chunyang	3e9a648601	permission denied when enable_kolla_dev_mod non-root user has no permission to create directory under /opt directory. use "become: true" to resolve it. Change-Id: I155efc4b1e0691da0aaf6ef19ca709e9dc2d9168	2020-06-07 19:36:42 +08:00
Zuul	87984f5425	Merge "Add Ansible group check to prechecks"	2020-04-16 15:33:46 +00:00
Dincer Celik	4b5df0d866	Introduce /etc/timezone to Debian/Ubuntu containers Some services look for /etc/timezone on Debian/Ubuntu, so we should introduce it to the containers. In addition, added prechecks for /etc/localtime and /etc/timezone. Closes-Bug: #1821592 Change-Id: I9fef14643d1bcc7eee9547eb87fa1fb436d8a6b3	2020-04-09 18:53:36 +00:00
Mark Goddard	0edad7138c	Remove default(omit) from openstack_cacert in templates The use of default(omit) is for module parameters, not templates. We define a default value for openstack_cacert, so it should never be undefined anyway. Change-Id: Idfa73097ca168c76559dc4f3aa8bb30b7113ab28	2020-04-03 14:49:11 +01:00
Radosław Piliszek	266fd61ad7	Use "name:" instead of "role:" for *_role modules Both include_role and import_role expect role's name to be given via "name" param instead of "role". This worked but caused errors with ansible-lint. See: https://review.opendev.org/694779 Change-Id: I388d4ae27111e430d38df1abcb6c6127d90a06e0	2020-03-02 10:01:17 +01:00
Mark Goddard	49fb55f182	Add Ansible group check to prechecks We assume that all groups are present in the inventory, and quite obtuse errors can result if any are not. This change adds a precheck that checks for the presence of all expected groups in the inventory for each service. It also introduces a common service-precheck role that we can use for other common prechecks. Change-Id: Ia0af1e7df4fff7f07cd6530e5b017db8fba530b3 Partially-Implements: blueprint improve-prechecks	2020-02-28 16:23:14 +00:00
Gaëtan Trellu	7f951ea56e	Use internal API for masakari-monitor By default api_interface is set to public, masakari-monitor on compute nodes should communicate via the internal API to reach masakari-api. Change-Id: I454f44e57d7b17d93d4aefc4cbbed93aefe874b1 Closes-Bug: #1858431	2020-02-12 10:23:50 +00:00

1 2

63 Commits