kolla-ansible

Author	SHA1	Message	Date
Mark Goddard	56ae2db7ac	Performance: Run common role in a separate play The common role was previously added as a dependency to all other roles. It would set a fact after running on a host to avoid running twice. This had the nice effect that deploying any service would automatically pull in the common services for that host. When using tags, any services with matching tags would also run the common role. This could be both surprising and sometimes useful. When using Ansible at large scale, there is a penalty associated with executing a task against a large number of hosts, even if it is skipped. The common role introduces some overhead, just in determining that it has already run. This change extracts the common role into a separate play, and removes the dependency on it from all other roles. New groups have been added for cron, fluentd, and kolla-toolbox, similar to other services. This changes the behaviour in the following ways: * The common role is now run for all hosts at the beginning, rather than prior to their first enabled service * Hosts must be in the necessary group for each of the common services in order to have that service deployed. This is mostly to avoid deploying on localhost or the deployment host * If tags are specified for another service e.g. nova, the common role will not automatically run for matching hosts. The common tag must be specified explicitly The last of these is probably the largest behaviour change. While it would be possible to determine which hosts should automatically run the common role, it would be quite complex, and would introduce some overhead that would probably negate the benefit of splitting out the common role. Partially-Implements: blueprint performance-improvements Change-Id: I6a4676bf6efeebc61383ec7a406db07c7a868b2a	2020-07-07 15:00:47 +00:00
Zuul	03b6aaf3a7	Merge "fix deploy nova failed when use kolla_dev_mod"	2020-06-15 05:03:59 +00:00
wu.chunyang	40096b4868	fix deploy nova failed when use kolla_dev_mod There's a logic error here, we call nova role from nova.yml file under ansible folder. we should clone code before run bootstrap_service task. if not, /opt/stack/nova which is empty will mount to nova_api container. Change-Id: Icc54c15080db9c2dc92709480e00b990e5a88662	2020-06-15 01:45:23 +00:00
gugug	2ebd62014e	Remove nova-consoleauth task in U cycle planned task removal Change-Id: I613794667b8c08f524a69e7e3f447b2217efb3f7	2020-06-15 00:22:15 +00:00
wu.chunyang	3e9a648601	permission denied when enable_kolla_dev_mod non-root user has no permission to create directory under /opt directory. use "become: true" to resolve it. Change-Id: I155efc4b1e0691da0aaf6ef19ca709e9dc2d9168	2020-06-07 19:36:42 +08:00
Radosław Piliszek	93c9ad892c	Make nova perms consistent between applications Nova cells support introduced a slight regression that triggers odd behaviour when we tried switching to Apache (httpd) [1]. Bootstrap no longer applied permissions recursively to all log files, creating a discrepancy between normal and bootstrap runs and also Nova and other services such as Cinder (regarding bootstrap logging). This patch fixes it. Backport to Train. Not creating reno nor a bug record because it does not affect any current standard usage in any currently known way. Note this only really hides (standardizes?) the global issue that we don't control file permissions on newly created files too well. [1] https://review.opendev.org/724793 Change-Id: I35e9924ccede5edd2e1307043379aba944725143 Needed-By: https://review.opendev.org/724793	2020-05-06 18:36:10 +00:00
Zuul	5079fa240d	Merge "Manage nova scheduler workers count"	2020-04-21 19:36:44 +00:00
Jeffrey Zhang	8523ab569d	Manage nova scheduler workers count Closes-Bug: 1873753 Change-Id: Ibf75b3fd3090d47b50c71d17f6c12177d16442ab	2020-04-21 09:15:36 +00:00
Zuul	7a193d1f06	Merge "Ansible lint: lines longer than 160 chars"	2020-04-17 09:29:00 +00:00
Zuul	87984f5425	Merge "Add Ansible group check to prechecks"	2020-04-16 15:33:46 +00:00
Zuul	7f42813159	Merge "Refactor copy certificates task"	2020-04-16 14:03:37 +00:00
Michal Nasiadka	d403690b88	Ansible lint: lines longer than 160 chars Change-Id: I500cc8800c412bc0e95edb15babad5c1189e6ee4	2020-04-16 15:59:06 +02:00
James Kirsch	4d155d69cd	Refactor copy certificates task Refactor service configuration to use the copy certificates task. This reduces code duplication and simplifies implementing encrypting backend HAProxy traffic for individual services. Change-Id: I0474324b60a5f792ef5210ab336639edf7a8cd9e	2020-04-14 17:26:19 +00:00
Dincer Celik	4b5df0d866	Introduce /etc/timezone to Debian/Ubuntu containers Some services look for /etc/timezone on Debian/Ubuntu, so we should introduce it to the containers. In addition, added prechecks for /etc/localtime and /etc/timezone. Closes-Bug: #1821592 Change-Id: I9fef14643d1bcc7eee9547eb87fa1fb436d8a6b3	2020-04-09 18:53:36 +00:00
Mark Goddard	0edad7138c	Remove default(omit) from openstack_cacert in templates The use of default(omit) is for module parameters, not templates. We define a default value for openstack_cacert, so it should never be undefined anyway. Change-Id: Idfa73097ca168c76559dc4f3aa8bb30b7113ab28	2020-04-03 14:49:11 +01:00
Zuul	2a2ce059dc	Merge "Add notify restart container when cert changed"	2020-03-10 12:12:55 +00:00
yj.bai	d3cc2f670e	Add notify restart container when cert changed When change the cert file in /etc/kolla/certificate/. The certificate in the container has not changed. So I think can use kolla-ansible deploy when certificate is changed. restart <container> Partially-Implements: blueprint custom-cacerts Change-Id: Iaac6f37e85ffdc0352e8062ae5049cc9a6b3db26 Signed-off-by: yj.bai <bai.yongjun@99cloud.net>	2020-03-10 16:23:09 +08:00
Radosław Piliszek	266fd61ad7	Use "name:" instead of "role:" for *_role modules Both include_role and import_role expect role's name to be given via "name" param instead of "role". This worked but caused errors with ansible-lint. See: https://review.opendev.org/694779 Change-Id: I388d4ae27111e430d38df1abcb6c6127d90a06e0	2020-03-02 10:01:17 +01:00
Mark Goddard	49fb55f182	Add Ansible group check to prechecks We assume that all groups are present in the inventory, and quite obtuse errors can result if any are not. This change adds a precheck that checks for the presence of all expected groups in the inventory for each service. It also introduces a common service-precheck role that we can use for other common prechecks. Change-Id: Ia0af1e7df4fff7f07cd6530e5b017db8fba530b3 Partially-Implements: blueprint improve-prechecks	2020-02-28 16:23:14 +00:00
Zuul	b3c8ff59f1	Merge "Copy CA into containers."	2020-02-07 17:25:01 +00:00
Mark Goddard	5a786436be	Python 3: Use distro_python_version for dev mode In dev mode currently the python source is mounted under python2.7 site-packages. This change fixes this to use the distro_python_version variable to ensure dev mode works with Python 3 images. Change-Id: Ieae3778a02f1b79023b4f1c20eff27b37f481077 Partially-Implements: blueprint python-3	2020-01-30 14:00:34 +00:00
James Kirsch	511ba9f6a2	Copy CA into containers. When kolla_copy_ca_into_containers is set to "yes", the Certificate Authority in /etc/kolla/certificates will be copied into service containers to enable trust for that CA. This is especially useful when the CA is self signed, and would not be trusted by default. Partially-Implements: blueprint custom-cacerts Change-Id: I4368f8994147580460ebe7533850cf63a419d0b4	2020-01-28 14:03:32 -08:00
Zuul	5126087af5	Merge "CentOS 8: Support variable image tag suffix"	2020-01-21 09:29:58 +00:00
Marcin Juszkiewicz	1453474c65	Ansible lint: Variables should have spaces before and after Change-Id: I70f692f125739b5119c71a554a37b5c21d4164f6	2020-01-16 16:37:20 +01:00
James Kirsch	c15dc20341	Configure services to use Certificate Authority Include a reference to the globally configured Certificate Authority to all services. Services use the CA to verify HTTPs connections. Change-Id: I38da931cdd7ff46cce1994763b5c713652b096cc Partially-Implements: blueprint support-trusted-ca-certificate-file	2020-01-13 11:00:11 -08:00
Mark Goddard	9755c924be	CentOS 8: Support variable image tag suffix For the CentOS 7 to 8 transition, we will have a period where both CentOS 7 and 8 images are available. We differentiate these images via a tag - the CentOS 8 images will have a tag of train-centos8 (or master-centos8 temporarily). To achieve this, and maintain backwards compatibility for the openstack_release variable, we introduce a new 'openstack_tag' variable. This variable is based on openstack_release, but has a suffix of 'openstack_tag_suffix', which is empty except on CentOS 8 where it has a value of '-centos8'. Change-Id: I12ce4661afb3c255136cdc1aabe7cbd25560d625 Partially-Implements: blueprint centos-rhel-8	2020-01-10 09:56:04 +00:00
Zuul	a876f3b972	Merge "Use valid_interfaces instead of os_interface for placement"	2019-11-22 18:42:13 +00:00
Pierre Riteau	96590822de	Use valid_interfaces instead of os_interface for placement The [placement].os_interface option was replaced by [placement].valid_interfaces in Queens and was removed in Rocky. Change-Id: I306c57305b9088159dd18af4aa373bbc39a8b881 Closes-Bug: #1853621	2019-11-22 16:39:04 +01:00
Michal Nasiadka	1009931162	Change local_action to delegate_to: localhost As part of the effort to implement Ansible code linting in CI (using ansible-lint) - we need to implement recommendations from ansible-lint output [1]. One of them is to stop using local_action in favor of delegate_to - to increase readability and and match the style of typical ansible tasks. [1]: https://review.opendev.org/694779/ Partially implements: blueprint ansible-lint Change-Id: I46c259ddad5a6aaf9c7301e6c44cd8a1d5c457d3	2019-11-22 15:04:44 +00:00
Mark Goddard	d98be11e46	Remove precheck for nova_enable_rolling_upgrade This variable was removed in the Train cycle, and a precheck added for its use. This precheck can now be removed. Change-Id: I6d9f0b577631ff9443deecf8ef9d94ca217674c5	2019-11-14 11:53:53 +00:00
yuchengde	44a302d7d2	Allow nova to delete volume with cinder auth If "reclaim_instance_interval" has been set in nova conf, attched volume may not be delete while instacne deleted. Adding cinder auth in nova conf can solve the problem. Change-Id: I9eb3a74c2f6976043cc35a94915f1fcecb9ef601 Closes-Bug: 1850279	2019-11-01 14:48:37 +00:00
Zuul	9852082724	Merge "Fix nova scheduler down after first docker restart"	2019-10-30 16:38:37 +00:00
Mark Goddard	6bdf202658	Fix nova scheduler down after first docker restart Due to a Docker bug [1] we cannot use Docker to send SIGHUP to the container because it will mark it as stopped. This patch sends the signal directly to the process, bypassing Docker. 'changed_when: false' is also removed from the relevant task as it definitely changes the state. In the future we could do the refresh only if there really is a need for another one. [1] https://github.com/moby/moby/issues/11065 Change-Id: Ief73bbd24568d6941384ea3330ab45f11aa42d37 Co-authored-by: Radosław Piliszek <radoslaw.piliszek@gmail.com> Closes-Bug: #1845244	2019-10-25 12:32:48 +00:00
Radosław Piliszek	ebf88232dd	[nova][train] Remove deprecated RetryFilter Affects config with Blazar and fake Nova only. The default does not include it. Upstream docs: RetryFilter - Deprecated since version 20.0.0 (Train) Since the 17.0.0 (Queens) release, the scheduler has provided alternate hosts for rescheduling so the scheduler does not need to be called during a reschedule which makes the RetryFilter useless. Change-Id: I26bf45997005124e9166b5bf1d44cb276624430b	2019-10-18 10:30:37 +02:00
Doug Szumski	78a828ef42	Support multiple nova cells This patch adds initial support for deploying multiple Nova cells. Splitting a nova-cell role out from the Nova role allows a more granular approach to deploying and configuring Nova services. A new enable_cells flag has been added that enables the support of multiple cells via the introduction of a super conductor in addition to cell-specific conductors. When this flag is not set (the default), nova is configured in the same manner as before - with a single conductor. The nova role now deploys the global services: * nova-api * nova-scheduler * nova-super-conductor (if enable_cells is true) The nova-cell role handles services specific to a cell: * nova-compute * nova-compute-ironic * nova-conductor * nova-libvirt * nova-novncproxy * nova-serialproxy * nova-spicehtml5proxy * nova-ssh This patch does not support using a single cell controller for managing more than one cell. Support for sharing a cell controller will be added in a future patch. This patch should be backwards compatible and is tested by existing CI jobs. A new CI job has been added that tests a multi-cell environment. ceph-mon has been removed from the play hosts list as it is not necessary - delegate_to does not require the host to be in the play. Documentation will be added in a separate patch. Partially Implements: blueprint support-nova-cells Co-Authored-By: Mark Goddard <mark@stackhpc.com> Change-Id: I810aad7d49db3f5a7fd9a2f0f746fd912fe03917	2019-10-16 17:42:36 +00:00
Radosław Piliszek	bc053c09c1	Implement IPv6 support in the control plane Introduce kolla_address filter. Introduce put_address_in_context filter. Add AF config to vars. Address contexts: - raw (default): <ADDR> - memcache: inet6:[<ADDR>] - url: [<ADDR>] Other changes: globals.yml - mention just IP in comment prechecks/port_checks (api_intf) - kolla_address handles validation 3x interface conditional (swift configs: replication/storage) 2x interface variable definition with hostname (haproxy listens; api intf) 1x interface variable definition with hostname with bifrost exclusion (baremetal pre-install /etc/hosts; api intf) neutron's ml2 'overlay_ip_version' set to 6 for IPv6 on tunnel network basic multinode source CI job for IPv6 prechecks for rabbitmq and qdrouterd use proper NSS database now MariaDB Galera Cluster WSREP SST mariabackup workaround (socat and IPv6) Ceph naming workaround in CI TODO: probably needs documenting RabbitMQ IPv6-only proto_dist Ceph ms switch to IPv6 mode Remove neutron-server ml2_type_vxlan/vxlan_group setting as it is not used (let's avoid any confusion) and could break setups without proper multicast routing if it started working (also IPv4-only) haproxy upgrade checks for slaves based on ipv6 addresses TODO: ovs-dpdk grabs ipv4 network address (w/ prefix len / submask) not supported, invalid by default because neutron_external has no address No idea whether ovs-dpdk works at all atm. ml2 for xenapi Xen is not supported too well. This would require working with XenAPI facts. rp_filter setting This would require meddling with ip6tables (there is no sysctl param). By default nothing is dropped. Unlikely we really need it. ironic dnsmasq is configured IPv4-only dnsmasq needs DHCPv6 options and testing in vivo. KNOWN ISSUES (beyond us): One cannot use IPv6 address to reference the image for docker like we currently do, see: https://github.com/moby/moby/issues/39033 (docker_registry; docker API 400 - invalid reference format) workaround: use hostname/FQDN RabbitMQ may fail to bind to IPv6 if hostname resolves also to IPv4. This is due to old RabbitMQ versions available in images. IPv4 is preferred by default and may fail in the IPv6-only scenario. This should be no problem in real life as IPv6-only is indeed IPv6-only. Also, when new RabbitMQ (3.7.16/3.8+) makes it into images, this will no longer be relevant as we supply all the necessary config. See: https://github.com/rabbitmq/rabbitmq-server/pull/1982 For reliable runs, at least Ansible 2.8 is required (2.8.5 confirmed to work well). Older Ansible versions are known to miss IPv6 addresses in interface facts. This may affect redeploys, reconfigures and upgrades which run after VIP address is assigned. See: https://github.com/ansible/ansible/issues/63227 Bifrost Train does not support IPv6 deployments. See: https://storyboard.openstack.org/#!/story/2006689 Change-Id: Ia34e6916ea4f99e9522cd2ddde03a0a4776f7e2c Implements: blueprint ipv6-control-plane Signed-off-by: Radosław Piliszek <radoslaw.piliszek@gmail.com>	2019-10-16 10:24:35 +02:00
Michal Nasiadka	bdc8df0c90	Change ceph_client caps to use profile rbd Using profiles in cephx is the recommended way since Mimic, this also adds support for blacklist ops. Change-Id: Ib9f65644637a5761c6cd7ca8925afc6bb2b8d5f5 Closes-Bug: #1760065	2019-10-09 14:18:17 +02:00
Zuul	87e671c6b4	Merge "Add a job that only deploys updated containers"	2019-09-30 13:19:28 +00:00
Doug Szumski	0d7a34e8c7	Remove Nova legacy upgrade The rolling upgrade has been the default since Stein. The legacy upgrade has been removed because it doesn't follow the upgrade guide [1]. [1] https://docs.openstack.org/nova/latest/user/upgrade.html Change-Id: I2aa879699cb4e9955bf5c38053eada5a53fb6211	2019-09-26 18:04:50 +01:00
Kris Lindgren	2fe0d98ebb	Add a job that only deploys updated containers Sometimes as cloud admins, we want to only update code that is running in a cloud. But we dont need to do anything else. Make an action in kolla-ansible that allows us to do that. Change-Id: I904f595c69f7276e71692696471e32fd1f88e6e8 Implements: blueprint deploy-containers-action	2019-09-26 17:51:14 +01:00
Zuul	340b6d9456	Merge "Add support for libvirt+tls"	2019-09-26 09:19:33 +00:00
Kris Lindgren	f8cfccb99e	Add support for libvirt+tls To securely support live migration between computenodes we should enable tls, with cert auth, instead of TCP with no auth support. Implements: blueprint libvirt-tls Change-Id: I22ea6233933c840b853fdcc8e03400b2bf577271	2019-09-19 15:32:41 +01:00
Zuul	a21b9b5430	Merge "Refactor service, endpoint and user registration"	2019-09-18 17:34:15 +00:00
Zuul	c11ada4148	Merge "Ignore create_cells and discover_computes when nova-api is disabled"	2019-09-18 11:43:18 +00:00
Mark Goddard	3522d235bd	Refactor service, endpoint and user registration Use upstream Ansible modules for registration of services, endpoints, users, projects, roles, and role grants. Change-Id: I7c9138d422cc91c177fd8992347176bb54156b5a	2019-09-17 10:13:56 -07:00
Radosław Piliszek	b4ef4638a6	Fix enforced horizon redirect to https Also fixes similar issues introduced by the same recent change. Added FIXME note about possible TLS malfunction regarding horizon. Change-Id: I5f46a9306139eb550d3849757c8bdf0767537c78 Closes-Bug: #1844016 Signed-off-by: Radosław Piliszek <radoslaw.piliszek@gmail.com>	2019-09-14 22:00:56 +02:00
Zuul	ff86c2f2e3	Merge "Implement TLS encryption for internal endpoints"	2019-09-12 09:20:54 +00:00
Yang Youseok	f1f12d70a3	Ignore create_cells and discover_computes when nova-api is disabled When nova-api group have no hosts, we don't have to run create_cells and discover_computes. Add conditional blocks to prevent to run them. Change-Id: Ia1ba058c1b74b06b678f45544883e567e2b4eb55 Closes-Bug: #1843235	2019-09-11 17:51:27 +09:00
Zuul	556f08b5c1	Merge "Fix Nova cell search"	2019-09-07 19:46:24 +00:00
Zuul	8661599b9e	Merge "[nova] Fix service catalog lookup of Neutron endpoint"	2019-09-04 09:18:37 +00:00

1 2 3 4 5 ...

632 Commits