kolla-ansible

Author	SHA1	Message	Date
Mark Goddard	46aeb9843f	Fix prechecks in check mode When running in check mode, some prechecks previously failed because they use the command module which is silently not run in check mode. Other prechecks were not running correctly in check mode due to e.g. looking for a string in empty command output or not querying which containers are running. This change fixes these issues. Closes-Bug: #2002657 Change-Id: I5219cb42c48d5444943a2d48106dc338aa08fa7c	2023-01-12 14:27:36 +00:00
Matt Crees	6c2aace8d6	Integrate oslo-config-validator Regularly, we experience issues in Kolla Ansible deployments because we use wrong options in OpenStack configuration files. This is because OpenStack services ignore unknown options. We also need to keep on top of deprecated options that may be removed in the future. Integrating oslo-config-validator into Kolla Ansible will greatly help. Adds a shared role to run oslo-config-validator on each service. Takes into account that services have multiple containers, and these may also use multiple config files. Service roles are extended to use this shared role. Executed with the new command ``kolla-ansible validate-config``. Change-Id: Ic10b410fc115646d96d2ce39d9618e7c46cb3fbc	2022-12-21 17:19:09 +00:00
Michal Nasiadka	e1ec02eddf	Replace ElasticSearch and Kibana with OpenSearch This change replaces ElasticSearch with OpenSearch, and Kibana with OpenSearch Dashboards. It migrates the data from ElasticSearch to OpenSearch upon upgrade. No TLS support is in this patch (will be a followup). A replacement for ElasticSearch Curator will be added as a followup. Depends-On: https://review.opendev.org/c/openstack/kolla/+/830373 Co-authored-by: Doug Szumski <doug@stackhpc.com> Co-authored-by: Kyle Dean <kyle@stackhpc.com> Change-Id: Iab10ce7ea5d5f21a40b1f99b28e3290b7e9ce895	2022-12-01 10:27:50 +00:00
Doug Szumski	adb8f89a36	Remove support for deploying OpenStack Monasca Kolla Ansible is switching to OpenSearch and is dropping support for deploying ElasticSearch. This is because the final OSS release of ElasticSearch has exceeded its end of life. Monasca is affected because it uses both Logstash and ElasticSearch. Whilst it may continue to work with OpenSearch, Logstash remains an issue. In the absence of any renewed interest in the project, we remove support for deploying it. This helps to reduce the complexity of log processing configuration in Kolla Ansible, freeing up development time. Change-Id: I6fc7842bcda18e417a3fd21c11e28979a470f1cf	2022-11-11 15:48:11 +00:00
Ivan Halomi	4ca2d41762	Adding container_engine to kolla_toolbox module Second part of patchset: https://review.opendev.org/c/openstack/kolla-ansible/+/799229/ in which was suggested to split patch into smaller ones. This change adds container_engine to module parameters so when we introduce podman, kolla_toolbox can be used for both engines. Signed-off-by: Ivan Halomi <i.halomi@partner.samsung.com> Co-authored-by: Martin Hiner <m.hiner@partner.samsung.com> Change-Id: Ic2093aa9341a0cb36df8f340cf290d62437504ad	2022-11-04 15:32:30 +01:00
Ivan Halomi	7a9f04573a	Adding container engine to kolla_container_facts Second part of patchset: https://review.opendev.org/c/openstack/kolla-ansible/+/799229/ in which was suggested to split patch into smaller ones. This change adds container_engine variable to kolla_container_facts module, this prepares module to be used with docker and podman as well without further changes in roles. Signed-off-by: Ivan Halomi <i.halomi@partner.samsung.com> Co-authored-by: Martin Hiner <m.hiner@partner.samsung.com> Change-Id: I9e8fa30646844ab4a288555f3aafdda345b3a118	2022-11-02 13:44:45 +01:00
Michal Arbet	4838591c6c	Add loadbalancer-config role and wrap haproxy-config role inside This patch adds loadbalancer-config role which is "wrapper" around haproxy-config and proxysql-config role which will be added in follow-up patches. Change-Id: I64d41507317081e1860a94b9481a85c8d400797d	2022-08-09 12:15:49 +02:00
Michal Arbet	baad47ac61	Edit services roles to support database sharding Depends-On: https://review.opendev.org/c/openstack/kolla/+/769385 Depends-On: https://review.opendev.org/c/openstack/kolla/+/765781 Change-Id: I3c4182a6556dafd2c936eaab109a068674058fca	2022-08-09 12:15:26 +02:00
Michal Nasiadka	dcf5a8b65f	Fix var-spacing ansible-lint introduced var-spacing - let's fix our code. Change-Id: I0d8aaf3c522a5a6a5495032f6dbed8a2be0251f0	2022-07-25 22:15:15 +02:00
Pierre Riteau	13b0f3b861	Make external access to monitoring services configurable Change-Id: Iaf6bf36ae0adce3342981c36c859fc138b172f6b	2022-06-27 11:57:53 +02:00
T0125936 - LALLAU Bertrand	13af278708	Fix typo in endpoint influxdb_internal_endpoint variable This patch simply fixes a typo in 'influxdb_internal_endpoint' variable. Change-Id: I1b1068e84be7f7eaff1a4eab1ba9ddcd6f4241c7	2022-06-08 11:31:38 +02:00
Radosław Piliszek	3e75a33ad4	Use the new image naming scheme Change-Id: Ib4b15ed4feac82d8492b1c0f0238a752eac668e6	2022-05-23 06:37:25 +00:00
Pierre Riteau	555cd39f1a	Fix typos in docs This is a follow up to I7e5c1e20c7b66b64cbd333f669ef8d8da60daaa8. Change-Id: I11a86f59c1fb9cddde3370b544ee7bf4e8ae4fb4	2022-05-02 15:44:34 +02:00
Zuul	2c15d36fed	Merge "Adds prometheus_scrape_interval"	2022-04-21 16:55:35 +00:00
Marcin Juszkiewicz	1620ab5be9	drop install_type from image names We have only one value for install_type now and it gets removed from image names. Change-Id: I8bf95fd7aa9dd26b80d618ca0fcb097003b4cb0a	2022-04-20 12:29:12 +02:00
Jan Horstmann	3d91e69aab	Change grafana provisioning.yaml indentation This commit changes the indentation scheme used in `ansible/roles/grafana/templates/provisioning.yaml.j2` to the commonly used pattern of two whitespaces. Change-Id: I2f9d34930ed06aa2e63f7cc28bfdda7046fc3e67	2022-03-25 09:26:24 +01:00
Pierre Riteau	f37562827d	Remove grafana [session] configuration These configuration settings were removed in Grafana 6.2. Instead we can use [remote_cache], but it is not required since it will use database settings by default. Change-Id: I37966027aea9039b2ecba4214444507e9d87f513	2022-02-22 10:26:37 +01:00
Will Szumski	033db44f1c	Adds prometheus_scrape_interval Grafana requires the scrape interval to be set to be able to compute $__rate_interval. The default is 15s which does not match the kolla default of 60s. The symptom of not setting this is that you will see "no data" when zooming graphs that use rate queries. This occurs as the interval will be set to a period shorter than the scrape interval. The recommendation is that you use a common scrape interval for all jobs. See: - https://grafana.com/blog/2020/09/28/new-in-grafana-7.2-__rate_interval-for-prometheus-rate-queries-that-just-work/ - https://stackoverflow.com/questions/66369969/set-scrape-interval-in-provisioned-prometheus-data-source-in-grafana Change-Id: I7e5c1e20c7b66b64cbd333f669ef8d8da60daaa8	2022-02-14 11:10:44 +00:00
Pierre Riteau	56fc74f231	Move project_name and kolla_role_name to role vars Role vars have a higher precedence than role defaults. This allows to import default vars from another role via vars_files without overriding project_name (see related bug for details). Change-Id: I3d919736e53d6f3e1a70d1267cf42c8d2c0ad221 Related-Bug: #1951785	2021-12-31 09:26:25 +00:00
Dr. Jens Harbott	f8f34e0c47	Bump timeout for grafana startup The initial migrations when starting grafana for the first time may sometimes take much longer than 20s, we have seen samples up to near 60s. Allow 120s to have some margin. Also make the timeout parameters configurable. Closes-Bug: 1769962 Signed-off-by: Dr. Jens Harbott <harbott@osism.tech> Change-Id: If9186d8aa65150c492657550064789e211dbb570	2021-12-09 08:05:57 +01:00
Uwe Grawert	82b0e095a5	Grafana: Run privileged when copying home dashboard file The copy job for the grafana home dashboard file needs to run privileged, otherwise permission denied error occurs. Closes-Bug: #1947710 Change-Id: Ib15e961e5193af55e45a443305a96667295f3cb7	2021-10-20 11:26:09 +02:00
Radosław Piliszek	9ff2ecb031	Refactor and optimise image pulling We get a nice optimisation by using a filtered loop instead of task skipping per service with 'when'. Partially-Implements: blueprint performance-improvements Change-Id: I8f68100870ab90cb2d6b68a66a4c97df9ea4ff52	2021-08-10 11:57:54 +00:00
Zuul	6ea8390a12	Merge "Extend support for custom Grafana dashboards"	2021-07-12 16:00:47 +00:00
Mark Goddard	ade5bfa302	Use ansible_facts to reference facts By default, Ansible injects a variable for every fact, prefixed with ansible_. This can result in a large number of variables for each host, which at scale can incur a performance penalty. Ansible provides a configuration option [0] that can be set to False to prevent this injection of facts. In this case, facts should be referenced via ansible_facts.<fact>. This change updates all references to Ansible facts within Kolla Ansible from using individual fact variables to using the items in the ansible_facts dictionary. This allows users to disable fact variable injection in their Ansible configuration, which may provide some performance improvement. This change disables fact variable injection in the ansible configuration used in CI, to catch any attempts to use the injected variables. [0] https://docs.ansible.com/ansible/latest/reference_appendices/config.html#inject-facts-as-vars Change-Id: I7e9d5c9b8b9164d4aee3abb4e37c8f28d98ff5d1 Partially-Implements: blueprint performance-improvements	2021-06-23 10:38:06 +01:00
Doug Szumski	82cf40edf2	Remove Monasca Grafana service In the Xena cycle it was decided to remove the Monasca Grafana fork due to lack of maintenance. This commit removes the service and provides a limited workaround using the Monasca Grafana datasource with vanilla Grafana. Depends-On: I9db7ec2df050fa20317d84f6cea40d1f5fd42e60 Change-Id: I4917ece1951084f6665722ba9a91d47764d3709a	2021-04-27 11:06:25 +00:00
Doug Szumski	d01192c160	Extend support for custom Grafana dashboards The current behaviour is to support supplying a single folder of Grafana dashboards which can then be populated into a single folder in Grafana. Some users may wish to have sub-folders of Dashboards, and load these into separate dashboard folders in Grafana via a custom provisioning file. For example, a user may have a sub-folder of Ceph dashboards that they wish to keep separate from OpenStack dashboards. This patch supports sub-folders whilst not affecting the original mechanism. Trivial-Fix Change-Id: I9cd289a1ea79f00cee4d2ef30cbb508ac73f9767	2021-04-19 11:11:43 +01:00
Zuul	2ba4c88c8d	Merge "Add support for custom grafana dashboards"	2021-03-17 16:48:48 +00:00
Bartosz Bezak	a9e30382fe	Add support for custom grafana dashboards Allow users to import custom grafana dashboards. Dashboards as JSON files should be placed into "{{ node_custom_config }}/grafana/dashboards/" folder. Change-Id: Id0f83b8d08541b3b74649f097b10c9450201b426	2021-03-16 17:10:19 +01:00
Will Szumski	31f97d6cca	Do not wait for grafana to start when kolla_action=config Prior to this change it was not possible to generate the config before deploying the services as you'd hit: RUNNING HANDLER [Waiting for grafana to start on first node] *********************** Monday 18 January 2021 15:06:35 +0000 (0:00:00.182) 0:04:39.213 ****** skipping: [sv-h22a8-u19] skipping: [sv-h22a5-u36] FAILED - RETRYING: Waiting for grafana to start on first node (10 retries left). This would never succeed as the service has not yet been deployed. TrivialFix Change-Id: I9437a049b24e5e613a7e66add481a8983b84867a	2021-01-18 15:42:31 +00:00
Mark Goddard	db4fc85c33	Revert "Performance: Use import_tasks in the main plays" This reverts commit 9cae59be51e8d2d798830042a5fd448a4aa5e7dc. Reason for revert: This patch was found to introduce issues with fluentd customisation. The underlying issue is not currently fully understood, but could be a sign of other obscure issues. Change-Id: Ia4859c23d85699621a3b734d6cedb70225576dfc Closes-Bug: #1906288	2020-12-14 10:36:55 +00:00
Radosław Piliszek	9cae59be51	Performance: Use import_tasks in the main plays Main plays are action-redirect-stubs, ideal for import_tasks. This avoids 'include' penalty and makes logs/ara look nicer. Fixes haproxy and rabbitmq not to check the host group as well. Change-Id: I46136fc40b815e341befff80b54a91ef431eabc0 Partially-Implements: blueprint performance-improvements	2020-10-27 19:09:32 +01:00
Radosław Piliszek	3411b9e420	Performance: optimize genconfig Config plays do not need to check containers. This avoids skipping tasks during the genconfig action. Ironic and Glance rolling upgrades are handled specially. Swift and Bifrost do not use the handlers at all. Partially-Implements: blueprint performance-improvements Change-Id: I140bf71d62e8f0932c96270d1f08940a5ba4542a	2020-10-12 19:30:06 +02:00
Mark Goddard	b685ac44e0	Performance: replace unconditional include_tasks with import_tasks Including tasks has a performance penalty when compared with importing tasks. If the include has a condition associated with it, then the overhead of the include may be lower than the overhead of skipping all imported tasks. For unconditionally included tasks, switching to import_tasks provides a clear benefit. Benchmarking of include vs. import is available at [1]. This change switches from include_tasks to import_tasks where there is no condition applied to the include. [1] https://github.com/stackhpc/ansible-scaling/blob/master/doc/include-and-import.md#task-include-and-import Partially-Implements: blueprint performance-improvements Change-Id: Ia45af4a198e422773d9f009c7f7b2e32ce9e3b97	2020-08-28 16:12:03 +00:00
Rafael Weingärtner	f425c0678f	Standardize use and construction of endpoint URLs The goal for this push request is to normalize the construction and use of internal, external, and admin URLs. While extending Kolla-ansible to enable a more flexible method to manage external URLs, we noticed that the same URL was constructed multiple times in different parts of the code. This can make it difficult for people that want to work with these URLs and create inconsistencies in a large code base with time. Therefore, we are proposing here the use of "single Kolla-ansible variable" per endpoint URL, which facilitates for people that are interested in overriding/extending these URLs. As an example, we extended Kolla-ansible to facilitate the "override" of public (external) URLs with the following standard "<component/serviceName>.<companyBaseUrl>". Therefore, the "NAT/redirect" in the SSL termination system (HAproxy, HTTPD or some other) is done via the service name, and not by the port. This allows operators to easily and automatically create more friendly URL names. To develop this feature, we first applied this patch that we are sending now to the community. We did that to reduce the surface of changes in Kolla-ansible. Another example is the integration of Kolla-ansible and Consul, which we also implemented internally, and also requires URLs changes. Therefore, this PR is essential to reduce code duplicity, and to facility users/developers to work/customize the services URLs. Change-Id: I73d483e01476e779a5155b2e18dd5ea25f514e93 Signed-off-by: Rafael Weingärtner <rafael@apache.org>	2020-08-19 07:22:17 +00:00
Mark Goddard	146b00efa7	Mount /etc/timezone based on host OS Previously we mounted /etc/timezone if the kolla_base_distro is debian or ubuntu. This would fail prechecks if debian or ubuntu images were deployed on CentOS. While this is not a supported combination, for correctness we should fix the condition to reference the host OS rather than the container OS, since that is where the /etc/timezone file is located. Change-Id: Ifc252ae793e6974356fcdca810b373f362d24ba5 Closes-Bug: #1882553	2020-08-10 10:14:18 +01:00
Mark Goddard	9702d4c3c3	Performance: use import_tasks for check-containers.yml Including tasks has a performance penalty when compared with importing tasks. If the include has a condition associated with it, then the overhead of the include may be lower than the overhead of skipping all imported tasks. In the case of the check-containers.yml include, the included file only has a single task, so the overhead of skipping this task will not be greater than the overhead of the task import. It therefore makes sense to switch to use import_tasks there. Partially-Implements: blueprint performance-improvements Change-Id: I65d911670649960708b9f6a4c110d1a7df1ad8f7	2020-07-28 12:10:59 +01:00
Doug Szumski	2c730590d7	Improve Grafana DB bootstrap This fixes an issue where multiple Grafana instances would race to bootstrap the Grafana DB. The following changes are made: - Only start additional Grafana instances after the DB has been configured. - During upgrade, don't allow old instances to run with an upgraded DB schema. Change-Id: I3e0e077ba6a6f43667df042eb593107418a06c39 Closes-Bug: #1888681	2020-07-27 08:23:05 +00:00
Mark Goddard	56ae2db7ac	Performance: Run common role in a separate play The common role was previously added as a dependency to all other roles. It would set a fact after running on a host to avoid running twice. This had the nice effect that deploying any service would automatically pull in the common services for that host. When using tags, any services with matching tags would also run the common role. This could be both surprising and sometimes useful. When using Ansible at large scale, there is a penalty associated with executing a task against a large number of hosts, even if it is skipped. The common role introduces some overhead, just in determining that it has already run. This change extracts the common role into a separate play, and removes the dependency on it from all other roles. New groups have been added for cron, fluentd, and kolla-toolbox, similar to other services. This changes the behaviour in the following ways: * The common role is now run for all hosts at the beginning, rather than prior to their first enabled service * Hosts must be in the necessary group for each of the common services in order to have that service deployed. This is mostly to avoid deploying on localhost or the deployment host * If tags are specified for another service e.g. nova, the common role will not automatically run for matching hosts. The common tag must be specified explicitly The last of these is probably the largest behaviour change. While it would be possible to determine which hosts should automatically run the common role, it would be quite complex, and would introduce some overhead that would probably negate the benefit of splitting out the common role. Partially-Implements: blueprint performance-improvements Change-Id: I6a4676bf6efeebc61383ec7a406db07c7a868b2a	2020-07-07 15:00:47 +00:00
Radosław Piliszek	7bd8805004	Fix Grafana datasource update Grafana changed the error message wording. Match on the shortest sane string to play it safe. Change-Id: Ic175ebdb1da6ef66047309ff07bcbba98fc67008 Closes-Bug: #1881890	2020-06-15 11:34:30 +02:00
Zuul	87984f5425	Merge "Add Ansible group check to prechecks"	2020-04-16 15:33:46 +00:00
Zuul	7f42813159	Merge "Refactor copy certificates task"	2020-04-16 14:03:37 +00:00
James Kirsch	4d155d69cd	Refactor copy certificates task Refactor service configuration to use the copy certificates task. This reduces code duplication and simplifies implementing encrypting backend HAProxy traffic for individual services. Change-Id: I0474324b60a5f792ef5210ab336639edf7a8cd9e	2020-04-14 17:26:19 +00:00
Dincer Celik	4b5df0d866	Introduce /etc/timezone to Debian/Ubuntu containers Some services look for /etc/timezone on Debian/Ubuntu, so we should introduce it to the containers. In addition, added prechecks for /etc/localtime and /etc/timezone. Closes-Bug: #1821592 Change-Id: I9fef14643d1bcc7eee9547eb87fa1fb436d8a6b3	2020-04-09 18:53:36 +00:00
Zuul	f867373a73	Merge "support ipv6 for grafana.ini.j2"	2020-03-11 02:29:54 +00:00
yj.bai	3e582a05fa	support ipv6 for grafana.ini.j2 grafana not support ipv6 in grafana.ini.j2. Closes-Bug: #1866141 Change-Id: Ia89a9283e70c10a624f25108b487528dbb370ee4 Signed-off-by: yj.bai <bai.yongjun@99cloud.net>	2020-03-10 17:47:34 +00:00
Zuul	2a2ce059dc	Merge "Add notify restart container when cert changed"	2020-03-10 12:12:55 +00:00
Zuul	f4d2b6e092	Merge "Construct service REST API urls using configured protocol"	2020-03-10 08:42:57 +00:00
yj.bai	d3cc2f670e	Add notify restart container when cert changed When change the cert file in /etc/kolla/certificate/. The certificate in the container has not changed. So I think can use kolla-ansible deploy when certificate is changed. restart <container> Partially-Implements: blueprint custom-cacerts Change-Id: Iaac6f37e85ffdc0352e8062ae5049cc9a6b3db26 Signed-off-by: yj.bai <bai.yongjun@99cloud.net>	2020-03-10 16:23:09 +08:00
Radosław Piliszek	266fd61ad7	Use "name:" instead of "role:" for *_role modules Both include_role and import_role expect role's name to be given via "name" param instead of "role". This worked but caused errors with ansible-lint. See: https://review.opendev.org/694779 Change-Id: I388d4ae27111e430d38df1abcb6c6127d90a06e0	2020-03-02 10:01:17 +01:00
Mark Goddard	49fb55f182	Add Ansible group check to prechecks We assume that all groups are present in the inventory, and quite obtuse errors can result if any are not. This change adds a precheck that checks for the presence of all expected groups in the inventory for each service. It also introduces a common service-precheck role that we can use for other common prechecks. Change-Id: Ia0af1e7df4fff7f07cd6530e5b017db8fba530b3 Partially-Implements: blueprint improve-prechecks	2020-02-28 16:23:14 +00:00


121 Commits