Adds a new configuration file that provides fluentd with an appropriate
regex to match Open vSwitch logs in both default log files.
The regex is segmented with variables so as to isolate the relevant
parts of each log message.
Closes-Bug: #1965815
Signed-off-by: Juan Pablo Suazo <jsuazo@whitestack.com>
Change-Id: Ife83c50c048d517a5c8a5dee588f8f7846fcee00
Change Ia1239069ccee39416b20959cbabad962c56693cf added support for
running a libvirt daemon on the host, rather than using the nova_libvirt
container. It did not cover migration of existing hosts from using a
container to using a host daemon.
This change adds a kolla-ansible nova-libvirt-cleanup command which may
be used to clean up the nova_libvirt container, volumes and related
items on hosts, once it has been disabled.
The playbook assumes that compute hosts have been emptied of VMs before
it runs. A future extension could support migration of existing VMs, but
this is currently out of scope.
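A usage sketch, with an illustrative inventory path and host limit:

    kolla-ansible -i /etc/kolla/multinode nova-libvirt-cleanup --limit compute01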
Change-Id: I46854ed7eaf1d5b5e3ccd8531c963427848bdc99
In some cases it may be desirable to run the libvirt daemon on the
host, for example when mixing host and container OS distributions or
versions.
This change makes it possible to disable the nova_libvirt container, by
setting enable_nova_libvirt_container to false. The default values of
some Docker mounts and other paths have been updated to point to default
host directories rather than Docker volumes when using a host libvirt
daemon.
This change does not handle migration of existing systems from using
a nova_libvirt container to libvirt on the host.
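For example, in globals.yml (the variable name is the one introduced
by this change):

    # Use the host's libvirt daemon instead of the nova_libvirt container.
    enable_nova_libvirt_container: false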
Depends-On: https://review.opendev.org/c/openstack/ansible-collection-kolla/+/830504
Change-Id: Ia1239069ccee39416b20959cbabad962c56693cf
Consistently use template instead of copy. This has the added
advantage of allowing variables inside ceph conf files and keyrings.
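As an illustrative sketch of the pattern (the task and paths shown are
not the exact ones touched by this change):

    # Before: the file is copied verbatim, with no variable interpolation.
    - name: Copy over ceph.conf
      copy:
        src: "{{ node_custom_config }}/cinder/ceph.conf"
        dest: "{{ node_config_directory }}/cinder-volume/ceph.conf"
        mode: "0660"

    # After: the file is rendered as a Jinja2 template, so it may
    # reference variables.
    - name: Copy over ceph.conf
      template:
        src: "{{ node_custom_config }}/cinder/ceph.conf"
        dest: "{{ node_config_directory }}/cinder-volume/ceph.conf"
        mode: "0660"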
Closes-Bug: 1959565
Signed-off-by: Imran Hussain <ih@imranh.co.uk>
Change-Id: Ibd0ff2641a54267ff06d3c89a26915a455dff1c1
This project [1] can provide a one-stop solution for log collection,
cleaning, indexing, analysis, alarming, visualization and report
generation, helping operators and maintainers to quickly resolve
log retrieval problems, grasp the operational health of the
platform, and improve the level of platform management.
[1] https://wiki.openstack.org/wiki/Venus
Change-Id: If3562bbed6181002b76831bab54f863041c5a885
ansible-lint 6.0.0 introduced a requirement to use FQCNs (fully
qualified collection names).
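For example (illustrative task):

    # Before
    - name: Create log directory
      file:
        path: /var/log/kolla
        state: directory

    # After: the module is addressed by its fully qualified collection name.
    - name: Create log directory
      ansible.builtin.file:
        path: /var/log/kolla
        state: directory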
Change-Id: I3f27c6bcdd200252ebb089f6377294f7e3a911a0
Signed-off-by: Maksim Malchuk <maksim.malchuk@gmail.com>
In Kolla Ansible OpenStack deployments, by default, libvirt is
configured to allow read-write access via an unauthenticated,
unencrypted TCP connection, using the internal API network. This is to
facilitate migration between hosts.
By default, Kolla Ansible does not use encryption for services on the
internal network (and did not support it until Ussuri). However, most
other services on the internal network are at least authenticated
(usually via passwords), ensuring that they cannot be used by anyone
with access to the network, unless they have credentials.
The main issue here is the lack of authentication. Any client with
access to the internal network is able to connect to the libvirt TCP
port and make arbitrary changes to the hypervisor. This could include
starting a VM, modifying an existing VM, etc. Given the flexibility of
the domain options, it could be seen as equivalent to having root access
to the hypervisor.
Kolla Ansible supports libvirt TLS [1] since the Train release, using
client and server certificates for mutual authentication and encryption.
However, this feature is not enabled by default, and requires
certificates to be generated for each compute host.
This change adds support for libvirt SASL authentication, and enables it
by default. This provides a base level of security. Deployments requiring
further security should use libvirt TLS.
[1] https://docs.openstack.org/kolla-ansible/latest/reference/compute/libvirt-guide.html#libvirt-tls
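As a globals.yml sketch, assuming the variable names
libvirt_enable_sasl (added by this change) and the existing
libvirt_tls [1]:

    # SASL authentication on the libvirt TCP transport
    # (enabled by default by this change).
    libvirt_enable_sasl: true
    # Mutual TLS for libvirt [1]; requires per-host certificates.
    libvirt_tls: true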
Depends-On: https://review.opendev.org/c/openstack/kolla/+/833021
Closes-Bug: #1964013
Change-Id: Ia91ceeb609e4cdb144433122b443028c0278b71e
Add "enable_prometheus_etcd_integration" configuration parameter which
can be used to configure Prometheus to scrape etcd metrics endpoints.
The default value of "enable_prometheus_etcd_integration" is true only
when both "enable_prometheus" and "enable_etcd" are enabled.
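As a sketch (the exact default expression in the change may differ):

    # Default: scrape etcd only when both services are enabled.
    enable_prometheus_etcd_integration: "{{ enable_prometheus | bool and enable_etcd | bool }}"
    # Deployers may also set it explicitly in globals.yml, e.g.:
    # enable_prometheus_etcd_integration: "yes"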
Change-Id: I7a0b802c5687e2d508e06baf55e355d9761e806f
While I8bb398e299aa68147004723a18d3a1ec459011e5 stopped setting
the net.ipv4.ip_forward sysctl, this change explicitly removes the
option from the Kolla sysctl config file. In the absence of another
source for this sysctl, it should revert to the default of 0 after the
next reboot.
A deployer looking to more aggressively change the value may set
neutron_l3_agent_host_ipv4_ip_forward to 0. Any deployments still
relying on the previous value may set
neutron_l3_agent_host_ipv4_ip_forward to 1.
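For example, in globals.yml:

    # Preserve the previous behaviour (keep forwarding enabled):
    neutron_l3_agent_host_ipv4_ip_forward: 1
    # Or change the value more aggressively, rather than waiting for
    # the next reboot:
    # neutron_l3_agent_host_ipv4_ip_forward: 0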
Related-Bug: #1945453
Change-Id: I9b39307ad8d6c51e215fe3d3bc56aab998d218ec
Since [1] we are no longer running keepalived directly on the CI
network, and are therefore safeguarded against such collisions.
[1] 8e40629161a329a22022d38b3cb48dea66121b36
Change-Id: Ie25b2d6d48f10c6b295795b3c82c1f8a213f2a8c
In Ironic jobs with Tenks, we saw issues with IPMI commands
failing, resulting in job failures:
Error setting Chassis Boot Parameter 5
A metal3.io commit [1] was found that fixes the issue by moving IPMI
retries from ironic to ipmitool, which has a side-effect of increasing
the timeout. This change applies the same configuration.
This change has been adapted from an analogous change in
kayobe-config-dev. [2]
[1] 6bc1499d8b
[2] Ib4fce74cebebe85c31049eafe2eeb6b28dfab041
Co-Authored-By: Mark Goddard <mark@stackhpc.com>
Change-Id: I552417b9da03b8dfc9406e0ff644092579bc7122
Ironic is dropping default_boot_option, and the new default has
been around for quite a while now, so let's remove this old
scary comment.
Change-Id: I80d645cb97251ac63e04d7ec1c87d4600d17d4ee
Since I30c2ad2bf2957ac544942aefae8898cdc8a61ec6 this container
is always enabled and thus the port should always be checked.
Change-Id: I94a70d89123611899872061bd69593280d0a68c4
Ironic has changed the default PXE to be iPXE (as opposed to plain
PXE) in Yoga. Kolla Ansible supports either one or the other, and we
tend to stick to upstream defaults, so this change enables iPXE
instead of plain PXE by default. Users are allowed to change back,
but they need to take one other action, so it is good to remind them
via upgrade notes either way.
Change-Id: If14ec83670d2212906c6e22c7013c475f3c4748a
These configuration settings were removed in Grafana 6.2. Instead we can
use [remote_cache], but it is not required since it will use database
settings by default.
Change-Id: I37966027aea9039b2ecba4214444507e9d87f513
When OpenStack is deployed with Kolla-Ansible, by default there
are no durable queues or exchanges created by the OpenStack
services in RabbitMQ. In Rabbit terminology, not being durable
is referred to as `transient`, and this means that the queue
is generally held in memory.
Whether OpenStack services create durable or transient queues is
traditionally controlled by the Oslo Notification config option:
`amqp_durable_queues`. In Kolla-Ansible, this remains set to
the default of `False` in all services. The only `durable`
objects are the `amq*` exchanges which are internal to RabbitMQ.
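For reference, enabling durable queues for a service would amount to
an oslo.config override along these lines, for example via a
kolla-ansible config override file (this change leaves the option
untouched):

    [oslo_messaging_rabbit]
    amqp_durable_queues = true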
More recently, Oslo Notification has introduced support for
Quorum queues [7]. These are a successor to durable classic
queues, however it isn't yet clear if they are a good fit for
OpenStack in general [8].
For clustered RabbitMQ deployments, Kolla-Ansible configures all
queues as `replicated` [1]. Replication occurs over all nodes
in the cluster. RabbitMQ refers to this as 'mirroring of classic
queues'.
In summary, this means that a multi-node Kolla-Ansible deployment
will end up with a large number of transient, mirrored queues
and exchanges. However, the RabbitMQ documentation warns against
this, stating that 'For replicated queues, the only reasonable
option is to use durable queues' [2]. This is discussed
further in the following bug report: [3].
Whilst we could try enabling the `amqp_durable_queues` option
for each service (this is suggested in [4]), there are
a number of complexities with this approach, including, but not
limited to:
1) RabbitMQ is planning to remove classic queue mirroring in
favor of 'Quorum queues' in a forthcoming release [5].
2) Durable queues will be written to disk, which may cause
performance problems at scale. Note that this includes
Quorum queues which are always durable.
3) Potential for race conditions and other complexity
discussed recently on the mailing list under:
`[ops] [kolla] RabbitMQ High Availability`
The remaining option, proposed here, is to use classic
non-mirrored queues everywhere, and rely on services to recover
if the node hosting a queue or exchange they are using fails.
There is some discussion of this approach in [6]. The downside
of potential message loss needs to be weighed against the real
upsides of increasing the performance of RabbitMQ, and moving
to a configuration which is officially supported and hopefully
more stable. In the future, we can then consider promoting
specific queues to quorum queues, in cases where message loss
can result in failure states which are hard to recover from.
[1] https://www.rabbitmq.com/ha.html
[2] https://www.rabbitmq.com/queues.html
[3] https://github.com/rabbitmq/rabbitmq-server/issues/2045
[4] https://wiki.openstack.org/wiki/Large_Scale_Configuration_Rabbit
[5] https://blog.rabbitmq.com/posts/2021/08/4.0-deprecation-announcements/
[6] https://fuel-ccp.readthedocs.io/en/latest/design/ref_arch_1000_nodes.html#replication
[7] https://bugs.launchpad.net/oslo.messaging/+bug/1942933
[8] https://www.rabbitmq.com/quorum-queues.html#use-cases
Partial-Bug: #1954925
Change-Id: I91d0e23b22319cf3fdb7603f5401d24e3b76a56e