kolla-ansible

Author	SHA1	Message	Date
Zuul	6dff0305c0	Merge "Remove redundant Monasca Kafka client option"	2021-08-11 11:40:12 +00:00
Zuul	c0540760e0	Merge "monasca-thresh: Fix topology submission to storm"	2021-08-10 10:59:17 +00:00
Mark Goddard	ade5bfa302	Use ansible_facts to reference facts By default, Ansible injects a variable for every fact, prefixed with ansible_. This can result in a large number of variables for each host, which at scale can incur a performance penalty. Ansible provides a configuration option [0] that can be set to False to prevent this injection of facts. In this case, facts should be referenced via ansible_facts.<fact>. This change updates all references to Ansible facts within Kolla Ansible from using individual fact variables to using the items in the ansible_facts dictionary. This allows users to disable fact variable injection in their Ansible configuration, which may provide some performance improvement. This change disables fact variable injection in the ansible configuration used in CI, to catch any attempts to use the injected variables. [0] https://docs.ansible.com/ansible/latest/reference_appendices/config.html#inject-facts-as-vars Change-Id: I7e9d5c9b8b9164d4aee3abb4e37c8f28d98ff5d1 Partially-Implements: blueprint performance-improvements	2021-06-23 10:38:06 +01:00
Michal Arbet	7da770d290	Add missing region_name in keystoneauth sections Closes-Bug: #1933025 Change-Id: Ib67d715ddfa986a5b70a55fdda39e6d0e3333162	2021-06-22 08:35:35 +02:00
Scott Shambarger	aea9bf3550	monasca-thresh: Fix topology submission to storm monasca-thresh currently runs a local copy of the storm to handle the threshold topology. However, it doesn't set up the environment correctly, and the executable fails, causing the container to continually restart. This patch updates the container command to correctly submit the topology to the running Apache storm. The container will exit after it finishes the submission, so the restart_policy is updated to on-failure; this way, if the storm is temporarily unavailable, the submission will be retried. (NOTE: further deploys will see the container as "changed" as it won't be running) Patch uses KOLLA_BOOTSTRAP to trigger the container to check if the topology is already submitted, and if so skips the submission command so the container doesn't fail. The config task now triggers a new reconfigure handler that spawns a one-shot container to replace any existing topology if the configuration has changed. Also, all the storm.* variables in storm.yml.j2 are removed as they were only needed for local mode and make submitted topologies fail to load when the storm is restarted (the referenced directories are not mounted on nimbus). Depends-On: https://review.opendev.org/c/openstack/kolla/+/792751 Closes-Bug: #1808805 Change-Id: Ib225d76076782d695c9387e1c2693bae9a4521d7	2021-06-06 13:41:29 -07:00
Doug Szumski	2b3284b3f3	Remove redundant Monasca Kafka client option This override is now the default. Change-Id: I98cbf71532b2bc068ab4f34e648a5dad15139f6f	2021-04-27 11:20:34 +00:00
Doug Szumski	82cf40edf2	Remove Monasca Grafana service In the Xena cycle it was decided to remove the Monasca Grafana fork due to lack of maintenance. This commit removes the service and provides a limited workaround using the Monasca Grafana datasource with vanilla Grafana. Depends-On: I9db7ec2df050fa20317d84f6cea40d1f5fd42e60 Change-Id: I4917ece1951084f6665722ba9a91d47764d3709a	2021-04-27 11:06:25 +00:00
Zuul	d3a1a1a504	Merge "Support disabling Monasca alerting pipeline"	2021-03-24 19:02:54 +00:00
Zuul	6c18e5814e	Merge "Remove Monasca Log Transformer"	2021-03-24 18:21:04 +00:00
Doug Szumski	647ff667e6	Add variable for changing Apache HTTP timeout In services which use the Apache HTTP server to service HTTP requests, there exists a TimeOut directive [1] which defaults to 60 seconds. APIs which come under heavy load, such as Cinder, can sometimes exceed this which results in a HTTP 504 Gateway timeout, or similar. However, the request can still be serviced without error. For example, if Nova calls the Cinder API to detach a volume, and this operation takes longer than the shortest of the two timeouts, Nova will emit a stack trace with a 504 Gateway timeout. At some time later, the request to detach the volume will succeed. The Nova and Cinder DBs then become out-of-sync with each other, and frequently DB surgery is required. Although strictly this category of bugs should be fixed in OpenStack services, it is not realistic to expect this to happen in the short term. Therefore, this change makes it easier to set the Apache HTTP timeout via a new variable. An example of a related bug is here: https://bugs.launchpad.net/nova/+bug/1888665 Whilst this timeout can currently be set by overriding the WSGI config for individual services, this change makes it much easier. Change-Id: Ie452516655cbd40d63bdad3635fd66693e40ce34 Closes-Bug: #1917648	2021-03-04 11:25:06 +00:00
Doug Szumski	444097848c	Support disabling Monasca alerting pipeline The Monasca alerting pipeline provides multi-tenancy alerts and notifications. It runs as an Apache Storm topology and generally places a significant memory and CPU burden on monitoring hosts, particularly when there are a lot of metrics. This is fine if the alerting service is in use, but sometimes it is not. For example, you may use Prometheus for monitoring the control plane, and wish to offer tenants a monitoring service via Monasca without alerting and notification functionality. In this case it makes sense to disable this part of the Monasca pipeline, and this patch adds support for that. If the service is ever re-enabled, all alerts and notifications should spawn back automatically since they are persisted in the central mysql database cluster. Change-Id: I84aa04125c621712f805f41c8efbc92c8e156db9	2021-03-04 09:19:44 +00:00
Doug Szumski	0743a9bf4b	Remove Monasca Log Transformer Historically Monasca Log Transformer has been used for log standardisation and processing. For example, logs from different sources may use slightly different error levels such as WARN, 5, or WARNING. Monasca Log Transformer is a place where these could be 'squashed' into a single error level to simplify log searches based on labels such as these. However, in Kolla Ansible, we do this processing in Fluentd so that the simpler Fluentd -> Elastic -> Kibana pipeline also benefits. This helps to avoid spreading out log parsing configuration over many services, with the Fluentd Monasca output plugin being yet another potential place for processing (which should be avoided). It therefore makes sense to remove this service entirely, and squash any existing configuration which can't be moved to Fluentd into the Log Persister service. I.e. by removing this pipeline, we don't lose any functionality, we encourage log processing to take place in Fluentd, or at least outside of Monasca, and we make significant gains in efficiency by removing a topic from Kafka which contains a copy of all logs in transit. Finally, users forwarding logs from outside the control plane, e.g. from tenant instances, should be encouraged to process the logs at the point of sending using whichever framework they are forwarding them with. This makes sense, because all Logstash configuration in Monasca is only accessible by control plane admins. A user can't typically do any processing inside Monasca, with or without this change. Change-Id: I65c76d0d1cd488725e4233b7e75a11d03866095c	2021-03-03 17:20:18 +00:00
Zuul	90a079b8a7	Merge "Update String type for Monasca ES template"	2021-02-16 17:11:55 +00:00
Bartosz Bezak	3d955f3043	Monasca log-metrics - Drop "notice" and "note" loglevel metrics by default Those loglevels can build up over time and create unnecessary high metrics cardinality. Change-Id: Ib1a03772d0bd58758430b37b4f2f67126cf86fa3 Closes-bug: #1906796	2020-12-04 10:48:40 +01:00
Pierre Riteau	c81772024c	Reduce the use of SQLAlchemy connection pooling When the internal VIP is moved in the event of a failure of the active controller, OpenStack services can become unresponsive as they try to talk with MariaDB using connections from the SQLAlchemy pool. It has been argued that OpenStack doesn't really need to use connection pooling with MariaDB [1]. This commit reduces the use of connection pooling via two configuration options: - max_pool_size is set to 1 to allow only a single connection in the pool (it is not possible to disable connection pooling entirely via oslo.db, and max_pool_size = 0 means unlimited pool size) - lower connection_recycle_time from the default of one hour to 10 seconds, which means the single connection in the pool will be recreated regularly These settings have shown better reactivity of the system in the event of a failover. [1] http://lists.openstack.org/pipermail/openstack-dev/2015-April/061808.html Change-Id: Ib6a62d4428db9b95569314084090472870417f3d Closes-Bug: #1896635	2020-09-22 17:54:45 +02:00
Radosław Piliszek	9c38a0c77b	Drop python-path It was found to be useless in [1]. It is one of distro_python_version usages. Note Freezer and Horizon still use python_path (and hence distro_python_version) for different purposes. [1] https://review.opendev.org/675822 Change-Id: I6d6d9fdf4c28cb2b686d548955108c994b685bb1 Partially-Implements: blueprint drop-distro-python-version	2020-08-24 07:38:21 +00:00
Doug Szumski	d3e87a2e4d	Update String type for Monasca ES template This updates the Elasticsearch template used by Monasca to persist logs so that it uses the 'new' string types [1]. As an aside, it helps to make the template clearer: full text search for log messages, and keyword searches for everything else. [1] https://www.elastic.co/blog/strings-are-dead-long-live-strings Closes-Bug: #1892376 Change-Id: I0cd6bf22d4695d88d93241da4364d170d8d8c80e	2020-08-20 14:54:03 +00:00
James Kirsch	19b028e660	Add Keep Alive Timeout for httpd This patch introduces a global keep alive timeout value for services that leverage httpd + wsgi to handle http/https requests. The default value is one minute. Change-Id: Icf7cb0baf86b428a60a7e9bbed642999711865cd Partially-Implements: blueprint add-ssl-internal-network	2020-08-13 09:52:40 +00:00
Doug Szumski	46b68015f3	Use Confluent Kafka client in remaining Monasca services Switch to the Confluent Kafka client in all remaining Python based Monasca services. This should allow us to later un-pin the Kafka messaging version for Monasca. Change-Id: I42bc78ffe304ba21c448c2e08b025e93a70ddb44	2020-07-15 09:55:25 +01:00
Bartosz Bezak	17d8332604	Logstash 6 support Co-Authored-By: Doug Szumski <doug@stackhpc.com> Closes-Bug: #1884090 Depends-On: https://review.opendev.org/#/c/736768 Change-Id: If2d0dd1739e484b14e3c15a185a236918737b0ab	2020-07-15 08:54:53 +00:00
Doug Szumski	de84b33e12	Revert rename of Monasca API config file I9b6bf5b6690f4b4b3445e7d15a40e45dd42d2e84 was updated to use the original config file name during review, but the config file was not renamed accordingly. The result is that an empty config file is written out. TrivialFix Change-Id: I5d0384b38ddb38133e5e11df85d8cf76f4044a64	2020-06-18 09:50:18 +01:00
Zuul	522bc17981	Merge "Fix bug in deploying monasca_agent_forwarder"	2020-06-08 11:42:25 +00:00
xiaojueguan	36587e4614	Fix bug in deploying monasca_agent_forwarder Change-Id: I8633f7d250f331ca96788d8f4796889c3c312406 Closes-Bug: #1882259	2020-06-05 23:28:28 +08:00
Doug Szumski	b39a0f805a	Switch to Monasca API for logs The Monasca Log API has been removed and in this change we switch to using the unified API. If dedicated log APIs are required then this can be supported through configuration. Out of the box the Monasca API is used for both logs and metrics which is envisaged to work for most use cases. In order to use the unified API for logs, we need to disable the legacy Kafka client. We also rename the Monasca API config file to remove a warning about using the old style name. Depends-On: https://review.opendev.org/#/c/728638 Change-Id: I9b6bf5b6690f4b4b3445e7d15a40e45dd42d2e84	2020-05-23 17:49:32 +01:00
Mark Goddard	3310a142d1	Fix monasca deployment due to monasca_log_dir Monasca deployment fails on master due to an invalid variable reference (monasca_log_dir) in the config.json for monasca API and monasca log API. This change fixes the issue by correcting the variable definition. Change-Id: I2ec497fa430c2f301dca6a7653ac988e49007469 Closes-Bug: #1864181	2020-04-08 17:09:50 +01:00
Mark Goddard	0edad7138c	Remove default(omit) from openstack_cacert in templates The use of default(omit) is for module parameters, not templates. We define a default value for openstack_cacert, so it should never be undefined anyway. Change-Id: Idfa73097ca168c76559dc4f3aa8bb30b7113ab28	2020-04-03 14:49:11 +01:00
Mark Goddard	70008536a3	Python 3: Use distro_python_version for monasca agent CA file Change-Id: Ia840cd037cd2c2eded429bd0edaede4bb44caa8e Partially-Implements: blueprint python-3	2020-01-30 14:10:41 +00:00
Mark Goddard	c56d273c93	Python 3: Use distro_python_version for WSGI python_path Currently the WSGI configuration for binary images uses python2.7 site-packages in some places. This change uses distro_python_version to select the correct python path. Change-Id: Id5f3f0ede106498b9264942fa0399d7c7862c122 Partially-Implements: blueprint python-3	2020-01-30 14:08:13 +00:00
James Kirsch	c15dc20341	Configure services to use Certificate Authority Include a reference to the globally configured Certificate Authority to all services. Services use the CA to verify HTTPs connections. Change-Id: I38da931cdd7ff46cce1994763b5c713652b096cc Partially-Implements: blueprint support-trusted-ca-certificate-file	2020-01-13 11:00:11 -08:00
Michal Nasiadka	3f55b87069	Improve Apache logging Currently we don't put global Apache error logs into /var/log/kolla, this change adds statements that redirect those logs there. Adapted the logfile names to catch into openstack wsgi logging fluentd input config and existing logrotate cron entries. Change-Id: I21216e688a1993239e3e81411a4e8b6f13e138c2	2019-12-06 13:11:49 +00:00
Michal Nasiadka	0240763d7d	Add proper wsgi loglevel when openstack_logging_debug Change-Id: I51144d92f34ed51c499a4119c059e6475d02eb46	2019-10-24 09:33:05 +00:00
Radosław Piliszek	bc053c09c1	Implement IPv6 support in the control plane Introduce kolla_address filter. Introduce put_address_in_context filter. Add AF config to vars. Address contexts: - raw (default): <ADDR> - memcache: inet6:[<ADDR>] - url: [<ADDR>] Other changes: globals.yml - mention just IP in comment prechecks/port_checks (api_intf) - kolla_address handles validation 3x interface conditional (swift configs: replication/storage) 2x interface variable definition with hostname (haproxy listens; api intf) 1x interface variable definition with hostname with bifrost exclusion (baremetal pre-install /etc/hosts; api intf) neutron's ml2 'overlay_ip_version' set to 6 for IPv6 on tunnel network basic multinode source CI job for IPv6 prechecks for rabbitmq and qdrouterd use proper NSS database now MariaDB Galera Cluster WSREP SST mariabackup workaround (socat and IPv6) Ceph naming workaround in CI TODO: probably needs documenting RabbitMQ IPv6-only proto_dist Ceph ms switch to IPv6 mode Remove neutron-server ml2_type_vxlan/vxlan_group setting as it is not used (let's avoid any confusion) and could break setups without proper multicast routing if it started working (also IPv4-only) haproxy upgrade checks for slaves based on ipv6 addresses TODO: ovs-dpdk grabs ipv4 network address (w/ prefix len / submask) not supported, invalid by default because neutron_external has no address No idea whether ovs-dpdk works at all atm. ml2 for xenapi Xen is not supported too well. This would require working with XenAPI facts. rp_filter setting This would require meddling with ip6tables (there is no sysctl param). By default nothing is dropped. Unlikely we really need it. ironic dnsmasq is configured IPv4-only dnsmasq needs DHCPv6 options and testing in vivo. 
KNOWN ISSUES (beyond us): One cannot use IPv6 address to reference the image for docker like we currently do, see: https://github.com/moby/moby/issues/39033 (docker_registry; docker API 400 - invalid reference format) workaround: use hostname/FQDN RabbitMQ may fail to bind to IPv6 if hostname resolves also to IPv4. This is due to old RabbitMQ versions available in images. IPv4 is preferred by default and may fail in the IPv6-only scenario. This should be no problem in real life as IPv6-only is indeed IPv6-only. Also, when new RabbitMQ (3.7.16/3.8+) makes it into images, this will no longer be relevant as we supply all the necessary config. See: https://github.com/rabbitmq/rabbitmq-server/pull/1982 For reliable runs, at least Ansible 2.8 is required (2.8.5 confirmed to work well). Older Ansible versions are known to miss IPv6 addresses in interface facts. This may affect redeploys, reconfigures and upgrades which run after VIP address is assigned. See: https://github.com/ansible/ansible/issues/63227 Bifrost Train does not support IPv6 deployments. See: https://storyboard.openstack.org/#!/story/2006689 Change-Id: Ia34e6916ea4f99e9522cd2ddde03a0a4776f7e2c Implements: blueprint ipv6-control-plane Signed-off-by: Radosław Piliszek <radoslaw.piliszek@gmail.com>	2019-10-16 10:24:35 +02:00
Zuul	91108c3fac	Merge "Moves monasca-thresh java.io.tmpdir to existing docker volume"	2019-08-28 08:13:17 +00:00
Zuul	d191da6709	Merge "Fixes Monasca log transformer UTC offset exception"	2019-08-28 07:48:52 +00:00
Isaac Prior	3010d4c391	Fixes Monasca log transformer UTC offset exception Monasca log transformer currently throws exceptions on encountering a non-UTC time offset (+0000): """ "exception": "Invalid format: \"2019-08-08 17:39:45 +0100\" is malformed at \" +0100\"", "config_parsers":"yyyy-MM-dd HH:mm:ss +0000,ISO8601"} """ This fix allows logstash to interpret any valid ISO8601 offset. Change-Id: Id70c3dd9cdcf681e955931f18a054e19cc284c0a Closes-Bug: #1839597	2019-08-13 08:46:29 +00:00
Doug Szumski	65b9756127	Add support for using custom Logstash patterns A user may want to define and use Logstash patterns. This commit adds support to copy them into the Monasca Log Transformer container. In the future support could be added for other Logstash containers. Change-Id: Id8cde14af6dc7f49714f6b1cb878882d0048d293	2019-08-08 10:48:35 +01:00
Isaac Prior	df93d31fe0	Moves monasca-thresh java.io.tmpdir to existing docker volume This prevents the container's root filesystem from filling up. Change-Id: Icc5a08c82312d6688edf2ef36562967ac94e8ac9 Depends-On: https://review.opendev.org/#/c/674779 Closes-Bug: #1839149	2019-08-06 14:18:30 +00:00
Zuul	f7c0e3cdbe	Merge "Remove obsolete roles middleware"	2019-06-13 19:24:20 +00:00
Zuul	922a262345	Merge "Fix issues obtaining Keystone token with Monasca Grafana"	2019-06-13 19:15:43 +00:00
Doug Szumski	5eb58050f1	Add load monitoring plugin config for Monasca This simple plugin supports gathering load metrics from /proc. Change-Id: I536aff093a2af3c6d0d69ae6cbe454aee950f358	2019-06-07 10:54:55 +01:00
Doug Szumski	76e98472f4	Supporting monitoring time synchronisation with Monasca This plugin is useful for monitoring host clock synchronisation with an NTP reference. If the delta becomes too large, the metrics from this plugin can be used to trigger an alarm. Change-Id: Id1fe6d7c823f8404c19c81ccdeb8b311bcb46e47	2019-06-07 10:54:50 +01:00
Doug Szumski	f23901677c	Remove obsolete roles middleware Change I0ca38f2cc7d63b9b47eedb304ba7b00a94816f9a removed the roles middleware from the example paste pipeline. Change-Id: Ie9a3b0fef395aaf414407f6bae1ac4bca158240d	2019-05-24 11:31:07 +01:00
Doug Szumski	b805726ca1	Fix issues obtaining Keystone token with Monasca Grafana When using the default domain name there are issues authenticating with Keystone. For example, you can only log in on the second attempt and the Monasca datasource fails to authenticate. Switching to the default domain id resolves these issues. Change-Id: I2cb4b2608c74dd853c97e4fc27078930bc72fdf8	2019-05-09 12:02:54 +01:00
Mark Goddard	a0e214115c	Make monasca notification templates optional backport: stein If I deploy monasca by setting enable_monasca to true, the monasca_notification restarts with the following error: ERROR:__main__:MissingRequiredSource: /var/lib/kolla/config_files/notification_templates/* file is not found These templates are optional, so we need to mark this directory as optional in config.json. Change-Id: Ia2dd835daa7ab1153617cc92f17c2d8d498c73e0 Closes-Bug: #1823726	2019-04-08 15:10:41 +01:00
Doug Szumski	e2ed302312	Parse Monasca Log API timestamps correctly By parsing the creation_time timestamp in Logstash, Elasticsearch can parse it correctly. This closes a bug where the creation_time timestamp was shown as a date shortly after the epoch (1970) when viewed in Kibana. Closes-Bug: #1816585 Change-Id: I00decfe94607845ef0eae9bec631a0e729aac3fa	2019-02-19 14:06:52 +00:00
Kien Nguyen	043943117d	Use <project>_install_type instead of kolla_install_type Use <project>_install_type instead of kolla_install_type to set python_path. For example, general kolla_install_type is 'binary', but user wants to deploy Horizon from 'source'. Horizon templates still use python_path=/usr/share/openstack-dashboard, it is wrong. Change-Id: Ide6a24e17b1f8ab6506aa5e53f70693706830418	2019-01-04 14:33:46 +07:00
Zuul	445e4f7640	Merge "Support custom monasca-notification templates"	2018-11-20 09:41:21 +00:00
Doug Szumski	cfc86645c9	Collect StatsD metrics from Monasca services Some Monasca services support sending StatsD metrics to allow monitoring those services. This commit connects these services to the StatsD service provided by the Monasca Agent. Partially-Implements: blueprint monasca-roles Change-Id: I1da376384a31b89fea1b8a6f907aea35282909a4	2018-11-07 20:24:19 +00:00
Doug Szumski	712c89760c	Add support for deploying Monasca Grafana The Monasca Grafana fork allows users to log into Grafana with their OpenStack user credentials and see metrics associated with their OpenStack project. The long term goal is to enable Keystone support in upstream Grafana, but this work seems to have stalled. Partially-Implements: blueprint monasca-grafana Change-Id: Icc04613b2571c094ae23b66d0bcc38b58c0ee4e1	2018-11-02 13:35:35 +00:00
Doug Szumski	6cbb5cbdb4	Support using external DBs in Monasca This change allows the user to configure a Monasca database which may be different from the default database. Partially-Implements: blueprint monasca-roles Change-Id: Ia905190b8037ecb1782a758c0b65581fe9024bf6	2018-11-02 13:04:06 +00:00
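
The IPv6 commit in this log (bc053c09c1) lists three address contexts: raw (`<ADDR>`), memcache (`inet6:[<ADDR>]`) and url (`[<ADDR>]`). The following is a minimal Python sketch of that mapping, not the Kolla Ansible filter itself. The function name mirrors the `put_address_in_context` filter named in the commit, but the signature, the pass-through of IPv4 addresses and hostnames, and the error handling are assumptions made for illustration.

```python
import ipaddress


def put_address_in_context(address: str, context: str = "raw") -> str:
    """Illustrative re-creation of the address contexts from the commit message.

    Assumption: only IPv6 literals are wrapped; IPv4 addresses and
    hostnames are returned unchanged in every context.
    """
    if context not in ("raw", "memcache", "url"):
        raise ValueError(f"unknown context: {context!r}")

    try:
        is_ipv6 = ipaddress.ip_address(address).version == 6
    except ValueError:
        # Not an IP literal (for example a hostname): leave it as-is.
        is_ipv6 = False

    if is_ipv6 and context == "memcache":
        return f"inet6:[{address}]"
    if is_ipv6 and context == "url":
        return f"[{address}]"
    return address


if __name__ == "__main__":
    for ctx in ("raw", "memcache", "url"):
        print(ctx, put_address_in_context("fd00::1", ctx))
```

The bracketed form is what lets an IPv6 literal be followed by a `:port` suffix in a URL without ambiguity, which is presumably why the commit distinguishes the url context from the raw one.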

73 Commits