12196 Commits

Author SHA1 Message Date
LinPeiWen
10f79eda1d Use Docker healthchecks for elasticsearch services
This change enables the use of Docker healthchecks for elasticsearch services.
Implements: blueprint container-health-check

Change-Id: Ib50d56cc206a80b20a4f96e95bcdd8d11269234f
2021-03-11 15:13:58 +00:00
Zuul
067a752206 Merge "Bump up supported Ansible version to 2.10" 2021-03-11 12:30:29 +00:00
LinPeiWen
28dbde6add Use Docker healthchecks for kibana services
This change enables the use of Docker healthchecks for kibana services.
Implements: blueprint container-health-check

Change-Id: If7525e193c245435410b4071aed6d3b566424219
2021-03-11 11:28:12 +00:00
Mark Goddard
32f1b6cf56 ansible-lint: add unnamed-task to the skip list
The 5.0.3 release of ansible-lint makes the linters job fail, due to the
addition of an unnamed-task rule. We have quite a large number of
unnamed tasks, typically include_tasks or include_role. We may want to
address this at some point, but not right now.

Change-Id: I4ee706fe2463d62e4e16412117d4373748ea43e3
2021-03-11 09:10:16 +00:00
Zuul
3c5e0f9f28 Merge "Reduce number of logs and disable ara HTML report" 2021-03-10 19:22:45 +00:00
Zuul
c2360f549e Merge "Introduce nova_libvirt_logging_debug" 2021-03-10 19:22:10 +00:00
Michał Nasiadka
191b46ef40 Reduce number of logs and disable ara HTML report
- Remove /var/log/kolla link to omit uploading the same logs twice
- Remove ARA HTML report (usually takes around 120MB) - can be easily
  generated from the sqlite db

Change-Id: I74cd6d1128689ab2c73f00ee08af3778d7d670a4
2021-03-10 15:16:05 +00:00
Michał Nasiadka
eabdf1e9f9 Introduce nova_libvirt_logging_debug
In order to disable libvirt debug in CI (which takes vast amount of storage)
this change introduces nova_libvirt_logging_debug and disables that in CI.

Change-Id: I90bfd1b300ad3202ea4d139fda6d6beb44c5820f
2021-03-10 15:15:29 +00:00
Maksim Malchuk
835920782f Correctly configure S3 Token Middleware for Swift
According the documentation [1] there need to configure auth_uri in the
[filter:s3token] section instead of www_authenticate_uri which cause an
error 'swift.common.wsgi.ConfigFileError: Invalid auth_uri; must
include scheme and host' during start the swift-proxy-server container.

1. https://docs.openstack.org/swift/ussuri/middleware.html#s3-token-middleware

Change-Id: I6b8f5807ebb746428a501dca13eae30763dede8d
Closes-Bug: 1862765
Signed-off-by: Maksim Malchuk <maksim.malchuk@gmail.com>
2021-03-10 13:03:35 +00:00
Zuul
02371e0c82 Merge "Use Docker healthchecks for trove services" 2021-03-10 10:05:37 +00:00
Zuul
343268f060 Merge "Use Docker healthchecks for octavia services" 2021-03-10 10:05:18 +00:00
Zuul
4aa44f2caa Merge "Use Docker healthchecks for ironic services" 2021-03-10 03:42:35 +00:00
wuchunyang
f332fe8bb5 Remove cinder v2 api
Cinder api v2 support removed in  wallaby.
refer to http://lists.openstack.org/pipermail/openstack-discuss/2020-November/018697.html

Change-Id: I120101d6c30f9169f4c3811357b5ed06496c90ff
2021-03-10 11:24:40 +08:00
Zuul
261cce4f45 Merge "Add missing elasticsearch cloudkitty storage and prometheus collector backend support." 2021-03-09 20:18:28 +00:00
Zuul
cc1dda3035 Merge "Add Neutron DHCP agent to OVN networking setup" 2021-03-09 20:15:28 +00:00
Zuul
f0e4b582f7 Merge "Use Docker healthchecks for aodh services" 2021-03-09 20:15:12 +00:00
Zuul
ca8ab179a1 Merge "Use Docker healthchecks for gnocchi services" 2021-03-09 19:28:43 +00:00
Zuul
8fc45da505 Merge "Use Docker healthchecks for designate services" 2021-03-09 19:08:29 +00:00
Zuul
d09647f853 Merge "Use User and Group variables instead of sudo" 2021-03-09 19:05:41 +00:00
Zuul
5ccd134cd1 Merge "Deprecate Prometheus 1.x" 2021-03-09 17:51:13 +00:00
Zuul
af5e4f3446 Merge "Rename eswitchd.conf -> eswitchd.conf.j2" 2021-03-09 12:35:55 +00:00
wuchunyang
13b6d692b0 Use User and Group variables instead of sudo
use User and Group in octavia-interface service.
this way looks more graceful

this change also fix the octavia-interface does not restart
when the octavia-interface.service changed.

Change-Id: I9d253d7519b1774985d165a080047fdbecd321b4
2021-03-09 17:25:58 +08:00
LinPeiWen
bca6a34fcc Use Docker healthchecks for vitrage services
This change enables the use of Docker healthchecks for vitrage services.
Implements: blueprint container-health-check

Change-Id: Icd2a8471a08b737352cfe941b5d38b02ed3b19fe
2021-03-08 14:49:33 +00:00
LinPeiWen
c3969c4a7d Use Docker healthchecks for aodh services
This change enables the use of Docker healthchecks for aodh services.
Implements: blueprint container-health-check

Change-Id: Ic477ac77f7ea5c50de473382fb1ec63d78043267
2021-03-08 14:29:52 +00:00
LinPeiWen
3e9ad63187 Use Docker healthchecks for gnocchi services
This change enables the use of Docker healthchecks for gnocchi services.
Implements: blueprint container-health-check

Change-Id: Ib1eb766d061c376c0bdd693dd6ea1e5b3e3fc02a
2021-03-08 14:27:33 +00:00
LinPeiWen
cb537eb8d3 Use Docker healthchecks for ironic services
This change enables the use of Docker healthchecks for ironic services.
Implements: blueprint container-health-check

Change-Id: If0a11db5470899c3a0e69ca94fdd0903daadcf8b
2021-03-08 14:18:03 +00:00
LinPeiWen
51171452a8 Use Docker healthchecks for designate services
This change enables the use of Docker healthchecks for designate services.
Implements: blueprint container-health-check

Change-Id: Ide66a914b790ad9c106ba932a923654a960f7abf
2021-03-08 14:07:43 +00:00
Michał Nasiadka
1ccccbcf0a Bump up supported Ansible version to 2.10
Min version stays as 2.9

Change-Id: I7ec8c5eb36757248c9aa016dc7d4e495ec5bb635
2021-03-08 11:40:31 +00:00
Pierre Riteau
0d33a2f598 Document calculation of indentation
Change-Id: I0f107d8239ae0cbef3bb4569ec5d77326f8974fc
2021-03-08 10:29:42 +01:00
Zuul
7c0c4dd6fd Merge "Remove whitespace around equal signs in zookeeper.cfg" 2021-03-07 11:04:54 +00:00
Zuul
4e7d1ab9cb Merge "Add variable for changing Apache HTTP timeout" 2021-03-07 10:41:37 +00:00
Doug Szumski
92f252af4b Follow up fixes for Monasca docs and formatting
Change-Id: Ief84e093829677c97c8df9a08aefca43b1e51aac
2021-03-07 10:10:22 +00:00
Pierre Riteau
2b723ead05 Remove whitespace around equal signs in zookeeper.cfg
The merge_configs action plugin is updated with an option to control
whether whitespace should be used around equal signs. The default
remains to use whitespace: only zookeeper.cfg doesn't use them.

This is to avoid issues with the zkCleanup.sh script which expects no
whitespace. The ZooKeeper documentation also uses no whitespace in
configuration file examples.

Change-Id: Ia082a1c002cc4e8b04f7696fdee827b747c6d13f
Closes-Bug: #1917490
2021-03-05 15:12:23 +01:00
Zuul
cef8c8b3de Merge "CI: octavia: create and test a load balancer" 2021-03-05 11:00:56 +00:00
Zuul
f325add2e2 Merge "CI: octavia: remove octavia from magnum scenario" 2021-03-05 11:00:14 +00:00
Zuul
aad7a731aa Merge "octavia: support tenant management network" 2021-03-05 10:59:49 +00:00
Zuul
c02a223769 Merge "CI: Move from ceph-ansible to cephadm" 2021-03-05 09:11:32 +00:00
Zuul
9194a3e99b Merge "Fix Cinder log parsing" 2021-03-04 13:03:17 +00:00
Doug Szumski
647ff667e6 Add variable for changing Apache HTTP timeout
In services which use the Apache HTTP server to service HTTP requests,
there exists a TimeOut directive [1] which defaults to 60 seconds. APIs
which come under heavy load, such as Cinder, can sometimes exceed this
which results in a HTTP 504 Gateway timeout, or similar. However, the
request can still be serviced without error. For example, if Nova calls
the Cinder API to detach a volume, and this operation takes longer
than the shortest of the two timeouts, Nova will emit a stack trace
with a 504 Gateway timeout. At some time later, the request to detach
the volume will succeed. The Nova and Cinder DBs then become
out-of-sync with each other, and frequently DB surgery is required.

Although strictly this category of bugs should be fixed in OpenStack
services, it is not realistic to expect this to happen in the short
term. Therefore, this change makes it easier to set the Apache HTTP
timeout via a new variable.

An example of a related bug is here:

https://bugs.launchpad.net/nova/+bug/1888665

Whilst this timeout can currently be set by overriding the WSGI
config for individual services, this change makes it much easier.

Change-Id: Ie452516655cbd40d63bdad3635fd66693e40ce34
Closes-Bug: #1917648
2021-03-04 11:25:06 +00:00
Doug Szumski
ca1a80ab2f Support bypassing Monasca Log API for control plane logs
This change allows a user to forward control plane logs
directly to Elasticsearch from Fluentd, rather than via
the Monasca Log API when Monasca is enabled. The Monasca
Log API can continue to handle tenant logs.

For many use cases this is simpler, reduces resource
consumption and helps to decouple control plane logging
services from tenant logging services.

It may not always be desired, so is optional and off by
default.

Change-Id: I195e8e4b73ca8f573737355908eb30a3ef13b0d6
2021-03-04 09:19:50 +00:00
Doug Szumski
444097848c Support disabling Monasca alerting pipeline
The Monasca alerting pipeline provides multi-tenancy alerts and
notifications. It runs as an Apache Storm topology and generally
places a significant memory and CPU burden on monitoring hosts,
particularly when there are lot of metrics. This is fine if the
alerting service is in use, but sometimes it is not. For example
you may use Prometheus for monitoring the control plane, and
wish to offer tenants a monitoring service via Monasca without
alerting and notification functionality. In this case it makes
sense to disable this part of the Monasca pipeline and this patch
adds support for that.

If the service is ever re-enabled, all alerts and notifications
should spawn back automatically since they are persisted in the
central mysql database cluster.

Change-Id: I84aa04125c621712f805f41c8efbc92c8e156db9
2021-03-04 09:19:44 +00:00
wuchunyang
366ba8526e CI: octavia: create and test a load balancer
Co-Authored-By: Mark Goddard <mark@stackhpc.com>
Change-Id: I20c1d9bddee5a046b521b9378bade2ab50612be0
2021-03-04 09:27:38 +08:00
Doug Szumski
a52d661219 Disable Monasca Log Metrics service by default
The Log Metrics service is an admin only service. We now have
support in Fluentd via the Prometheus plugin to create metrics
from logs. These metrics can be scraped into Monasca or Prometheus.
It therefore makes sense to deprecate this service, starting by
disabling it by default, and then removing it in the Xena release.
This should improve the stability of the Monasca metrics pipeline
by ensuring that all metrics pass via the Monasca API for
validation, and ensure that metrics generated from logs are
available to both Prometheus and Monasca users by default.

Change-Id: I704feb4434c1eece3eb00c19dc5f934fd4bc27b4
2021-03-03 17:20:18 +00:00
Doug Szumski
0743a9bf4b Remove Monasca Log Transformer
Historically Monasca Log Transformer has been for log
standardisation and processing. For example, logs from different
sources may use slightly different error levels such as WARN, 5,
or WARNING. Monasca Log Transformer is a place where these could
be 'squashed' into a single error level to simplify log searches
based on labels such as these.

However, in Kolla Ansible, we do this processing in Fluentd so
that the simpler Fluentd -> Elastic -> Kibana pipeline also
benefits. This helps to avoid spreading out log parsing
configuration over many services, with the Fluentd Monasca output
plugin being yet another potential place for processing (which
should be avoided). It therefore makes sense to remove this
service entirely, and squash any existing configuration which
can't be moved to Fluentd into the Log Perister service. I.e.
by removing this pipeline, we don't loose any functionality,
we encourage log processing to take place in Fluentd, or at least
outside of Monasca, and we make significant gains in efficiency
by removing a topic from Kafka which contains a copy of all logs
in transit.

Finally, users forwarding logs from outside the control plane,
eg. from tenant instances, should be encouraged to process the
logs at the point of sending using whichever framework they are
forwarding them with. This makes sense, because all Logstash
configuration in Monasca is only accessible by control plane
admins. A user can't typically do any processing inside Monasca,
with or without this change.

Change-Id: I65c76d0d1cd488725e4233b7e75a11d03866095c
2021-03-03 17:20:18 +00:00
Zuul
1fb56cf5be Merge "Use Docker healthchecks for magnum services" 2021-03-03 17:07:03 +00:00
Zuul
21b9b5092f Merge "[CI] Cinder upgrade testing" 2021-03-03 16:47:27 +00:00
Zuul
933d575bd5 Merge "Revert "CI: Temporarily disable rabbitmq internal tls"" 2021-03-03 16:38:22 +00:00
wuchunyang
2cb83b9ae4 Use Docker healthchecks for watcher services
This change enables the use of Docker healthchecks for watcher
services.
Implements: blueprint container-health-check

Change-Id: I0774063dd970507e566637138167ed1af9a2874c
2021-03-03 21:43:57 +08:00
LinPeiWen
d2d91d9ec1 Use Docker healthchecks for senlin services
This change enables the use of Docker healthchecks for senlin services.
Implements: blueprint container-health-check

Change-Id: I148a0174c46244241580265d0fedc0fb1b92602c
2021-03-03 11:23:09 +00:00
wuchunyang
6973c71a60 Use Docker healthchecks for cyborg services
This change enables the use of Docker healthchecks for cyborg
services.
Implements: blueprint container-health-check

Change-Id: I5326b142eaa826f97c32498cd2a9a0cba65be698
2021-03-03 11:10:39 +00:00