1176 Commits

Author SHA1 Message Date
Steve Wilkerson
9422e970a7 Add Armada deployment job to openstack-helm-infra
This adds both a periodic and experimental job for deploying Ceph
and the LMA components via Armada to openstack-helm-infra

Change-Id: Ia3b557801d4f4b667d82eb47a6ef1825394ee526
2019-02-04 14:07:07 +00:00
Zuul
d1c14976ba Merge "Disable nonfunctional experimental check jobs" 2019-02-04 13:11:15 +00:00
Zuul
d3531f1fba Merge "[CEPH] Enable iostat mgr module" 2019-02-04 09:45:13 +00:00
Steve Wilkerson
fc6ac671fb Disable nonfunctional experimental check jobs
This disables the centos and fedora experimental checks, as they
need work to resolve issues before they can begin to add value

Change-Id: If523a9163426dc4bc78b5e19a000d0d677cb20e4
2019-02-04 09:02:36 +00:00
Pete Birley
3e012063e7 HTK: Simplify s3 bucket creation job and support locked down acl
This PS simplifies the s3 bucket creation job and updates it to
work with newer versions of s3cli

Change-Id: Idd1f3baf8356329db8a6ede54d53f7599b56f00b
Signed-off-by: Pete Birley <pete@port.direct>
2019-02-03 22:15:27 -06:00
Zuul
e0a858b8c2 Merge "Ceph: Update default to use OSH image" 2019-02-04 02:45:41 +00:00
Zuul
a467aa4039 Merge "Nagios: Reduce Nagios concurrent checks and workers" 2019-02-03 05:53:28 +00:00
Zuul
d77772a147 Merge "Postgresql: Remove unused exporter user configuration" 2019-02-03 05:49:43 +00:00
Matthew Heler
5de299b9b1 [CEPH] Enable iostat mgr module
Enable the iostat mgr module for Ceph. This module shows the
current throughput and IOPS of a Ceph cluster.

Change-Id: I2fe5b47401c15e349a49f345bacd99da39889373
2019-02-02 16:44:58 -06:00
Steve Wilkerson
8cc89fb63f Postgresql: Remove unused exporter user configuration
This removes the auth credentials from the postgresql exporter's
endpoint configuration, as the secret and job for creating that
user come from the auth credentials defined in the postgresql
endpoint instead

Change-Id: Id17578b8e22a1808d7c3323bda9cd005c2584c97
2019-02-02 11:08:02 -06:00
Steve Wilkerson
4d170bea32 MariaDB: Remove unused exporter password credentials
This removes auth credentials from the mariadb exporter endpoint,
as the credentials used for the actual user are consumed from
the oslo_db endpoint instead.

Change-Id: I1419f50b7e0059763588028b55c982ad83446a33
2019-02-02 10:55:00 -06:00
Zuul
1c33bc8719 Merge "RGW: Fix helm-test pod template" 2019-02-01 23:13:44 +00:00
Zuul
82007140fc Merge "Monitoring job: Remove nagios file mount check" 2019-02-01 22:41:46 +00:00
Zuul
6116245e83 Merge "[MariaDB] Fix non-default setting for admin user" 2019-02-01 21:36:15 +00:00
Pete Birley
6ea6a85198 Ceph: Update default to use OSH image
This PS updates the default image in the chart to the latest OSH image.

Change-Id: Ib8d2a72ad48049fe02560dc4405f0088890b6f64
Signed-off-by: Pete Birley <pete@port.direct>
2019-02-01 21:25:13 +00:00
Pete Birley
0d55b9064c RGW: Fix helm-test pod template
This PS updates the helm test driven pod template:
 * places rgw keystone conditional to correct location
 * removes unrequired roles and bindings
 * adds dependency on the rgw being running
 * corrects spelling error
 * corrects s3cmd to work with version 1.6.1

Change-Id: I665dba9fdca1d840f4d864e32f07b6185af51d25
Signed-off-by: Pete Birley <pete@port.direct>
2019-02-01 21:21:10 +00:00
Steve Wilkerson
44b5c008f1 Monitoring job: Remove nagios file mount check
This removes the elasticsearch query clause json file check from
the single node monitoring job, as it's become a bit unreliable.
Instead, we'll rely on the periodic multinode job to validate this
works as intended

Change-Id: I8d33a2625d5d666af280467dc21d76ed0302f837
2019-02-01 15:20:03 -06:00
Steve Wilkerson
3614d025dc Fluentbit: Remove database used in tail inputs
This updates the fluentbit configuration for tail inputs to remove
the values for utilizing SQLite databases to track its location
in each file it's configured to tail.  This is intended to reduce
the pressure fluentbit exerts on the host through writing to
/var/log/foo.db. To help mitigate large amounts of traffic
sent from fluentbit to fluentd upon a pod restart, this also
adds a throttle filter to fluentbit.

As a result, Fluentbit no longer needs a writable mount to its
hostPath on /var/log on the host.  Thus, this change includes
updating the Fluentbit daemonset's mount on /var/log to be
readOnly

Change-Id: If4381f4ff47e887f3ea10beded4f6172edaf08ba
2019-02-01 16:56:31 +00:00
Steve Wilkerson
442e4985c3 Nagios: Reduce Nagios concurrent checks and workers
This reduces the maximum concurrent checks Nagios will execute to
prevent process sprawl on the host. This also reduces the number
of default workers to a single worker, to prevent Nagios from
forking off multiple processes that then execute service checks
and commands in parallel

Change-Id: I0d8445a265740b4a2491bdfd739cb0f27955f06d
2019-02-01 08:31:31 -06:00
Steve Wilkerson
25e4e5662e Update network-policy ldap deployment and test
This updates the script for deploying ldap in the network policy
job to accept ingress traffic from prometheus pods.

This also updates the network policy test to account for return
values with more than one result when checking for a pod to use,
as well as selecting pods by application and component labels
instead of simply grepping for a name (as this could cause issues
with grepping for 'fluentd', when that could return both fluentd
and fluentd-exporter pods, for example)

Change-Id: I12a4029f574ea7d5b250709adef21b07d8cf0220
2019-01-31 21:29:40 +00:00
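
The difference between matching pods by name and selecting them by labels, as described in the commit above, can be sketched as follows. This is a minimal illustration only, not the actual test script: the pod names, label values, and helper functions are all hypothetical.

```python
# Hypothetical pod records; the names and label values are made up
# for illustration and are not taken from the charts.
PODS = [
    {"name": "fluentd-7d9f",
     "labels": {"application": "fluentd", "component": "internal"}},
    {"name": "fluentd-exporter-5c2b",
     "labels": {"application": "fluentd", "component": "exporter"}},
]


def select_by_name(pods, substring):
    """Mimic grepping pod names: every name containing the substring matches."""
    return [p["name"] for p in pods if substring in p["name"]]


def select_by_labels(pods, application, component):
    """Select only pods whose application and component labels both match."""
    return [
        p["name"]
        for p in pods
        if p["labels"].get("application") == application
        and p["labels"].get("component") == component
    ]


# Name matching picks up the exporter pod as well.
print(select_by_name(PODS, "fluentd"))
# Label selection narrows the result to the intended pod.
print(select_by_labels(PODS, "fluentd", "internal"))
```

Even with label selection, a query may return more than one pod (for example, several replicas), so a caller still has to pick one result explicitly.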
Zuul
6ef3f58fb8 Merge "Add pre-fixes to the Selenium jobs and remove "|| true"" 2019-01-31 20:39:40 +00:00
Chris Wedgwood
b15605029e [MariaDB] Fix non-default setting for admin user
This allows the database administration username to be changed in
addition to the password (the configuration and documentation imply
this is possible, but the current implementation assumes root in a few
places).

Remove some unused env declarations.

Tested with and without Prometheus exporter enabled.

Change-Id: I8a48dac7a84fe583c7dc5ff0644ca14b1e5190d8
2019-01-31 17:29:52 +00:00
Zuul
b30012a616 Merge "[CEPH] Fixes for the OSD defrag cronjob" 2019-01-31 16:05:14 +00:00
Matthew Heler
fc76091261 [CEPH] Fixes for the OSD defrag cronjob
Fix a naming issue with the cronjob's binary, and schedule the cron
job to run every 15 minutes for the gates. Additionally, check to
ensure we are only running on block devices. Also update the
script to work with ceph-volume created devices.

Change-Id: I8aedab0ac41c191ef39a08034fff3278027d7520
2019-01-31 06:13:05 -06:00
Deokjin Kim
cbb9ec0748 Fix calling wrong variable name in gnocchi
Checking test_version seems right; test_mimic does not exist.

Change-Id: I2cbfed0f7da0b22eb753ed7bce833872a7ff707f
Signed-off-by: Deokjin Kim <deokjin81.kim@samsung.com>
2019-01-31 00:34:21 +00:00
Zuul
c3a8063fdb Merge "Fluentd: remove unused configuration section" 2019-01-30 23:30:50 +00:00
Zuul
3bd3b70e51 Merge "[Calico] Configuration robustness improvements" 2019-01-30 22:16:55 +00:00
Steve Wilkerson
f01e9d2391 Fluentd: remove unused configuration section
This removes an unused section of configuration for fluentd, as
well as cleans up the values for filtering fluentd logs

Change-Id: I0c58d3ac236af7723c64c3b9fcba877736b1f606
2019-01-30 16:03:59 -06:00
Chris Wedgwood
b7b7c5ea44 [alertmanager] default to 1 replica, multinode gate uses 3
Change-Id: Ifb1420f8dcf7237349a79f1f97aea5e547bafeab
2019-01-30 08:43:18 +00:00
Chris Wedgwood
47a2da5af0 [Calico] Configuration robustness improvements
No longer use networking.settings.ippool.ipip.mode, rather take from
conf.node.CALICO_IPV4POOL_IPIP (this avoids duplication and
possibility of setting them differently).

Logging values previously required Titlecase in some places, lower in
others (and it changed across versions); have the chart DTRT where it
matters to avoid configuration problems.

Change-Id: Idb7ccb5be8f9e1cb184ed86a9fd0875704912564
2019-01-30 06:33:22 +00:00
Zuul
33178a529d Merge "Fluentd: Remove unused liveness port" 2019-01-30 04:12:48 +00:00
Zuul
8028bcb641 Merge "[tiller] Disable monitoring by default, enable in gate" 2019-01-30 04:12:47 +00:00
Zuul
0963980b51 Merge "[Prometheus] Relax disk IO constraints" 2019-01-30 04:12:46 +00:00
Zuul
ba68a8c745 Merge "[Prometheus] Fix filesystem space checks" 2019-01-30 04:12:45 +00:00
Zuul
3fa8fbea1a Merge "[ingress] explicitly specify the Prometheus scrape port" 2019-01-30 04:12:44 +00:00
Meg Heisler
98fbc9a1e2 Add pre-fixes to the Selenium jobs and remove "|| true"
This adds xxx-job name prefixes to the Selenium jobs for consistency

This will also remove the "|| true" suffix that was added temporarily to
ensure the Kibana selenium job did not error. The fix for the issue
was merged so the quick fix is no longer needed and may prevent an
error when an issue actually occurs.

Change-Id: I16881974cbf618b31813964b17c090dbfe33fe51
2019-01-29 20:24:57 -06:00
Pete Birley
bf4713f04b HTK: Support tls secrets on non-fqdn overridden hosts in ingress
This PS adds support for tls secrets on non-fqdn overridden hosts
in ingress rules.

Change-Id: I134af614e7c2ac3fae6eba2bc4bda9f8b41f7f78
Signed-off-by: Pete Birley <pete@port.direct>
2019-01-29 23:34:18 +00:00
Zuul
a6aabe0feb Merge "Liveness probes for OpenVSwitch daemons." 2019-01-29 23:06:07 +00:00
Steve Wilkerson
39410b16bc Fluentd: Remove unused liveness port
This removes an unused port for a previous implementation of the
fluentd liveness probe

Change-Id: I80367bcf6fedc75b3ee7054eba9c382fbb4bc79d
2019-01-29 14:31:50 -06:00
Zuul
4aca509aaf Merge "[CEPH] Clean up PG troubleshooting option specific to Luminous" 2019-01-29 20:23:53 +00:00
Hemachandra Reddy
aef0ff7810 Liveness probes for OpenVSwitch daemons.
Uses ovs-vsctl for ovs-db
Uses ovs-appctl for ovs-vswitchd as "ovs-vsctl show" does not
talk to ovs-vswitchd.

Change-Id: Ia0b84e3546ff1693676ca61370e1344d75b6e308
2019-01-29 20:10:41 +00:00
Zuul
6051d5e450 Merge "Helm-Toolkit: Make ingress manifest work for more than public endpoints" 2019-01-29 20:06:01 +00:00
Chris Wedgwood
a6fa47eea5 [tiller] Disable monitoring by default, enable in gate
Change-Id: Idb7a1f0046e96261a7042d30eedfaea031b27209
2019-01-29 18:57:58 +00:00
Matthew Heler
f48c365cd3 [CEPH] Clean up PG troubleshooting option specific to Luminous
Clean up the PG troubleshooting method that was needed for
Luminous images. Since we are now on Mimic, this function is now
not needed.

Change-Id: Iccb148120410b956c25a1fed5655b3debba3412c
2019-01-29 18:57:23 +00:00
Zuul
7b5d6e9237 Merge "OSH-Infra: Update multinode and aio-monitoring/logging jobs" 2019-01-29 17:03:18 +00:00
Zuul
2de223b863 Merge "Add proxy support to Minikube gate script" 2019-01-29 17:03:17 +00:00
Pete Birley
3eb0517fc9 Helm-Toolkit: Make ingress manifest work for more than public endpoints
This PS enables the ingress manifest function to work for all endpoints
rather than just public.

Change-Id: I3b454bb24a763f51896e845b767fd9d28f5b07dc
Signed-off-by: Pete Birley <pete@port.direct>
2019-01-29 08:53:06 -06:00
Chris Wedgwood
d7808468fc [Prometheus] Relax disk IO constraints
Relax the timing constraints for disk IO to accommodate rotating disks;
a "measured IO" might be the result of a small number of physical IOs, so
allow for enough time for a small number of disk rotations (this isn't
perfect but seems to be about right in testing under load).

Change-Id: Ifb067a2218528e5918d2f4b2ba169b6e739084e0
2019-01-29 06:41:51 +00:00
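
As a rough worked example of the reasoning in the commit above: one rotation of a spinning disk takes 60 seconds divided by its RPM, so a time budget covering a few rotations follows directly. The spindle speeds and the rotation count below are assumptions chosen for illustration; the commit does not state the thresholds it actually uses.

```python
def rotation_ms(rpm):
    """Time for one full platter rotation, in milliseconds."""
    return 60_000.0 / rpm


def io_time_budget_ms(rpm, rotations):
    """Time allowance covering the given number of rotations (hypothetical helper)."""
    return rotations * rotation_ms(rpm)


# Assumed spindle speeds; real hardware may differ.
for rpm in (5400, 7200, 10000):
    print(rpm, round(rotation_ms(rpm), 2), round(io_time_budget_ms(rpm, 3), 2))
```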
Chris Wedgwood
4fb6ee6e35 [Prometheus] Fix filesystem space checks
Change-Id: Id527ea6e08070cb7d2634417a7c203c1c5c3d97c
2019-01-29 06:34:54 +00:00
Chris Wedgwood
03ee843b22 [ingress] explicitly specify the Prometheus scrape port
Change-Id: I9e191257c436ca6ab74d013feb07bb0ffed2d532
2019-01-29 04:42:26 +00:00