1038 Commits

Author SHA1 Message Date
Zuul
b591e0754a Merge "Add Nagios Elasticsearch Query Command" 2018-12-06 20:50:28 +00:00
Zuul
9d5c67fb74 Merge "Disable Ceph helm test while being updated." 2018-12-06 18:42:56 +00:00
Renis Makadia
5aaa59f23f Disable Ceph helm test while being updated.
Change-Id: I6011ee6f41044647dea43fa2848058d723e194b8
2018-12-06 15:47:08 +00:00
Zuul
be06b7c441 Merge "Add divingbell to zuul for test of htk overrides" 2018-12-06 11:32:54 +00:00
Pete Birley
90700f5a76 Ceph: Add labels to secrets created by charts
This PS adds labels to secrets created by charts, which allows them
to be easily identified in deployed sites.

PS4: This PS resolves undefined variable "$envAll" issue

Change-Id: Icbe3584b0ac18b23e32489c4a04ad5aa7aad67e6
Signed-off-by: Pete Birley <pete@port.direct>
2018-12-06 04:15:29 +00:00
Zuul
0a4a76797d Merge "Modify Fluentbit to add appropriate tags" 2018-12-06 01:28:17 +00:00
Mike Pham
d09254c6de Modify Fluentbit to add appropriate tags
Adding auth tags for the logs to support special filter
for openstack and application security logs

Change-Id: Ifbd2395e4268d8d8fc4a2a3ac4d351db3d3e0845
2018-12-05 15:16:40 +00:00
Anderson, Craig (ca846m)
ab4c330b65 Add divingbell to zuul for test of htk overrides
Change-Id: I1e5cec885b4ae20f34733e6aaf1ee0a8a170ac32
2018-12-05 08:36:50 +00:00
Matthew Heler
4ad893eb1a Additional Ceph tunning parameters for openstack-helm
osd_scrub_load_threshold set to 10.0 (default 0.5)
 - With the number of multi-core processors nowadays, it's fairly
   typical to see systems over a load of 1.0. We need to adjust the
   scrub load threshold so that scrubbing runs as scheduled even
   when a node is moderately/lightly under load.

filestore_max_sync_interval set to 10s (default 5s)
 - Larger default journal sizes (>1GB) will not be effectively used
   unless the max sync interval time is increased for Filestore. The
   benefit of this change is increased performance especially around
   sequential write workloads.

mon_osd_down_out_interval set to 1800s (default 600s)
 - OSD PODs can take longer then several minutes to boot up. Mark
   an OSD as 'out' in the CRUSH map only after 15 minutes of being
   'down'.

Change-Id: I62d6d0de436c270d3295671f8c7f74c89b3bd71e
2018-12-04 20:27:52 -06:00
qingszhao
a5b8223eeb Change openstack-dev to openstack-discuss
Mailinglists have been updated. Openstack-discuss replaces openstack-dev.

Change-Id: Ibb461bf73b31d5e20a89081b281046fb5044b17e
2018-12-04 17:31:43 +00:00
Zuul
5da44ee309 Merge "Gate: minikube single node gate deployment" 2018-12-04 16:32:04 +00:00
Zuul
6985cc059b Merge "Fix racy conditions with Ceph caps updating" 2018-12-04 05:15:14 +00:00
Zuul
4dca49571e Merge "Switch Ceph to IPs when DNS is down" 2018-12-04 04:51:23 +00:00
Matthew Heler
621d3938db Fix racy conditions with Ceph caps updating
The sed commands are replaced with simpler methods of extracting
the cephx cap keys.

Change-Id: I783c360569bf17b16d128544b5ab11cb1ce9a7ed
2018-12-03 20:58:10 -06:00
Matthew Heler
76fb1fb10a Cephx caps update for new and post deployments
The update makes sure the Openstack service's cephx
user capabilities match best practices in terms of
security permissions after a site or software update.

Change-Id: I7c241cdb5d92463ac59c557ca7847ca5688d158b
2018-12-03 16:51:45 -06:00
Pete Birley
957bcb15a4 Gate: minikube single node gate deployment
This PS moves the single node gate to use a lightwight minikube
based env.

Change-Id: I285c4222795b66f3527f0daaf62a91973da5dca8
Co-authored-by: Krishna Venkata <kvenkata986@gmail.com>
Signed-off-by: Pete Birley <pete@port.direct>
2018-12-03 15:33:44 -06:00
Matthew Heler
35cce6cb43 Switch Ceph to IPs when DNS is down
Add helper scripts that are called by a POD to switch
Ceph from DNS to IPs. This POD will loop every 5 minutes
to catch cases where the DNS might be unavailable.

On a POD's Service start switch ceph.conf to using IPs rather
then DNS.

Change-Id: I402199f55792ca9f5f28e436ff44d4a6ac9b7cf9
2018-12-03 10:51:37 -06:00
Huang, Scott (sh2725)
bd05126309 Add Nagios Elasticsearch Query Command
Change-Id: I74a965a5397101793cae71228a6a5bd442bf9f5a
2018-12-03 09:09:03 -05:00
Zuul
2a0b183613 Merge "Helm tests for Ceph-OSD and Ceph-Client charts" 2018-12-03 05:03:31 +00:00
Zuul
48f5737e98 Merge "Helm Tests for Ceph-RGW chart" 2018-12-02 14:54:53 +00:00
Pete Birley
896385354e Mariadb: Support adoption of running single node mariadb deployment
This PS updates the mariadb chart to both support adoption of a
single instance of mariadb running the bash driven chart, which
did not support reforming a galera cluster by tracking state using
a configmap. Additionally basic logic is added for upgrading the
database as part of the normal rolling update flow.

Change-Id: I412de507112b38d6d2534e89f2a02f84bef3da63
Signed-off-by: Pete Birley <pete@port.direct>
2018-12-02 00:20:57 -06:00
Sangeet Gupta
eac7202774 k8s-keystone create cluster roles and tests
This commit adds roles to kubernetes-keystone-webook policy
which has permissions similar to clusterrols cluster-admin,
edit and view present in kubernetes.

Check.sh script is also modified to test and verify the new
roles.

Change-Id: I43621d2e1036259064c805d97b340589a5b68c93
2018-12-01 19:14:31 +00:00
Jean-Charles Lopez
f7e03d4763 Helm Tests for Ceph-RGW chart
Co-Authored-By: Renis Makadia <renis.makadia@att.com>

Change-Id: I81cc0cb498b2ca911d5b7bfa7c3bd9b8552e0e2b
2018-12-01 08:08:28 +00:00
Renis Makadia
b1005b23b4 Helm tests for Ceph-OSD and Ceph-Client charts
Change-Id: If4a846f0593b8679558662205a8560aa3cbb18ae
2018-12-01 08:08:00 +00:00
Zuul
5316586d9e Merge "Fluentbit/Node Exporter: Remove unused tolerations key" 2018-11-29 15:40:03 +00:00
Pete Birley
4803fe31d1 Ingress: Break out helper container images
This PS breaks out the helper container images, which is required
now that the ingress image is more compact.

Change-Id: I6afb08954f37eda1ed913a4b3acdaf6e2b89d30e
Signed-off-by: Pete Birley <pete@port.direct>
2018-11-28 20:54:35 -06:00
Zuul
2bec7040a9 Merge "Add failure domains, and device classes for custom CRUSH rules" 2018-11-29 00:42:05 +00:00
Cliff Parsons
598faeb8db Make access control annotations more generic.
This patch takes into consideration that there could be multiple
options for mandatory access control in a cluster. The previously
defined Helm toolkit function for generating a MAC annotation can
now be specified generically, like in this example:

  mandatory_access_control:
    type: apparmor
    glance-api:
      init: runtime/default
      glance-api: runtime/default
      glance-perms: runtime/default
      ceph-keyring-placement: runtime/default
    glance-registry:
      init: runtime/default
      glance-registry: runtime/default

If no MAC is required, then the "type" can be set to null,
and no annotation would be generated. The only MAC type supported
at the moment is "apparmor".

Change-Id: I6b45533d73af82e8fff353b0ed9f29f0891f24f1
2018-11-28 08:54:15 +00:00
Zuul
04c8f03532 Merge "Add charts for Elastic Beats" 2018-11-27 20:25:42 +00:00
Steve Wilkerson
26c3773983 Fluentbit/Node Exporter: Remove unused tolerations key
This removes the tolerations key from the labels entries. As the
boolean check is on the pod.tolerations.enabled key instead, the
labels.foo.tolerations key is no longer used and should be removed

Change-Id: I00536dabadf9bd354219058d8efd054c60952bbd
2018-11-27 12:38:16 -06:00
Zuul
42249d4243 Merge "Truncate long host names for overrides" 2018-11-27 16:01:53 +00:00
Matthew Heler
6e8c289c13 Add failure domains, and device classes for custom CRUSH rules
Largely inspired and taken from Kranthi's PS.

 - Add support for creating custom CRUSH rules based off of failure
domains and device classes (ssd & hdd)
- Basic logic around the PG calculator to autodetect the number of
OSDs globally and per device class (required when using custom crush
rules that specify device classes).

Change-Id: I13a6f5eb21494746c2b77e340e8d0dcb0d81a591
2018-11-27 09:37:30 -06:00
Andrey Pavlov
5ac56d9307 add parameter to allow redefining of server port for ingress
To allow to integrate TungstenFabric(Contrail) with Airship
there should be ability to redifine ports that can be conflicted.

Change-Id: Id15658c65339577cec03f25ebd22dd664bb5976a
2018-11-27 13:15:32 +03:00
Anderson, Craig (ca846m)
48a0c09fea Truncate long host names for overrides
Long hostnames can cause the 63 char name limit to be exceeded.
Truncate the hostname if hostname > 20 char.

Change-Id: Ieb7e4dafb41d1fe3ab3d663d2614f75c814afee6
2018-11-26 17:04:58 -08:00
Steve Wilkerson
4c18a421ee Add charts for Elastic Beats
This adds basic charts for Elastic metricbeat, filebeat,
packetbeat, and elastic APM server.  This also adds an experimental
job for deploying the elastic beats along with Elasticsearch and
Kibana

Change-Id: Idcdc1bfa75bcdcaa68801dbb8999f0853652af0f
2018-11-26 20:19:57 +00:00
Zuul
0730df5973 Merge "Prometheus: Add session affinity to ingress" 2018-11-26 18:21:14 +00:00
Zuul
4b76f8c280 Merge "Nagios: Update image tag" 2018-11-26 17:40:20 +00:00
Steve Wilkerson
71c1a16758 Prometheus: Add session affinity to ingress
This adds session affinity to Prometheus's ingress. This allows for
the use of cookies for Prometheus's session affinity

Change-Id: I2e7e1d1b5120c1fb3ddecb5883845e46d61273de
2018-11-26 14:30:08 +00:00
Steve Wilkerson
439079693d Nagios: Update image tag
This updates the Nagios image tag to include the updated plugin
for querying Elasticsearch for alerting on logged events

Change-Id: Idd61d82463b79baab0e94c20b32da1dc6a8b3634
2018-11-26 08:29:22 -06:00
Zuul
8e369d2c9c Merge "Ingress: Update version of ingress controller image" 2018-11-23 20:39:38 +00:00
Zuul
89b651dc1d Merge "Ingress: Make healthz port configurable" 2018-11-21 20:01:26 +00:00
Pete Birley
4d2085f0af Ingress: Update version of ingress controller image
This PS updates the version of the ingress controller image used.

This brings in the ability to update the ingress configuration without
reloading nginx. There may also need to be some changes for prom based
monitoring:
 * https://github.com/kubernetes/ingress-nginx/blob/master/Changelog.md#0100

Change-Id: Ia0bf3dbb9b726f3a5cfb1f95d7ede456af13374a
Signed-off-by: Pete Birley <pete@port.direct>
2018-11-21 19:21:40 +00:00
Zuul
16072765bf Merge "Ingress: Allow status port to be customised" 2018-11-20 18:29:16 +00:00
Pete Birley
ea875b1dcc Ingress: Make healthz port configurable
This PS updates the healthz port to be configurable

Change-Id: Ifa5ea4b7b422156a7309886ecc21668fc096065b
Signed-off-by: Pete Birley <pete@port.direct>
2018-11-20 12:28:14 -06:00
Pete Birley
f3e1fa4e72 Ingress: Allow status port to be customised
This PS updates the ingress chart to allow the status pport to be
changed.

Change-Id: Ia38223c56806f6113622a809e792b4fedd010d87
Signed-off-by: Pete Birley <pete@port.direct>
2018-11-20 09:57:56 -06:00
Matthew Heler
5ce9f2eb3b Enable Ceph charts to be rack aware for CRUSH
Add support for a rack level CRUSH map. Rack level CRUSH support is
enabled by using the "rack_replicated_rule" crush rule.

Change-Id: I4df224f2821872faa2eddec2120832e9a22f4a7c
2018-11-20 09:07:36 -06:00
Zuul
5d356f9265 Merge "Document howto recover from a Ceph namspace deletion" 2018-11-15 17:27:45 +00:00
Matthew Heler
cfc2d4abd8 Document howto recover from a Ceph namspace deletion
Change-Id: Ib1b03cd046fbdad6f18478cfa9c9f0bf70ec9430
2018-11-14 13:31:16 -06:00
Zuul
dd6b2a0a1d Merge "Additional Ceph RGW tuning and cleanups" 2018-11-14 18:48:36 +00:00
Zuul
5bf9c26bd8 Merge "Move default CEPH journal size from 5GB to 10GB" 2018-11-13 05:28:45 +00:00