1795 Commits

Author SHA1 Message Date
Venkata, Krishna (kv988c)
d1964b7b38 [Ceph]: Remove duplicate values
secret_keystone_rgw is defined twice in
same section in lines 548 and 550.

Change-Id: I8f76a6f0f4105d47efbc562d190f2eabf51764c1
2019-06-19 16:45:05 +00:00
Steve Wilkerson
f2db36862c Add openstack-exporter to aio-monitoring & openstack-support jobs
This deploys the openstack-exporter as part of the aio-monitoring
single node and openstack-support jobs, to ensure the exporter is
functional and working as expected

Change-Id: If31046769fd9032647af47ce8fce0cde5b8f7d7d
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-06-19 16:38:10 +00:00
Pete Birley
fc58be6a93 Ingress: Clean prometheus-nginx.socket on startup
This PS cleans the prometheus-nginx.socket on startup of the container,
which is required to allow the container, as opposed to the pod, restart.

Change-Id: I7906e85a200f6fb92467371218b4e5957add39f4
Signed-off-by: Pete Birley <pete@port.direct>
2019-06-19 14:02:56 +00:00
Jean-Philippe Evrard
8ee35e896f Adapt rabbitmq test for py2 and py3 support
This works well for python2, but things will become
messy when py3 will be the default.

This, at the same time, ensures the KeyErrors are
properly logged, with a way to debug them.

Change-Id: If5d8007bece9ccbff481187e757968e7d1b6f651
2019-06-19 03:19:08 +00:00
Zuul
5935a2c040 Merge "Fix indent" 2019-06-19 00:17:02 +00:00
Itxaka
565fb4606b htk: provide default domain env and secrets
We currently do not provide any env or secrets for the
default domain id for keystone

This makes it so we provide both like any other vars

Change-Id: I00c68026af25d8c5af37fcb3a6e1bb0e2da13e1e
2019-06-18 21:51:24 +00:00
Zuul
0f7ced71dc Merge "RabbitMQ: Allow clients to directly connect to servers" 2019-06-18 21:46:13 +00:00
Zuul
79582a1ba5 Merge "Elasticsearch: Allow override of job back off limits and deadlines" 2019-06-18 21:18:47 +00:00
Zuul
80d7e932e4 Merge "Switch from default values being populated for upgrade strategy for ceph components" 2019-06-18 21:18:15 +00:00
Hemant
b9a9ee323b Change the expression of defined alert in prometheus to avoid unnecessary errors
There were some false alerts about volume_claim_capacity_high_utilization
due to wrong formula used to determine the percentage of used capacity.

Change-Id: I24afed7946f915e5e13f0ba759eca252c2598af9
2019-06-18 20:19:29 +00:00
Pete Birley
0925f50e2a RabbitMQ: Allow clients to directly connect to servers
This PS updates the rabbitmq chart, to allow clients to connect directly
to backend servers, and also introduces a htk function to produce
the appropriate transport_url used by oslo.messaging to take advantage
of this functionaility.

Change-Id: I5150a64bd29fa062e30496c1f2127de138322863
Signed-off-by: Pete Birley <pete@port.direct>
2019-06-18 15:10:26 +00:00
Zuul
b2f47aabb1 Merge "rabbitmq: set hostPath for rabbitmq-data" 2019-06-18 14:46:15 +00:00
Venkata, Krishna (kv988c)
ed574f456f Switch from default values being populated
for upgrade strategy for ceph components

This PS uses HelmToolKit function to add
upgrade strategy parameters to ceph Components

Change-Id: I54e71d2a52bd639b3e93fc899c1bf2cd075b5396
2019-06-18 01:42:38 +00:00
Tin Lam
afb4c3afab Fix indent
While reviewing [0], found incorrect indentations in YAML.
This patch set redresses the indentation problems.

[0] https://review.opendev.org/#/c/665524/2/ceph-osd/templates/daemonset-osd.yaml

Change-Id: I22e84f5d87aec90ad787a61b152062452867bc17
Signed-off-by: Tin Lam <tin@irrational.io>
2019-06-18 00:49:20 +00:00
Pete Birley
146d3d5976 Ceph: fix schema error in OSD chart
This PS fixes:
 Error: error validating "": error validating data: ValidationError(DaemonSet.spec.template.spec.initContainers[3].volumeMounts[10]): unknown field "ReadOnly" in io.k8s.api.core.v1.VolumeMount

Change-Id: I62f99c2f6209fc9bc72dad19e39acb5beed4519b
Signed-off-by: Pete Birley <pete@port.direct>
2019-06-15 21:40:09 +00:00
Zuul
1f1482bcb7 Merge "Ingress: Fix security context for pod/container" 2019-06-15 01:09:31 +00:00
Zuul
8957db85db Merge "Rabbitmq: Add pod/container security context" 2019-06-15 00:03:48 +00:00
Alexander Noskov
d9b939979d Ingress: Fix security context for pod/container
During armada bootstrap, ingress pod tries to execute chroot [0]
inside root directory on host machine to load dummy kernel module
and getting permission denied error.

[0] https://opendev.org/openstack/openstack-helm-infra/src/branch/master/ingress/templates/bin/_ingress-vip-routed.sh.tpl#L22

Change-Id: Icf7e29e95e0c3cf2bf71a22711a03218390c90cb
2019-06-14 17:24:42 +00:00
Steve Wilkerson
b4b1dd9528 Add missing affinity keys to chart pod specs
This adds the affinity key to the pod spec for the grafana,
nagios, kube-state-metrics, and openstack-exporter charts as it
was previously missed

Change-Id: Ifefa88d7f33607b4d595effa5fbf72f3387e5081
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-06-13 19:15:42 +00:00
Renis Makadia
c7f5c9979c Add helm tests for Ceph Provisioners chart
- Adding helm tests for Ceph provisioner chart
- Helm test should only executed when deploying chart with
client_secrets: true.

Co-Authored-By: Chinasubbareddy Mallavarapu <cr3938@att.com>

Change-Id: I33421249246dfaf6ea4f835e76a74813dfb3b595
2019-06-12 12:32:30 -05:00
Zuul
4a4456bace Merge "Enable hugepage support in HTK resources snippet" 2019-06-11 23:41:57 +00:00
RAHUL KHIYANI
7f47169f80 Rabbitmq: Add pod/container security context
This updates the rabbitmq chart to include the pod
security context on the pod template.

This also adds the container security context to set
readOnlyRootFilesystem to true

Change-Id: I68aa4b49bf6301e1b1004a526151fa0ab4b197b4
2019-06-10 18:50:59 +00:00
Zuul
0714b9d7ae Merge "Nagios: Use helm-toolkit template renderer for config files" 2019-06-10 16:09:53 +00:00
Chinasubbareddy Mallavarapu
9e653096e1 [ceph-client] update dependncy for test pod
This is to update test pod dependency  since its getting started
right after mgr service availbe and mgr pods are in init state and
waiting for rbd-pool job.

Change-Id: Iaf9af3ffcf1f4940c1b661a853df0ec4edd99d39
2019-06-06 16:50:06 -05:00
Zuul
09ae22d849 Merge "Nagios: Add Selenium tests via helm test pod" 2019-06-06 21:23:25 +00:00
Zuul
e2cd5d99f9 Merge "[ceph-client] Handle pool min_size during ceph cluster expansion" 2019-06-06 20:45:48 +00:00
Steve Wilkerson
40769d5a60 Nagios: Add Selenium tests via helm test pod
This adds selenium tests for the Nagios chart via a helm test
pod to help ensure the Nagios deployment is functional and
accessible

Change-Id: I44f30fbac274546abadba0290de029ed2b9d1958
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-06-06 19:35:59 +00:00
Steve Wilkerson
d72299db4b Nagios: Use helm-toolkit template renderer for config files
This updates the Nagios chart to use the helm-toolkit template
renderer snippet for generating the Nagios configuration files.
This was done to make the exposure of the configuration files
simpler for those who are more familiar with traditional Nagios
configuration files, as well as allowing for values overrides for
adding custom host names or custom object definitions to nagios
objects (as Nagios doesn't easily allow for this via environment
accessible macros).

Change-Id: I84d5c83d84d6438af5f3ab57997e80e8b1fc8312
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-06-06 19:35:32 +00:00
Zuul
d23e847e1f Merge "Uplift nfs-provisioner" 2019-06-06 19:28:51 +00:00
Zuul
a3f4337952 Merge "Ceph-client: Fix security context for pod/container" 2019-06-06 18:47:30 +00:00
Chinasubbareddy Mallavarapu
6767d0ad48 [ceph-client] Handle pool min_size during ceph cluster expansion
This is to update logic for  pool min_size parameter  as  this is
not getting changed when replication changes from intilization.

Change-Id: I30f99aaf92c3dc83afce10534b1d2ac9402b7fa7
2019-06-06 15:20:43 +00:00
RAHUL KHIYANI
d3dec3d257 Ceph-mon: Fix security context for pod/container
This changes the  user from root to the nobody user instead
in ceph-mon chart wherever needed

Change-Id: I5d3c1fbc8f983688807b73867773bfa2d83b91b3
2019-06-06 12:32:01 +00:00
Zuul
5245f04d4c Merge "Ingress: Add pod/container security context" 2019-06-06 11:53:11 +00:00
Zuul
82291cfe0e Merge "Ceph-osd: Fix security context for pod/container" 2019-06-06 11:53:10 +00:00
Zuul
3ba03ed8ea Merge "Grafana: Add Selenium tests to helm test pod" 2019-06-05 22:25:09 +00:00
Evgeny L
b77385c82f Uplift nfs-provisioner
* Update the version of nfs-provisioner to the latest image.
* Allow nfs-provisioner user to manage endpoints, this is
  required because the newest version uses `leaderelection`
  package from k8s, this packages leverages labels on endpoints
  to track leader election information.

Change-Id: Ie2727bd6bcc26e57875bea38f0f665d4a0e85bd7
2019-06-05 16:57:40 +00:00
Steve Wilkerson
25e105f26a Grafana: Add Selenium tests to helm test pod
This adds selenium tests for the grafana chart to the helm test
pod to help ensure the Grafana deployment is functional and
accessible

Change-Id: Idc8d97e5111628d1ed4f25145086d54c5e0136e7
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-06-05 16:09:04 +00:00
RAHUL KHIYANI
c9a1c412e7 Ceph-osd: Fix security context for pod/container
This changes the  user from root to the nobody user instead
in ceph-osd chart wherever needed

This also permits read-only filesystems to back the containers by setting
the default to true

Change-Id: Ia777bf212e0e3414909c70a4bd839e12d4919bb2
2019-06-05 15:00:20 +00:00
Georg Kunz
5596247f3f Enable hugepage support in HTK resources snippet
This patch simplifies the resource snippet in helm-toolkit to allow for
specifying hugepage limits. Specifically, this patch replaces the
individual checks for specific system resources (e.g., cpu, memory) by
just copying over the entire resource component as defined in a
values.yaml or a corresponding overwrite.

This change is a prerequisite for enable hugepage handling in other
charts such as openvswitch or postgresql.

Change-Id: I786ff6c7aa5fb6b08b54d2e21878551e5e1e3818
2019-06-05 11:37:43 +02:00
RAHUL KHIYANI
a0d67a1117 Ingress: Add pod/container security context
This updates the etcd chart to include the pod
security context on the pod template.

This also adds the container security context to set
readOnlyRootFilesystem to true

Change-Id: I9bf05ab5c21f9afbe269e1566cfecd20b3c086c0
2019-06-04 15:19:21 -05:00
Steve Wilkerson
0d2ac10fc2 Update apparmor job after fluent-logging split
This updates the apparmor job to account for the splitting of the
fluent-logging chart, as it was missed during that change.  Now,
the apparmor job will deploy fluentbit as well as fluentd deployed
as a daemonset running as a collecting agent

Change-Id: Iefa50f474b57a10c5e7e5a9032c7b23d26d97640
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-06-04 12:58:33 +00:00
Zuul
b7f0fd8dff Merge "Replace git.openstack.org URLs with opendev.org URLs" 2019-06-04 08:57:21 +00:00
Zuul
e238b7a611 Merge "Remove readonly rootfs Elasticsearch" 2019-06-03 19:39:55 +00:00
Zuul
547cf9b6b8 Merge "Fluentd: Update configuration defaults and overrides" 2019-06-03 18:01:51 +00:00
Steve Wilkerson
dc6c1f2145 Elasticsearch: Update cluster-wait job script
This updates the cluster-wait job script to include a sleep for
when no nodes of a given type are detected.  This check was
previously executed only when a node count of (0 < x < expected)
was detected.  This update reduces the number of queries executed
against the Elasticsearch http endpoint

Change-Id: I15cb39250a5ab9a7f6df0d62c35289a55e109dbd
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-06-03 16:00:51 +00:00
Steve Wilkerson
aee9708b06 Fluentd: Update configuration defaults and overrides
This updates the default fluentd configuration to include
recommended settings for preventing the elasticsearch plugin from
reloading the connection after 10000 requests (default for the
ruby gem). This also updates the configuration overrides for the
fluentd-daemonset deployment to provide input parity with the
default fluentbit configuration by adding inputs for the docker
and kubelet systemd units, inputs for ceph, libvirt, kernel logs,
and auth logs on the host.  Finally, this updates the fluentd
template to include environment variables for the host name and
the fluentd pod name so they can be added to logged events through
fluentd filter plugins

Change-Id: I21f7a89a325c44f8b058ff01a20191bea1a210b4
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-06-03 16:00:44 +00:00
Randeep Jalli
79cb67544c Remove readonly rootfs Elasticsearch
This removes ReadonlyRootfs from
Elasticsearch data pods as this is
required in order for the data pods
to recover from outages

Change-Id: I603d3a25b6580eab20e2b20e1b1cd0cf740c7ab2
2019-06-03 15:02:51 +00:00
Zuul
2f0c2e328d Merge "Elasticsearch: kill process after data node drain" 2019-05-31 23:48:34 +00:00
Steve Wilkerson
1ee476b197 Elasticsearch: Allow override of job back off limits and deadlines
this updates the Elasticsearch cluster wait and snapshot repo jobs
to include values overrides for the job backoff limits and the
active deadline seconds field. This allows for tweaking beyond the
standard defaults for kubernetes jobs

Change-Id: I1f95a635ab4dfdb3718d5d4fa668c64a9095e899
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-05-31 15:52:16 -05:00
Steve Wilkerson
277b54fc65 Fluentd: Use fqdn hostname for Elasticsearch service
This moves Fluentd to use the helm-toolkit endpoint lookup for
using the fqdn for the Elasticsearch hostname instead of the
standard short host name

Change-Id: Ibe640979002331693f0a9b6155c9014572294664
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-05-31 14:42:09 -05:00