586 Commits

Author SHA1 Message Date
Zuul
239274dffc Merge "Prometheus: Remove block duration flags, update cadvisor job" 2018-08-22 16:47:30 +00:00
Zuul
2bbf188cbd Merge "Fluentd: Change default image" 2018-08-22 16:47:30 +00:00
Zuul
dbd3841c6e Merge "Fluentd: Filter out fluentd's logs" 2018-08-22 16:47:29 +00:00
Zuul
b1fdcfa497 Merge "Running agents on all nodes." 2018-08-22 15:51:13 +00:00
Zuul
ba93bc11c4 Merge "Helm: Move to use 2.10 release" 2018-08-22 04:19:33 +00:00
Pete Birley
c62c5dcf88 Helm: Move to use 2.10 release
This PS moves to use the Helm 2.10 release, which brings in a version
of sprig that supports TLS certificate creation from defined CAs.

Change-Id: I80233f8f31727c80bcd667cfa0d851488da39588
Signed-off-by: Pete Birley <pete@port.direct>
2018-08-21 15:52:20 +00:00
Steve Wilkerson
d5dc97a431 Prometheus: Remove block duration flags, update cadvisor job
This removes the min_block_duration and max_block_duration flags
from the Prometheus chart, as the suggested best practice is to
use the defaults (2h min, 10% of retention time as max).

This also updates the scrape target configuration for cadvisor to
match the upstream example endpoint for kubernetes versions 1.7.3
and later

Change-Id: I200969d6c4da9d17d0a7d3a34a114ccc5f5ee70f
2018-08-20 13:26:40 -05:00
Steve Wilkerson
dd986ed764 Fluentd: Change default image
This updates Fluentd to use the stable v1.2 debian fluentd
image instead of the kolla image. This images comes bundled
with the elasticsearch plugin, and provides more
flexibility in configuring the buffer behavior of the output
plugins

Change-Id: Id446ef1e050f5d9c005c94dae661cf9ae88fffea
2018-08-16 12:10:49 -05:00
Steve Wilkerson
faef231b0b Prometheus: Update version to 2.3.2
This updates the Prometheus version to 2.3.2, which includes a fix
for memory leak issues with the kubernetes client and also adds a
dashboard for evaluating prometheus rule evaluation performance

Change-Id: I7b9e7bee114fa149db3733c0dacfefae36be7fa8
2018-08-16 16:48:27 +00:00
Steve Wilkerson
a7af54e0c8 Fluentd: Filter out fluentd's logs
This filters out fluentd's logs for collection, as this can result
in infinite loops as fluentd will try to process the events in its
own logs repeatedly

Change-Id: I85cce909b6917901b964cb5cc479403143c4d211
2018-08-16 10:14:18 -05:00
Gupta, Sangeet (sg774j)
61584fdb9d kubernetes-keystone-webhook base64 encoding
Changing the chart to accept plain certificates rather than a base64
encoded string. The chart will handle the base64 encoding internally.

Change-Id: I3cd0710652b1b731fa4bcd9e92dd59ce2c436eb6
2018-08-10 15:54:49 -05:00
Seungkyu Ahn
6b6f277e7d Running agents on all nodes.
Using a node selector can not run the fluent-bit or node-exporter
on the master node. So, This PS changes the scheduling to use
either taint/toleration or the node selector.

Change-Id: I0ca80a6e645b7047469288697387f0f5bf111345
2018-08-10 08:40:52 +00:00
Steve Wilkerson
8652e14acb Add auth for prometheus
This adds authentication to Prometheus with an apache reverse
proxy, similar to elasticsearch, kibana and nagios. This adds an
admin user and password via htpasswd along with adding ldap
support.

This required modifying the grafana chart to configure the
prometheus datasource's basic auth credentials in the data sources
provisioning configuration file by checking whether basic auth is
enabled and injecting the username/password defined in the
corresponding endpoint definition.

This also modifies the nagios chart to use the authenticated
endpoint for prometheus, which is required for nagios to
successfully query the prometheus endpoint for its service
checking mechanism

Change-Id: Ia4ccc3c44a89b2c56594be1f4cc28ac07169bf8c
2018-08-08 18:49:45 +00:00
Zuul
c0c5199fb2 Merge "prometheus process-exporter for monitoring host tools" 2018-08-07 22:04:08 +00:00
Pete Birley
89be3269d6 Keystone: Update endpoints to point to current defaults
This PS updates the keysteone endpoints section used in the
webhook authenticator and the prometheus exporter.

Depends-On: https://review.openstack.org/#/c/588651
Change-Id: Ia2df0ec1b783705f7e2ac164a8729d61962e2bc8
Signed-off-by: Pete Birley <pete@port.direct>
2018-08-04 08:08:51 -05:00
rakesh-patnaik
1f4b76c06f prometheus process-exporter for monitoring host tools
Change-Id: I941688bcb0f919afaf4b6e62d2a10544146257f7
2018-08-04 04:53:34 +00:00
Paul Belanger
f4e80dfb5f Switch to fedora-latest for testing
This bumps testing of fedora to 28, and allows openstack-infra to
delete fedora-27 nodes.

Change-Id: Idd38b1e4721b7f53e20ccbc665cb16762ba6132b
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2018-08-03 02:58:58 +00:00
Zuul
a65dd0269c Merge "Revert "Openstack Exporter: Use service domain for service user"" 2018-08-03 01:08:05 +00:00
Zuul
15368dbfcd Merge "Grafana: Update Ceph Dashboards" 2018-08-02 23:22:13 +00:00
Steve Wilkerson
c524931707 Grafana: Update Ceph Dashboards
This fixes two issues with the Ceph dashboards in Grafana: the
first fix addresses an incorrect heading for Utilized Capacity in
the ceph cluster dashboard (was reporting utilized as available),
and the second fix addresses the Pool Usage gauge to accurately
reflect the percentage of the pool used (was incorrectly
multiplying the percentage result by 100 a second time, resulting
in large and inaccurate results)

Change-Id: I024a555cdb82ee181eb414337b84e7ad62717c97
2018-08-02 11:10:33 -05:00
Pete Birley
aac1c4e8c0 Helm-Toolkit: Update tls secret manifest for non public endpoints
This PS updates the tls secret manifest to allow non-public endpoints
to be specified.

Change-Id: I47606e5c8db87fac07febb114334ded710f56ed5
Signed-off-by: Pete Birley <pete@port.direct>
2018-08-02 11:04:42 -05:00
Zuul
61829c0d45 Merge "Rabbitmq: Disable bert capabilities in exporter" 2018-08-02 00:08:56 +00:00
Zuul
ed0af8642d Merge "Gate: specify user for tiller image used for bootstrapping cluster" 2018-08-01 22:59:56 +00:00
Steve Wilkerson
580ba12e9d Rabbitmq: Disable bert capabilities in exporter
This disables the bert capabilities in the rabbitmq exporter, as
the go routines for bert in the exporter are raising panics.
This capability should remain disabled until these issues are
resolved in the exporter.

References:
http://logs.openstack.org/25/587825/2/check/openstack-helm-armada-fullstack-deploy/0d5ecb6/primary/pod-logs/openstack/osh-heat-rabbitmq-rabbitmq-exporter-55b4548bb6-tgwjr/rabbitmq-exporter.txt
http://logs.openstack.org/25/587825/2/check/openstack-helm-armada-fullstack-deploy/0d5ecb6/primary/pod-logs/openstack/osh-cinder-rabbitmq-rabbitmq-exporter-664945b7c5-rsrmd/rabbitmq-exporter.txt

Change-Id: I15d03a8893331d043df589f5498aaa965559bacb
2018-08-01 13:45:19 -05:00
Zuul
a464cc85ae Merge "Quoting node_select_value in Ingress Controller" 2018-08-01 16:50:08 +00:00
Pete Birley
ac27096002 Gate: specify user for tiller image used for bootstrapping cluster
This PS updates the ansible roles to update the user used with
the tiller image used for bootstrapping to allow access to approprate
config files used. This is required for use with the current master
tiller image, which no longer deffaults to the root user.

Change-Id: I61f28a2ebeecb22eb66e0394417b0af3a9116483
Signed-off-by: Pete Birley <pete@port.direct>
2018-08-01 11:01:07 -05:00
Seungkyu Ahn
a430533e6a Quoting node_select_value in Ingress Controller
In most cases, the ingress controller's nodeSelector key and value
are "node-role.kubernetes.io/ingress" and "true".
Using quote to treat the nodeSelector value as a string.

Change-Id: Ie1745629b90795e4d888d85f35565e6d6350e09b
2018-08-01 02:39:05 +00:00
Steve Wilkerson
838b6599de Revert "Openstack Exporter: Use service domain for service user"
This reverts commit cc9944f74a18073dcaa5c82dcc18b3d40ffdb7a2.

Change-Id: Ie21beb43d3ac3d5eb6ae6a06d2b665e017ae470a
2018-07-31 21:29:45 +00:00
Zuul
cb3ad183ca Merge "K8s: update to 1.10.6 in OSH gates" 2018-07-31 18:20:33 +00:00
Zuul
91b83aeabb Merge "Fix to configure RABBIT_CAPABILITIES" 2018-07-31 00:10:48 +00:00
Zuul
d75b90f32d Merge "RabbitMQ: move RabbitMQ chart to OSH-Infra" 2018-07-31 00:10:47 +00:00
Zuul
2906c1e0f8 Merge "Deploy ingress in osh-infra dev and multinode gates" 2018-07-30 23:51:24 +00:00
Pete Birley
9e2d684188 K8s: update to 1.10.6 in OSH gates
This PS bumps the version of k8s used in the gates to 1.10.6

Change-Id: I396fe0c0e276d17eb52bfe289a464b7008b8d4d2
Signed-off-by: Pete Birley <pete@port.direct>
2018-07-30 18:30:59 -05:00
Zuul
70dd71672f Merge "Nagios/Kibana: Update configmap annotations" 2018-07-30 21:20:26 +00:00
Zuul
9c7169e40e Merge "Resources: Fix erroneous resource definitions" 2018-07-30 18:29:10 +00:00
Pete Birley
141a1a98df Fix to configure RABBIT_CAPABILITIES
This PS fixes the rabbitmq exporter configurations.

Now, RABBIT_CAPABILITIES env values can not be set because of dummy
values.

After fix values, it needs to upgrade exporter image version because
of string parsing problem in the exporter.

Additional, bert option is added.

https://github.com/kbudde/rabbitmq_exporter

Change-Id: I2a763b6730bcbef1900f7cd4c5a05066bfffadf2
co-authored-by: DaeSeong Kim <powerds0111@gmail.com>
Signed-off-by: Pete Birley <pete@port.direct>
2018-07-30 12:49:49 -05:00
Steve Wilkerson
6f6c6b8b99 Nagios/Kibana: Update configmap annotations
This changes the ordering of the configmap annotations for kibana,
as older versions of helm require the configmap with the values
template definition for the apache proxy to be listed last. This
was addressed in the elasticsearch-client template but missed in
kibana.

This also adds the configmap hash annotations to the nagios chart
as they were previously missing. It also places them in the
correct order as above

Change-Id: I13befe8684d975f310f2723c5172b8a0f9f365d6
2018-07-30 12:33:17 -05:00
Zuul
2dd762cd8b Merge "Drive apache proxy configuration via values templates" 2018-07-30 16:27:25 +00:00
Zuul
a911bb0a3f Merge "Prometheus: Update command line flags" 2018-07-30 16:04:05 +00:00
Pete Birley
59fc23abdc RabbitMQ: move RabbitMQ chart to OSH-Infra
This PS moves the RabbitMQ chart to OSH-Infra

Story: 2002204
Task: 585554

Change-Id: Ib94f7ea92aacfd35f0a13672d2a94335335575ad
Signed-off-by: Pete Birley <pete@port.direct>
2018-07-30 15:28:34 +00:00
Zuul
5ed8c53334 Merge "Grafana: Update quotes for ldap admin bind password" 2018-07-30 14:06:55 +00:00
Steve Wilkerson
4f78e1f6fc Drive apache proxy configuration via values templates
This proposes defining the apache proxy hosts entirely via values
templates. While complicated on its face, this gives flexibility
by allowing the ability to define the desired authentication
mechanism via values templates. These options can range from
using http basic auth for development purposes to defining more
complex ldap configurations without a need to modify the chart
directly

Change-Id: Ief1b6890444ff90cc9c0ca872087af74836c0771
Signed-off-by: Pete Birley <pete@port.direct>
2018-07-30 07:52:26 -05:00
Steve Wilkerson
9fd4cd0917 Deploy ingress in osh-infra dev and multinode gates
This deploys the ingress chart in the openstack-helm-infra dev
and multinode gates, which allows for enabling ingresses in the
charts where defined

Change-Id: I055c7b02d9af68f6e3c5eda33d69dd0b8b1b70ca
2018-07-30 07:14:00 -05:00
Zuul
dc248124f4 Merge "Openstack Exporter: Use service domain for service user" 2018-07-29 16:07:55 +00:00
Steve Wilkerson
397eebf995 Resources: Fix erroneous resource definitions
This fixes the resource trees for the fluent-logging and
openstack-exporter charts to match the other charts. This
also fixes the elasticsearch master template to use the
correct indentation level for the resource template

Change-Id: Ic6ec270a880216daff10d1f22128c6377ebf9933
2018-07-27 16:35:37 -05:00
Steve Wilkerson
a861c27a34 Prometheus: Update command line flags
This updates the default command line flags for Prometheus. It
explicitly sets the HTTP administrative settings to false and
gives a brief explanation of the security concerns associated
with enabling them

This also removes the honor_labels setting where set to false, as
false is the default setting for honor_labels

Change-Id: I69acdbce604864882d642e44c09a5f0b9c454a61
2018-07-27 16:33:37 -05:00
Steve Wilkerson
cc9944f74a Openstack Exporter: Use service domain for service user
This changes the openstack exporters service user to use the
service domain instead of the default domain

Change-Id: I849814ee96b99e77940904e0d0dfb210a0915560
2018-07-27 12:46:40 -05:00
Pete Birley
a29da8c70f Memcached: move Memcached chart to OSH-Infra
This PS moves the Memcached chart to OSH-Infra

Story: 2002204
Task: 21727

Change-Id: I47a226ba90a84cddcbf4911af4bf23257827e79e
Signed-off-by: Pete Birley <pete@port.direct>
2018-07-26 19:35:13 +00:00
Zuul
a72ef08c59 Merge "Ingress: move ingress chart to OSH-Infra" 2018-07-26 13:19:27 +00:00
Pete Birley
9861d7a92f KubeADM: enable shared pid ns
This PS enables the pod shared pid feature gate in k8s, which allows
the puase container to reap processes when desired.

Change-Id: I01eac64bfa029027465d47c5036119cf5799a100
Signed-off-by: Pete Birley <pete@port.direct>
2018-07-25 16:18:23 +00:00