2026 Commits

Author SHA1 Message Date
Tin Lam
3121fc24c5 Update egress HTK method
This patch set adds logic to generate Kubernetes egress network policy
rules based on the dependencies specified in values.yaml. It also sets
up the necessary default network policy for the OSH gate.

Change-Id: I1ac649cc9debb5d1f4ea0a32f506dcda4d8b8536
Signed-off-by: Tin Lam <tin@irrational.io>
2019-11-21 20:05:34 +00:00
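As a rough illustration of the idea in 3121fc24c5 (deriving egress rules from declared dependencies), the following Python sketch builds the egress section of a NetworkPolicy spec. It is not the helm-toolkit template itself: the function name and the `application` and `port` dependency keys are hypothetical, and the sample values are made up.

```python
def egress_rules(dependencies):
    """Build NetworkPolicy egress rules from a list of dependency dicts.

    Each dependency uses the hypothetical keys 'application' and 'port'.
    """
    rules = []
    for dep in dependencies:
        rules.append({
            # Allow traffic only to pods labelled with the dependency's name.
            "to": [{"podSelector": {"matchLabels": {"application": dep["application"]}}}],
            "ports": [{"protocol": "TCP", "port": dep["port"]}],
        })
    return rules


deps = [{"application": "mariadb", "port": 3306}]  # hypothetical values
policy_spec = {"policyTypes": ["Egress"], "egress": egress_rules(deps)}
print(policy_spec)
```

Each declared dependency becomes one rule, so a pod can reach only the services it says it depends on.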
Zuul
092709d875 Merge "RabbitMQ Exporter: Replace Direct Values w/ HTK" 2019-11-21 18:22:55 +00:00
Zuul
b6828f9e6a Merge "Add ceph metrics to postrun metrics gathering role" 2019-11-21 17:28:43 +00:00
Steven Fitzpatrick
ca6ad711a4 RabbitMQ Exporter: Replace Direct Values w/ HTK
This change replaces direct references to the exporter port
in values.yaml with calls to helm-toolkit lookup functions.

The referenced port number under the network key is removed,
as the helm-toolkit function will return the port number under
the endpoints key.

Change-Id: Ib6f533c49af5a88fca377920d28d5468d7387892
2019-11-21 12:52:55 +00:00
Zuul
0edd3e18de Merge "Update podManagementPolicy for Prometheus and Alertmanager" 2019-11-21 00:41:03 +00:00
Zuul
d85a41c3c1 Merge "ceph-volume integration to ceph-osd charts" 2019-11-20 22:03:45 +00:00
Steve Wilkerson
ef4cbb3b08 Add ceph metrics to postrun metrics gathering role
This updates the gather-prom-metrics role to include gathering
metrics from the active ceph-mgr endpoint

Change-Id: Icb5d27b6a070e9065f6276725bf06dec7d2cbc0d
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-11-20 21:42:49 +00:00
Steve Wilkerson
c1555920e5 Update podManagementPolicy for Prometheus and Alertmanager
This updates the podManagementPolicy to 'Parallel' for Prometheus
and Alertmanager, as there's no need to handle deploying these
two services in a sequential manner

Change-Id: I2f33b9651bed20c4cb2e0c477ae2227cbf9310cf
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-11-20 21:37:55 +00:00
Phil Sphicas
6ca136bae4 Ingress chart managed VIP fixes cleanup/startup
When the ingress pod (in routed mode, using a managed vip) moves from
one host to another, it is sometimes observed that: 1. the vip interface
is not removed on the original host, and 2. in some network topologies,
the switch fabric is unable to find the new pod.

This change updates the ingress deployment as follows:

Adds a 5s sleep before the shutdown of the ingress container in order to
allow the preStop action of the ingress-vip container to run completely.

Updates the start action of the ingress-vip-init container to check if
the vip is part of an existing connected subnet, and if so, sends a few
gratuitous ARP messages to let the switch fabric build its ARP cache.

Change-Id: I784906865358566f42157dc2133569e4cb270cfa
2019-11-20 07:25:50 -08:00
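The subnet check described in 6ca136bae4 can be sketched in Python as follows. This is only an illustration of the test "is the vip part of a connected subnet"; the chart's own init script is not shown in this log, and the function name and sample addresses are assumptions.

```python
import ipaddress


def vip_in_connected_subnet(vip, connected_subnets):
    """Return True if vip falls inside any of the given CIDR subnets."""
    # Accept a VIP given with or without a prefix length ("10.0.0.5/32").
    address = ipaddress.ip_interface(vip).ip
    return any(
        address in ipaddress.ip_network(subnet, strict=False)
        for subnet in connected_subnets
    )


subnets = ["10.0.0.0/24", "192.168.1.0/24"]  # hypothetical connected subnets
print(vip_in_connected_subnet("10.0.0.5/32", subnets))    # True
print(vip_in_connected_subnet("172.16.0.9/32", subnets))  # False
```

Only when the check succeeds would gratuitous ARP messages be worth sending, since the vip is then reachable at layer 2 on that subnet.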
kranthikirang
41684a3c29 ceph-volume integration to ceph-osd charts
ceph-disk has been deprecated, and ceph-volume is available
starting with the Luminous release. This uplifts the
ceph-osd charts to use ceph-volume, with support
for all of the combinations below:

Filestore:
ceph-disk to ceph-volume
ceph-volume to ceph-volume

Bluestore: (including db, wal combinations)
ceph-disk to ceph-volume
ceph-volume to ceph-volume

Supports different OSDs running different stores,
and upgrades with db and wal combinations.

Upgrading across store types is not supported.

Story: ceph-volume-support
Signed-off-by: Kranthi Guttikonda <kranthi.guttikonda@att.com>
Co-Authored-By: Chinasubbareddy Mallavarapu <cr3938@att.com>
Change-Id: Id8b2e1bda0d35fef2cffed6a5ca5876f3888a1c7
2019-11-20 10:02:08 -05:00
Steve Wilkerson
4e7b8a183e Remove elasticsearch ldap test from osh-infra-logging
This removes the elasticsearch-ldap.sh script from the single node
osh-infra-logging job, as this step does not provide any real
value and is tightly coupled to the elasticsearch version used.
This sort of validation should be reserved for smoke tests in
future helm tests for charts

Change-Id: I7ca4805a8809568cb09c8bab6c239c008528fd6a
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-11-20 12:52:51 +00:00
Chinasubbareddy Mallavarapu
2b42632c9b [ceph-osd] Separate ceph-disk based deployment scripts
This creates a separate folder for ceph-disk based deployments
so that they are easy to maintain when we introduce ceph-volume.

Separate folders for the two tools give us the flexibility to develop or
fix issues and commit code to the respective folders without breaking
deployments based on the other tool.

Change-Id: Ib0099d292a8692dc6676eb5ed624d5d1ef677cfe
2019-11-19 22:05:36 +00:00
Zuul
ac6fa2977c Merge "Prometheus: Update version" 2019-11-19 21:57:03 +00:00
Zuul
3e18a436d2 Merge "Grafana: Update version" 2019-11-19 20:50:37 +00:00
Steve Wilkerson
0c51a9cab8 Prometheus: Update version
This updates the Prometheus version deployed by default from
2.3.2 to 2.12.0

Change-Id: Ic10e02a6b136a7f65fb686f5ef1adf1bcf6a9a9d
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-11-19 12:03:43 -06:00
Zuul
029f94a776 Merge "Grafana - Update cadvisor labels for k8s 1.16" 2019-11-18 20:55:10 +00:00
Steve Wilkerson
1bfa091203 Grafana: Update version
This updates the Grafana version deployed by default from 5.0.0 to
6.2.0

Change-Id: I39b5405cc3f3fe7754ed6544a8388ff912a4ef58
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-11-18 08:49:38 -06:00
Zuul
d0b4803b3c Merge "Add zookeeper chart to osh-infra" 2019-11-17 08:45:01 +00:00
Zuul
fb50a4dbee Merge "Fix K8s version" 2019-11-17 08:34:06 +00:00
Zuul
84596d5eba Merge "Add RabbitMQ ingress Network Policy rules" 2019-11-17 07:18:12 +00:00
Tin Lam
7b332076d7 Fix K8s version
Trivial fix to make all Kubernetes versions consistently 1.16.2.

Change-Id: I51d567c57604150cba2274c153817b4401a8e707
2019-11-17 06:20:33 +00:00
Zuul
7fca2677da Merge "Add default Network Policies for Mariadb Prometheus Exporter" 2019-11-15 16:41:22 +00:00
Steve Wilkerson
608d75ec8d Add zookeeper chart to osh-infra
This proposes adding a zookeeper chart to osh-infra that aligns
with the design patterns laid out by the other charts in osh-infra
and osh.

Change-Id: I25edc58fc951e7f81f7275ade6cf9c97e0afae02
Signed-off-by: Steve Wilkerson <sw5822@att.com>
Co-Authored-By: Steven Fitzpatrick <steven.fitzpatrick@att.com>
2019-11-14 19:51:20 +00:00
Zuul
be29dd6fb6 Merge "Fxing lint errors for Helm 2.16" 2019-11-14 17:33:35 +00:00
Steve Wilkerson
59dac085ce Nagios: Update ceph health check command
This updates the ceph health check command in Nagios to use the
updated plugin that determines the active ceph-mgr instance
endpoint to use before querying for ceph's health. This results in
more robust and reliable reporting of ceph's overall health

Depends-On: https://review.opendev.org/#/c/693900/

Change-Id: I5eeb076e5af3c820dbdcc3cc321cefcb5f85ef8d
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-11-13 08:51:26 -06:00
Bjoern Teipel
b500d69591 Fxing lint errors for Helm 2.16
This commit fixes helm lint errors when linting against
the recent helm version.

Change-Id: I2a940ad1cea406ba923519cd5be188ee1bc409aa
2019-11-12 11:28:22 -06:00
Tin Lam
b4a422a798 Clean up python script
Trivial change. This patch set cleans up a python script.

- Move the comment to a helm-template comment so the python comments do
not get rendered by helm.
- Remove an unused python module.

Change-Id: Id287ddae8904d2cfa88725277bb97cf027a942c3
Signed-off-by: Tin Lam <tin@irrational.io>
2019-11-11 22:45:38 +00:00
Bharat Khare
ab95e311a3 Grafana - Update cadvisor labels for k8s 1.16
This patch set implements the Grafana metrics changes required
for the Kubernetes upgrade to version 1.16. The updates are mostly
specific to cadvisor metric labels, and ensure that all
existing metrics are scraped and available in Prometheus so that
they can be consumed by Grafana and Nagios.

Change-Id: I74369ac49dd3f7d9f3682dd5318a3818a4d3f178
2019-11-11 17:57:09 +00:00
Zuul
7bcb16379e Merge "Update grafana link" 2019-11-11 09:42:14 +00:00
Zuul
c90ffb11f9 Merge "Grafana gridPos y key resolves true in chart json" 2019-11-11 09:42:13 +00:00
Zuul
cd860e9017 Merge "Add missing pod labels for CronJobs" 2019-11-11 09:38:35 +00:00
Zuul
f504a1709d Merge "Update the constraints url" 2019-11-11 09:38:34 +00:00
Evgeny L
f173d6103f Add default Network Policies for Mariadb Prometheus Exporter
Because default policies for the MySQL Prometheus Exporter
are missing, the Pod fails to start.

Change-Id: Ib9f013f97a83da0c2e36f2d38e54ae0a906700e5
2019-11-11 07:46:26 +00:00
Zuul
02af18d5dc Merge "Fix search of max sequence number" 2019-11-11 01:08:01 +00:00
Zuul
58f616fc26 Merge "fix: ceph-osd AppArmor profiles not applied" 2019-11-08 22:49:16 +00:00
Sphicas, Phil (ps3910)
938c6d9bb9 fix: ceph-osd AppArmor profiles not applied
AppArmor annotations require the container name to be applied properly.
Before this change, when overrides are not used, the container name is
ceph-osd-default. When overrides are used, the container name is of the
form ceph-osd-HOSTNAME-SHA, but with an identical HOSTNAME and SHA for
all the daemonsets. However, it is not possible to predict this value,
and as a result, the AppArmor profiles are not applied.

This change removes the customization of the container name, and sets
it to ceph-osd-default, allowing AppArmor annotations to be consistently
applied using:

    pod:
      mandatory_access_control:
        type: apparmor
        ceph-osd-default:
          ceph-osd-default: localhost/profilename

Change-Id: I8b6eda00f77ec7393a4311309f3ff76908d06ae6
2019-11-06 14:48:12 +00:00
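To show why 938c6d9bb9 needs a predictable container name: Kubernetes AppArmor annotations are keyed per container, so the key can only be written in advance if the name is known. The Python sketch below builds such annotations; the helper name is hypothetical, and the profile value is the placeholder from the commit message.

```python
APPARMOR_PREFIX = "container.apparmor.security.beta.kubernetes.io/"


def apparmor_annotations(profiles):
    """Map {container_name: profile} to pod annotation key/value pairs."""
    return {APPARMOR_PREFIX + name: profile for name, profile in profiles.items()}


annotations = apparmor_annotations({"ceph-osd-default": "localhost/profilename"})
print(annotations)
```

With a name of the form ceph-osd-HOSTNAME-SHA, the key above could not be written ahead of time, which matches the failure the commit describes.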
Evgeny L
762dc76b5c Add RabbitMQ ingress Network Policy rules
The patch adds Network Policy ingress rules for RabbitMQ
and Prometheus RabbitMQ exporter.

It also fixes name generation for network policies,
to make sure they do not contain a prohibited '_' symbol,
which may appear in some label names.

Change-Id: I9821983b61d90e73e62c5ac669eefeb4ba9999d2
2019-11-04 18:22:16 +00:00
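The name fix in 762dc76b5c can be illustrated with a small Python sketch. It assumes the fix amounts to replacing '_' with '-', which the commit message does not spell out; the function name and the sample label are hypothetical.

```python
import re


def policy_name_component(label):
    """Turn a label such as 'prometheus_rabbitmq_exporter' into a name-safe string."""
    # Kubernetes object names allow lowercase alphanumerics, '-' and '.',
    # so replace underscores and lowercase the result.
    name = label.replace("_", "-").lower()
    # Raise early if anything else invalid slipped through.
    if not re.fullmatch(r"[a-z0-9]([-a-z0-9.]*[a-z0-9])?", name):
        raise ValueError(f"not a valid name component: {name!r}")
    return name


print(policy_name_component("prometheus_rabbitmq_exporter"))
# prometheus-rabbitmq-exporter
```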
KHIYANI, RAHUL (rk0850)
9d808c96fa Adding missing readOnlyRootFilesystem flag for container security context
Change-Id: Ibc8e259dabad9a162f43df0b7eb4d94e3f3e5c9b
2019-11-04 16:55:39 +00:00
Zuul
81d2d687c8 Merge "Make corrections to pod lifecycle upgrade values" 2019-11-01 14:10:37 +00:00
Zuul
002cc1381d Merge "Fix Prometheus Volume Claim Use Expression" 2019-10-31 21:29:35 +00:00
Steven Fitzpatrick
1971d23da8 Make corrections to pod lifecycle upgrade values
It was observed in some charts' values.yaml that the values defining
lifecycle upgrade parameters were incorrectly placed.

This change aims to correct these instances by adding a deployment-
type subkey corresponding with the deployment types identified in
the chart's templates dir, and indenting the values appropriately.

Change-Id: Id5437b1eeaf6e71472520f1fee91028c9b6bfdd3
2019-10-31 20:34:07 +00:00
Steven Fitzpatrick
84113626bf Fix Prometheus Volume Claim Use Expression
This change updated the expression math so that the threshold value
can be reached.

Change-Id: Iae078d4c78a4403c410ae01e0a13a1dda25d40c7
2019-10-28 16:41:45 -05:00
Steve Wilkerson
74f3eb5824 Ingress: Move ingress objects back to the extensions api
This updates the ingress objects to move them back to the
extensions API.  While 1.16 moves them under the networking
api, they're still rendered and deployed as extensions/ objects.
This move prevents issues from arising where older versions of
kubernetes might still be deployed during an upgrade, as the
move to the networking API is nonfunctional at this time

Change-Id: I814bbc833b5b9f79f34aefc60b9c1f9890bca826
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-10-28 21:31:06 +00:00
Evgeny L
5919aaf60d Add missing pod labels for CronJobs
Pods for some of the CronJobs do not have the correct
application and component labels applied, so they are
unable to start if Network Policies are enabled.

Related-Change: Ie4eed0e9829419b4b2e40e9b712b73a86d6fc3d2
Change-Id: Ieee874bf837c7947e3681e0447d150174c99d880
2019-10-28 20:07:53 +00:00
Zuul
741758746a Merge "openvswitch: add vswitchd dependency on db" 2019-10-28 19:43:59 +00:00
Sphicas, Phil (ps3910)
e4538f714a openvswitch: add vswitchd dependency on db
The openvswitch-vswitchd pod should not start until there is a Ready
openvswitch-vswitchd-db pod on the same node. This change adds the
appropriate dependency to cause it to wait.

Change-Id: I5c827971c99639d2f1c3a24a1761524b3a165421
2019-10-24 10:41:46 -07:00
Zuul
5fdf050b12 Merge "Added missing OSD_WEIGHT variable for Ceph OSD chart" 2019-10-24 17:27:12 +00:00
Zuul
24c1cd4514 Merge "Database backup fix" 2019-10-23 19:16:28 +00:00
Oleksii Grudev
4d808243f0 Fix search of max sequence number
It was observed that, during a galera cluster restart,
the node with the highest seqno is sometimes
determined incorrectly. Investigation showed that the
max function is invoked on a list of string values,
which can lead to incorrect results.
This patch casts each value to an integer before building
the list of seqnos, so that max returns the correct result.

Change-Id: I604ec837f3f2d157c829ab43a44e561879775c77
2019-10-23 15:39:47 +03:00
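The bug fixed in 4d808243f0 is easy to reproduce in isolation. The Python sketch below uses made-up seqno values and a hypothetical helper name; it shows how max behaves on strings compared with integers, not the chart's actual script.

```python
def highest_seqno(raw_seqnos):
    """Return the largest sequence number from an iterable of strings."""
    # Cast each value to int first so max() compares numerically,
    # not lexicographically.
    return max(int(s) for s in raw_seqnos)


seqnos = ["9", "10", "100"]  # hypothetical seqno values read as text

print(max(seqnos))            # string comparison picks "9"
print(highest_seqno(seqnos))  # numeric comparison picks 100
```

String comparison goes character by character, so "9" sorts above "100" and the wrong node would be treated as the most advanced.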
Zuul
1c9cc1cb1d Merge "Update openstack-exporter chart to use new image" 2019-10-22 15:59:42 +00:00