2142 Commits

Author SHA1 Message Date
Chris Wedgwood
578511cd39 [htk] Increase job default backoffLimit to 1000
Sometimes jobs fail, the default of 6 retries is far too brief to get
logs (which are purged after the final failure); as we need the jobs
to succeed always, having a much higher default here seems prudent.

Change-Id: I7f20a3eb9a98669ae4af657d36a776830b82dfca
2020-01-30 19:52:54 +00:00
Chinasubbareddy Mallavarapu
eacf937221 [ceph-osd] Fix issues with ceph osd init sript
This is to fix the logic to find osd id for wal lvm and also
to find correct lvm device for osd disk.

Change-Id: Id4ee1dbd5c82dcbe9893f81c3ad3b9e18d1f9509
2020-01-30 09:35:41 +00:00
Chinasubbareddy Mallavarapu
63e43d98b7 [ceph-osd] Fix to check osd disk name instead of disk path
This is to fix the logic to use osd device name instaed of whole disk path
while osd initilizing.
also correct the ceph osd ls command to use correct keyring.

Change-Id: I90f0c3fd5d1e1b835326b1c690582990f7ca15cb
2020-01-29 21:31:22 -06:00
Zuul
792b016677 Merge "[ceph-osd] Wait for devices to initialize the osd" 2020-01-29 23:06:17 +00:00
Chinasubbareddy Mallavarapu
9a18198fca [ceph-osd] Wait for devices to initialize the osd
This is to wait for all the osd devices before initializing and also
to add few more checks to make sure disk is used or not .

Change-Id: I68e1d4c8c1ade39f856c69333585dfcba3ea35ab
2020-01-29 14:33:51 -06:00
Huang, Sophie (sh879n)
d135e2c964 Update audit user access for Mariadb
The audit user is granted SELECT permission
for all Mariadb databases and tables.

Change-Id: I621325e4a9d27d3ab0d0bc30b4926ea0fa3fd17e
2020-01-29 18:11:45 +00:00
Zuul
376bd5c066 Merge "Add audit database user for audit purposes" 2020-01-28 23:03:45 +00:00
Koffi Nogbe
914ea2bd60 Add audit database user for audit purposes
This commit adds an audit user to the postgresql database which
will have only SELECT privileges on the postgresql database tables.
This is accomplished by setting up audit user creation parameters
in the Patroni bootstrap environment settings, according to (1).

(1) https://patroni.readthedocs.io/en/latest/ENVIRONMENT.html

Change-Id: Idf1cd90b5d093f12fa4a3c5c794d4b5bbc6c8831
2020-01-28 16:48:29 +00:00
Kabanov, Dmitrii
844d2cd16d [Ceph-rgw] Add bootstrap job
The PS adds bootstrap job for ceph-rgw chart.

Change-Id: I3055e1afe8072277166b8a659c940320720a0588
2020-01-28 01:49:57 +00:00
Zuul
de5dd82ff8 Merge "Update overrides used in apparmor nonvoting check" 2020-01-27 21:32:55 +00:00
Zuul
4572110bc3 Merge "[Ceph] Fix values.yaml" 2020-01-27 21:29:57 +00:00
Zuul
c228b0c454 Merge "[LDAP] Remove duplicate manifests: keys" 2020-01-27 17:58:20 +00:00
Doug Aaser
cf7b8dbb3d Add explicit admin user to Patroni
In this PS we explicitly define the admin user rather than letting
patroni use the default username and password.

Change-Id: I9885314902c3a60e709f96e2850a719ff9586b3d
2020-01-24 21:14:32 +00:00
Tin Lam
8779b976fa [LDAP] Remove duplicate manifests: keys
The values.yaml in the LDAP chart contains a duplicate network_policy:
key in the manifests: section. This patch removes the duplicate.

Change-Id: I677acaf7d96d92fecb93c30782f1e760ab4bec84
Signed-off-by: Tin Lam <tin@irrational.io>
2020-01-23 16:32:03 +00:00
Zuul
5e5d873911 Merge "Do not set CPU resources to ovs-vswitch DPDK pod" 2020-01-22 18:26:08 +00:00
Reddy, Hemachandra (hr858f)
70d93625e8 Do not set CPU resources to ovs-vswitch DPDK pod
When DPDK is enbaled, configuring CPU resource limits
through Kubernetes affects packet throughput adversely.
DPDK PMD cores could not get 100% busy.

They need to be configured by isolating them in host grub
and later through PMD core mask.

Change-Id: Ia80880302b9c5c02fdb1c00cb62f6640860e898e
2020-01-21 23:10:39 +00:00
Sophie Huang
a4568f31e2 Add audit user to Mariadb
An audit user is added to Mariadb with only the SELECT permission
to mysql database user table for database user audit purposes.

Change-Id: I5d046dd263e0994fea66e69359931b7dba4a766c
2020-01-21 19:20:51 +00:00
Chinasubbareddy Mallavarapu
6898fa7f9e [CEPH] check ceph version from daemon
This is to update scripts to check ceph version from daemon directly
instead of client.

Change-Id: I402365a45b8c2a92420c68689c97cb2e9f2d7c0e
2020-01-17 11:46:46 -06:00
Zuul
b0ee644204 Merge "OVS: enable setting threads for handler and revalidator" 2020-01-16 15:43:45 +00:00
Zuul
cc399a08ed Merge "Fix incorrect prometheus alert names in nagios" 2020-01-15 23:43:05 +00:00
Pete Birley
601e6ca47a OVS: enable setting threads for handler and revalidator
This PS enables the ability to configure the handler and
revalidator threads.

See: https://bugs.launchpad.net/ubuntu/+source/openvswitch/+bug/1827264/comments/6
Change-Id: I789da34104ac3cfb6a38bf4435a652da45c55e63
Signed-off-by: Pete Birley <pete@port.direct>
2020-01-15 17:07:24 -06:00
Zuul
042ac52397 Merge "Update Elastic Beats versions to 7.1.0" 2020-01-14 00:09:44 +00:00
Pete Birley
641bb04d4a Apparmor: Update to use the runtime default profile
This moves from using the docker profile to the default
runtime profile - which allows container engines other than
docker to work out of the box.

Change-Id: Ica5a48f8c43b90f07969b41e10dc472a772b5b43
Signed-off-by: Pete Birley <pete@port.direct>
2020-01-13 13:04:35 -06:00
bw6938
f1ffb7dbdb [ceph-rgw] Delete bucket and objects from failed deploy
Validate that the container bucket exist and if so
delete it and its objects that were orphaned from a
a failed deployment helm-tests.

Change-Id: Ibaa6d0f6dd36b319c354b65e43dc6053418f4d1d
2020-01-11 00:01:33 +00:00
Pai, Radhika (rp592h)
370111ee48 Grafana: Updated the ceph OSD expression
In Ceph Cluster Dashboard the OSDs In, OSDs Out, OSDs Down Panel was
showing wrong values. Updated
the expression from "count" to "sum" to show the correct values.

Change-Id: I1959eeb445bf297c1ec696f3867315f05552b03e
2020-01-10 04:45:27 +00:00
Zuul
c2ece6a45a Merge "Support for local storage" 2020-01-09 23:18:16 +00:00
Tin Lam
a43ae25226 Postgresql egress netpol
This patch set places in a default kubernetes egress network
policy for postgresql database chart.

Change-Id: I6caa917faf23becc3a1c09b47f457b8b2db996e4
Signed-off-by: Tin Lam <tin@irrational.io>
2020-01-09 18:50:36 +00:00
Smruti Soumitra Khuntia
2ac08b59b4 Support for local storage
This change adds a means of introducing new storage classes
and local persistent volumes.

Change-Id: I340c75f3d0a1678f3149f3cf62e4ab104823cc49
Co-Authored-By: Steven Fitzpatrick <steven.fitzpatrick@att.com>
2020-01-09 10:24:31 -06:00
Zuul
dac7478002 Merge "Remove duplicate key in memcached" 2020-01-09 16:18:05 +00:00
Zuul
a40a32f43d Merge "Fix LDAP data" 2020-01-09 16:09:07 +00:00
Zuul
5695d40dfa Merge "Get osh-infra netpol gate passing" 2020-01-09 14:41:42 +00:00
Tin Lam
4cdc3a0f4f Fix LDAP data
This patch set fixes a mismatch in the CN in the sample LDAP data.

Change-Id: Ie4c1cc46355e930b6b5bd65b5a55da11df1acd75
Signed-off-by: Tin Lam <tin@irrational.io>
2020-01-09 13:56:54 +00:00
laizhen
9b4931fc78 Remove duplicate key in memcached
Remove the duplicate key'namespace' in the endpoints configuration of oslo_cache

Change-Id: Ib4611f27ffa1146fc3d714c85ec263b379925e31
2020-01-09 13:52:00 +00:00
Steve Wilkerson
45ac5fbe0f Update Elastic Beats versions to 7.1.0
This updates the Elastic Beats charts to 7.1.0 to keep them
aligned with the Kibana and Elasticsearch chart versions, which
is required for compatibility

This also updates the experimental job to use the single node
minikube deployment as opposed to the standard 5 node multinode
deployment

Change-Id: I4baba6ca2ea2f3785f11905138b67979a4501caa
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2020-01-09 13:11:43 +00:00
Tin Lam
c199addf3c Update apiVersion
This patch set updates and tests the apiVersion for rbac.authorization.k8s.io
from v1beta1 to v1 in preparation for its removal in k8s 1.20.

Change-Id: I4e68db1f75ff72eee55ecec93bd59c68c179c627
Signed-off-by: Tin Lam <tin@irrational.io>
2020-01-09 08:59:48 +00:00
Gage Hugo
f9713b0051 Get osh-infra netpol gate passing
This change adds in missing network policy overrides for
fluent-daemonset and prometheus-exporter, as well as removes
existing mariadb network policies overrides that were causing
the network policy check job to fail.

Change-Id: Ib7a33f3d14617f9a9fda264f32cde7729a923193
2020-01-09 04:38:25 +00:00
Steve Wilkerson
bd527e42c6 Update overrides used in apparmor nonvoting check
This updates the overrides used in the apparmor nonvoting job, as
recent changes to the Elasticsearch chart values structure have
resulted in this jobs repeated failure

Change-Id: Id5427cd19a382e72435ab361003bbd5f99d678ce
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2020-01-08 20:08:54 +00:00
Zuul
a78ee39dcb Merge "Prometheus: Add feature-gate support in deployment scripts" 2020-01-08 05:28:40 +00:00
Zuul
f6775eaae0 Merge "Ingress: nginx-ingress-controller 0.26.1+ support" 2020-01-08 00:35:21 +00:00
Steve Wilkerson
ddd5a74319 Prometheus: Add feature-gate support in deployment scripts
This updates the deployment scripts for Prometheus to leverage the
feature gate functionality rather than bash generation of the list
of override files to use for alerting rules

Change-Id: Ie497ae930f7cc4db690a4ddc812a92e4491cde93
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2020-01-07 22:06:19 +00:00
Chinasubbareddy Mallavarapu
e2a60422df [ceph-osd] fix the issue with ceph-osd init
This is to fix the issue with ceph-osd initilization when deployed
with wal and db on same disk as pod restart always trying to prepare
the disk.

this ps will make sure to handle the case and skip the ceph-volume prepare
step in case of already deployed osd disk.

Change-Id: I5c37568f342cb4362a0de0a9c11a52b7aea3e147
2020-01-06 15:27:19 +00:00
Tin Lam
2dcad95fa2 [Ceph] Fix values.yaml
This removes a duplicated key in the values.yaml in the
ceph-client chart.

Change-Id: Iff4969fc1de7f0b1d34d3aac63ffac835c8fc7ed
Signed-off-by: Tin Lam <tin@irrational.io>
2020-01-04 03:45:20 -06:00
Zuul
954a06b3fd Merge "Add bandit check to osh-infra" 2020-01-03 20:51:03 +00:00
Phil Sphicas
927c018d29 Fix ansible docker support task
Removes become: and become_user: when including another role (that
already defines become: true and become_user: root)

Fixes an error occurring in the gates:
ERROR! 'become_user' is not a valid attribute for a IncludeRole

Change-Id: I362eefbe5b09ad64e97b3b541d07db2e6b990613
2020-01-03 08:37:11 -08:00
Steven Fitzpatrick
4fdcff593c Fix incorrect prometheus alert names in nagios
I noticed a some nagios service checks were checking prometheus
alerts which did not exist in our default prometheus configuration.
In one case a prometheus alert did not match the naming convention
of similar alerts.

One nagios service check, ceph_monitor_clock_skew_high, does not
have a corresponding alert  at all, so I've changed it to check the

node_ntmp_clock_skew_high

alert, where a node has the label ceph-mon="enabled".

Change-Id: I2ebf9a4954190b8e2caefc8a61270e28bf24d9fa
2020-01-03 10:30:08 -06:00
Phil Sphicas
e7b4242c3b Ingress: nginx-ingress-controller 0.26.1+ support
nginx-ingress-controller 0.26.1 introduces configurable parameters for
streamPort and profilerPort, and changes the default for statusPort.

This change allows those parameters to be configured, while maintaining
compatibility with earlier versions of nginx-ingress.controller. It also
modifies the default status port value from 18080 to 10246.

Reference: https://github.com/kubernetes/ingress-nginx/blob/master/Changelog.md#0261

Change-Id: I88a7315f2ed47c31b8c2862ce1ad47b590b32137
2020-01-01 13:34:00 -08:00
Phil Sphicas
bcecbad652 Ingress: k8s and ingress version compatibility
k8s 1.14 first enabled Ingress in the networking.k8s.io/v1beta1 API
group, while still serving it in the extensions/v1beta1 API group. The
extensions/v1beta1 API endpoint is deprecated in 1.16 and scheduled for
removal in 1.20. [0]

ingress-nginx 0.25.0 actually uses the networking.k8s.io/v1beta1 API,
which requires updated RBAC rules. [1]

This change updates the ClusterRole used by the ingress service account
to grant access to Ingress resources via either the extensions/v1beta1
or networking.k8s.io/v1beta1 API, aligning with the static manifests
from the kubernetes/ingress-nginx repo [2]. It does not change the
apiVersion used when creating Ingress resources.

[0] https://kubernetes.io/blog/2019/07/18/api-deprecations-in-1-16/
[1] https://github.com/kubernetes/ingress-nginx/releases/tag/nginx-0.25.0
[2] 870be3bcd8/deploy/static/mandatory.yaml (L50-L106)

Change-Id: I67d4dbdb3834ca4ac8ce90ec51c8d6414ce80a01
2020-01-01 13:34:00 -08:00
Zuul
e389f51865 Merge "Kafka - Implement SASL Authentication" 2019-12-31 19:02:38 +00:00
Zuul
e9d93980f3 Merge "Disable kubeadm-aio jobs while issues addressed" 2019-12-31 19:02:37 +00:00
Gage Hugo
30c66126ee Add bandit check to osh-infra
This change adds a non-voting bandit check to openstack-helm-infra
similar to what is ran in the openstack-helm repo.

This check will be made voting in a future change once the current
failures are addressed.

Similarly this check will be modified in a future change to
only be ran when affected python files are changed.

Change-Id: I177940f7b050fbe8882d298628c458bbd935ee89
2019-12-31 18:29:01 +00:00