2161 Commits

Author SHA1 Message Date
KHIYANI, RAHUL (rk0850)
cce2e61c16 Add Docker default AppArmor profile to memcached chart
Adding apparmor profile to memcached and memcached-exporter charts

Change-Id: I40ece825d75b6884714b9121d8d501efcbce2f53
2020-02-13 10:51:15 -06:00
Zuul
f17b6de1a2 Merge "Add Docker default AppArmor profile to mariadb" 2020-02-12 20:15:18 +00:00
KHIYANI, RAHUL (rk0850)
483d6f0047 Add Docker default AppArmor profile to mariadb
Change-Id: I256f169d6ff2de71b7218ab522bac9975d971c41
2020-02-12 10:32:22 -06:00
Zuul
b21fdfabad Merge "Fix MariaDB Single Database Restore" 2020-02-11 22:22:06 +00:00
Zuul
69fabcc1e4 Merge "[Ceph-Mon] Check for ceph-mon messenger V2" 2020-02-11 22:21:55 +00:00
Zuul
7fa99287a1 Merge "[ceph-client] Enable Nautilus PG autoscaler for all ceph pools" 2020-02-11 22:18:47 +00:00
Zuul
4a770bb908 Merge "Fix postgresql database backup issue" 2020-02-11 21:11:45 +00:00
Chinasubbareddy Mallavarapu
622f604cbe [Ceph-Mon] Check for ceph-mon messenger V2
This adds a new check to make sure msgr2 is enabled if it is
supported by all of the mons. When mon quorum is lost the
mons revert to the v1 protocol, which results in a Ceph
warning state if v2 is supported by all of the available
mons.

Change-Id: Ib85243d38f122c1993aba945b7ae943eed262dbf
2020-02-10 16:43:19 -06:00
Cliff Parsons
c18ee59aff Fix postgresql database backup issue
Currently postgresql database backup job will fail due to not having
correct permissions on the mounted PVC. This patchset corrects the
permissions on the PVC mount so that the backup pods can write to the
/var/backup directory structure.

Another problem was that pg_dumpall was not able to get the correct
password from the admin_user.conf. This may be due to the extra lines
in the file, so this patchset reads it differently in order to find
the password. This was a change to the backup and restore scripts.

Also there are a number of small corrections made to the error handling
for both backup and restore scripts, to be consistent with the MariaDB
backup/restore scripts.

Change-Id: Ica361764c591099e16d03a0988f73c6976583ceb
2020-02-10 17:38:10 +00:00
Brian Wickersham
41924e1618 [ceph-client] Enable Nautilus PG autoscaler for all ceph pools
enabling pg autoscaler across all pools will ensure pg_num is
automatically adjusted.

https://ceph.io/rados/new-in-nautilus-pg-merging-and-autotuning/

Change-Id: Ic2f635700a32c0b7e8c67ed9571efa520638474c
2020-02-07 21:38:24 +00:00
Tin Lam
aa48b16896 Add train release support
This patch set adds in needed override to support OpenStack Train
release by moving the libvirt version to > 3.0.0.

Change-Id: I36097544024df5c6dfc87a032bd8383be98f1a3a
Signed-off-by: Tin Lam <tin@irrational.io>
2020-02-07 08:56:21 -06:00
Parsons, Cliff (cp769u)
ef9d8392f2 Fix MariaDB Single Database Restore
This patchset fixes a serious database restoration problem where the
user is trying to restore a single database, but in the process of
restoring the database, the script inadvertently also removes all
tables from the other databases.

The root cause was that the mysql "--one-database" restore option
achieves the single database restoration, but somehow corrupts the
other databases. The new approach taken in this patchset is to
create a temporary database user which only has permission to
restore the chosen database, and that will leave the other databases
unharmed. This approach, which can be applied for restoring
individual databases and even database tables, was recommended in (1).
After the database is restored, the temporary user is deleted.

(1) https://mariadb.com/kb/en/restoring-data-from-dump-files/

Also improved some of the error handling as well.

Change-Id: I805c605ed2b424640ad6a0a379b1c0b9c0004e94
2020-02-06 16:17:28 +00:00
Taylor, Stephen (st053q)
92dfac645a [Ceph Nautilus] Fix _checkPGs.py.tpl for Nautilus compatibility
The output of 'ceph pg ls-by-pool' changed format in Nautilus,
which caused the checkPGs.py script to fail in some scenarios.
This change addresses that format change and fixes Nautilus
compatibility in the script. Mimic compatibility is maintained.

Change-Id: I11d8337b548f959d0a4b58b7e8f76720a0371e73
2020-02-04 08:23:17 -07:00
Zuul
3dd0eb0cdf Merge "Fluentd: Update kernel and auth inputs to use systemd" 2020-01-31 22:20:22 +00:00
Zuul
26208fa3a7 Merge "Add ability to add rally cleanup script" 2020-01-31 03:29:01 +00:00
Zuul
210a5187af Merge "Prevent splitbrain during full Galera restart" 2020-01-31 03:29:00 +00:00
Tin Lam
a6b1bd293d Add ability to add rally cleanup script
This patch set provides a way to specify clean up scripts for rally tests
to clean up orphaned resources in the event of rally test failures.

Change-Id: Ifc988002711d34186975988abb33ecd8a9a2fba4
Signed-off-by: Tin Lam <tin@irrational.io>
2020-01-30 21:44:49 +00:00
Chris Wedgwood
578511cd39 [htk] Increase job default backoffLimit to 1000
Sometimes jobs fail, the default of 6 retries is far too brief to get
logs (which are purged after the final failure); as we need the jobs
to succeed always, having a much higher default here seems prudent.

Change-Id: I7f20a3eb9a98669ae4af657d36a776830b82dfca
2020-01-30 19:52:54 +00:00
Chinasubbareddy Mallavarapu
eacf937221 [ceph-osd] Fix issues with ceph osd init sript
This is to fix the logic to find osd id for wal lvm and also
to find correct lvm device for osd disk.

Change-Id: Id4ee1dbd5c82dcbe9893f81c3ad3b9e18d1f9509
2020-01-30 09:35:41 +00:00
Chinasubbareddy Mallavarapu
63e43d98b7 [ceph-osd] Fix to check osd disk name instead of disk path
This is to fix the logic to use osd device name instaed of whole disk path
while osd initilizing.
also correct the ceph osd ls command to use correct keyring.

Change-Id: I90f0c3fd5d1e1b835326b1c690582990f7ca15cb
2020-01-29 21:31:22 -06:00
Zuul
792b016677 Merge "[ceph-osd] Wait for devices to initialize the osd" 2020-01-29 23:06:17 +00:00
Chinasubbareddy Mallavarapu
9a18198fca [ceph-osd] Wait for devices to initialize the osd
This is to wait for all the osd devices before initializing and also
to add few more checks to make sure disk is used or not .

Change-Id: I68e1d4c8c1ade39f856c69333585dfcba3ea35ab
2020-01-29 14:33:51 -06:00
Huang, Sophie (sh879n)
d135e2c964 Update audit user access for Mariadb
The audit user is granted SELECT permission
for all Mariadb databases and tables.

Change-Id: I621325e4a9d27d3ab0d0bc30b4926ea0fa3fd17e
2020-01-29 18:11:45 +00:00
Zuul
376bd5c066 Merge "Add audit database user for audit purposes" 2020-01-28 23:03:45 +00:00
Koffi Nogbe
914ea2bd60 Add audit database user for audit purposes
This commit adds an audit user to the postgresql database which
will have only SELECT privileges on the postgresql database tables.
This is accomplished by setting up audit user creation parameters
in the Patroni bootstrap environment settings, according to (1).

(1) https://patroni.readthedocs.io/en/latest/ENVIRONMENT.html

Change-Id: Idf1cd90b5d093f12fa4a3c5c794d4b5bbc6c8831
2020-01-28 16:48:29 +00:00
Kabanov, Dmitrii
844d2cd16d [Ceph-rgw] Add bootstrap job
The PS adds bootstrap job for ceph-rgw chart.

Change-Id: I3055e1afe8072277166b8a659c940320720a0588
2020-01-28 01:49:57 +00:00
Zuul
de5dd82ff8 Merge "Update overrides used in apparmor nonvoting check" 2020-01-27 21:32:55 +00:00
Zuul
4572110bc3 Merge "[Ceph] Fix values.yaml" 2020-01-27 21:29:57 +00:00
Zuul
c228b0c454 Merge "[LDAP] Remove duplicate manifests: keys" 2020-01-27 17:58:20 +00:00
Doug Aaser
cf7b8dbb3d Add explicit admin user to Patroni
In this PS we explicitly define the admin user rather than letting
patroni use the default username and password.

Change-Id: I9885314902c3a60e709f96e2850a719ff9586b3d
2020-01-24 21:14:32 +00:00
Oleksii Grudev
b0bb8dfa7a Prevent splitbrain during full Galera restart
This patch introduces new cluster status "reboot"
which is set by leader node hence other nodes will
start mysql without "--wsrep-new-cluster" option.
Before this following situation took place:

All pods go down one by one with some offset;
First and second nodes have max seqno;
The script on the first node detects there are no active
backends and starts timeout loop;
The script on the second node detects there are no active
backends and starts timeout loop (with approx. 20 sec offset
from first node) ;
Timeout loop finishes on first node, it checks highest seqno
and lowest hostname  and wins the ability to start cluster.
Mysql is started with “--wsrep-new-cluster” parameter.
Seqno is set to “-1” for this node after mysql startup;
Periodic job syncs values from grastate file to configmap;
Timeout loop finishes on second node. It checks node with
highest seqno and lowest hostname and since seqno is already
“-1” for first node, the second node decides that it should
lead the cluster startup and executes mysql with “--wsrep-new-cluster”
option as well which leads to split brain

Change-Id: Ic63fd916289cb05411544cb33d5fdeed1352b380
2020-01-23 18:45:18 +02:00
Tin Lam
8779b976fa [LDAP] Remove duplicate manifests: keys
The values.yaml in the LDAP chart contains a duplicate network_policy:
key in the manifests: section. This patch removes the duplicate.

Change-Id: I677acaf7d96d92fecb93c30782f1e760ab4bec84
Signed-off-by: Tin Lam <tin@irrational.io>
2020-01-23 16:32:03 +00:00
Zuul
5e5d873911 Merge "Do not set CPU resources to ovs-vswitch DPDK pod" 2020-01-22 18:26:08 +00:00
Reddy, Hemachandra (hr858f)
70d93625e8 Do not set CPU resources to ovs-vswitch DPDK pod
When DPDK is enbaled, configuring CPU resource limits
through Kubernetes affects packet throughput adversely.
DPDK PMD cores could not get 100% busy.

They need to be configured by isolating them in host grub
and later through PMD core mask.

Change-Id: Ia80880302b9c5c02fdb1c00cb62f6640860e898e
2020-01-21 23:10:39 +00:00
Sophie Huang
a4568f31e2 Add audit user to Mariadb
An audit user is added to Mariadb with only the SELECT permission
to mysql database user table for database user audit purposes.

Change-Id: I5d046dd263e0994fea66e69359931b7dba4a766c
2020-01-21 19:20:51 +00:00
Chinasubbareddy Mallavarapu
6898fa7f9e [CEPH] check ceph version from daemon
This is to update scripts to check ceph version from daemon directly
instead of client.

Change-Id: I402365a45b8c2a92420c68689c97cb2e9f2d7c0e
2020-01-17 11:46:46 -06:00
Zuul
b0ee644204 Merge "OVS: enable setting threads for handler and revalidator" 2020-01-16 15:43:45 +00:00
Steve Wilkerson
781e65ac5d Fluentd: Update kernel and auth inputs to use systemd
This updates the overrides provided for deploying fluentd as a
daemonset to get kernel messages from the journal instead of
/var/log/kern.log directly, and also uses the journal to get
messages associated with logging to auth.log (syslog facility
10). This provides additional metadata and
a cleaner interface for gathering these logs via fluentd

Change-Id: I8e832db276095771d6a869e998d7a69795dfee37
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2020-01-16 13:38:52 +00:00
Zuul
cc399a08ed Merge "Fix incorrect prometheus alert names in nagios" 2020-01-15 23:43:05 +00:00
Pete Birley
601e6ca47a OVS: enable setting threads for handler and revalidator
This PS enables the ability to configure the handler and
revalidator threads.

See: https://bugs.launchpad.net/ubuntu/+source/openvswitch/+bug/1827264/comments/6
Change-Id: I789da34104ac3cfb6a38bf4435a652da45c55e63
Signed-off-by: Pete Birley <pete@port.direct>
2020-01-15 17:07:24 -06:00
Zuul
042ac52397 Merge "Update Elastic Beats versions to 7.1.0" 2020-01-14 00:09:44 +00:00
Pete Birley
641bb04d4a Apparmor: Update to use the runtime default profile
This moves from using the docker profile to the default
runtime profile - which allows container engines other than
docker to work out of the box.

Change-Id: Ica5a48f8c43b90f07969b41e10dc472a772b5b43
Signed-off-by: Pete Birley <pete@port.direct>
2020-01-13 13:04:35 -06:00
bw6938
f1ffb7dbdb [ceph-rgw] Delete bucket and objects from failed deploy
Validate that the container bucket exist and if so
delete it and its objects that were orphaned from a
a failed deployment helm-tests.

Change-Id: Ibaa6d0f6dd36b319c354b65e43dc6053418f4d1d
2020-01-11 00:01:33 +00:00
Pai, Radhika (rp592h)
370111ee48 Grafana: Updated the ceph OSD expression
In Ceph Cluster Dashboard the OSDs In, OSDs Out, OSDs Down Panel was
showing wrong values. Updated
the expression from "count" to "sum" to show the correct values.

Change-Id: I1959eeb445bf297c1ec696f3867315f05552b03e
2020-01-10 04:45:27 +00:00
Zuul
c2ece6a45a Merge "Support for local storage" 2020-01-09 23:18:16 +00:00
Tin Lam
a43ae25226 Postgresql egress netpol
This patch set places in a default kubernetes egress network
policy for postgresql database chart.

Change-Id: I6caa917faf23becc3a1c09b47f457b8b2db996e4
Signed-off-by: Tin Lam <tin@irrational.io>
2020-01-09 18:50:36 +00:00
Smruti Soumitra Khuntia
2ac08b59b4 Support for local storage
This change adds a means of introducing new storage classes
and local persistent volumes.

Change-Id: I340c75f3d0a1678f3149f3cf62e4ab104823cc49
Co-Authored-By: Steven Fitzpatrick <steven.fitzpatrick@att.com>
2020-01-09 10:24:31 -06:00
Zuul
dac7478002 Merge "Remove duplicate key in memcached" 2020-01-09 16:18:05 +00:00
Zuul
a40a32f43d Merge "Fix LDAP data" 2020-01-09 16:09:07 +00:00
Zuul
5695d40dfa Merge "Get osh-infra netpol gate passing" 2020-01-09 14:41:42 +00:00