1795 Commits

Author SHA1 Message Date
Sreejith Punnapuzha
7314edc57d Fix Grafana helm test exception error
The Grafana helm test is failing with the error below:
"NameError: name 'exception' is not defined"
This is because exception is written in lowercase. Changing
exception to Exception fixes this issue.

Change-Id: I533ae822babb4f063242fee1cd42b5b821519b5f
Signed-off-by: Sreejith Punnapuzha <Sreejith.Punnapuzha@outlook.com>
2019-07-08 14:02:42 +00:00
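The NameError described in the Grafana helm test fix can be reproduced with a minimal sketch like the one below. The function names (`broken_handler`, `fixed_handler`, `describe`) are hypothetical and not taken from the chart's test script. They only illustrate that Python looks up the name in an `except` clause at the moment an exception is raised, so a lowercase `exception` fails only when the handler is actually needed.

```python
def broken_handler():
    """Mimic the bug: the except clause names an undefined `exception`."""
    try:
        raise RuntimeError("simulated test failure")
    except exception:  # noqa: F821 - undefined on purpose (hypothetical repro)
        return "handled"


def fixed_handler():
    """Same logic with the built-in `Exception` class."""
    try:
        raise RuntimeError("simulated test failure")
    except Exception:
        return "handled"


def describe(func):
    """Run a handler and report a NameError instead of crashing."""
    try:
        return func()
    except NameError as err:
        return f"NameError: {err}"


if __name__ == "__main__":
    print(describe(broken_handler))
    print(describe(fixed_handler))
```

Because the lookup happens lazily, a test with this bug can pass as long as nothing goes wrong, and only fails with the NameError once the guarded code raises.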
Zuul
d7520fe006 Merge "Rabbit: Move to deploy the default number of replicas in gate" 2019-07-05 17:51:01 +00:00
Zuul
0afd1fadef Merge "Ceph-Provisioner: wait for helm tests related pods" 2019-07-05 17:50:59 +00:00
Chinasubbareddy Mallavarapu
ddbe20a30b Ceph-Provisioner: wait for helm tests related pods
This updates the logic to wait for pods created by helm tests.

Change-Id: Ic1ee293613a0b050be9fa01a29642bfc9e8de2d7
2019-07-05 10:18:38 -05:00
Pete Birley
fff09d1066 Rabbit: Move to deploy the default number of replicas in gate
This PS moves to deploy the default number of RMQ replicas in the gate.

Change-Id: I36734a64b45adce8de89dfe3b020d0dae0e66d94
Signed-off-by: Pete Birley <pete@port.direct>
2019-07-05 13:13:42 +00:00
Pete Birley
9b5b901104 Rabbit: Ensure node has joined cluster on initial startup
This PS extends the rabbit startup logic to ensure nodes have
actually joined the cluster on startup.

Change-Id: Ib876d9abd89209d0a7972983bdf4daacf5f8f582
Signed-off-by: Pete Birley <pete@port.direct>
2019-07-05 13:11:18 +00:00
Zuul
819cf51083 Merge "Grafana: Disable analytics settings" 2019-07-04 18:36:59 +00:00
Zuul
ce2c0e08d2 Merge "Update helm version to 2.14.1" 2019-07-04 18:36:57 +00:00
Sphicas, Phil (ps3910)
09bf431fe7 MariaDB: Disable SSL chain completion on ingress controller
This PS sets `--enable-ssl-chain-completion=false` for the MariaDB
ingress controller. This is the default for current versions of
the nginx-ingress-controller, but for 0.9.0 needs to be set.

If enableSSLChainCompletion is left on, nginx will attempt to
autocomplete SSL certificate chains with missing intermediate CA
certificates, causing unnecessary network traffic and errors in pod logs.

Change-Id: I088b33fe994281dca6997baa87a6b599c3f10c14
Closes-Bug: #1835364
2019-07-03 23:20:15 -07:00
Zuul
683f755f7e Merge "Add a configmap for mariadb ingress controller" 2019-07-04 01:07:53 +00:00
Zuul
bf752cdf89 Merge "[Ceph Enhancement] Move ceph-defragosds cron job to ceph-client chart" 2019-07-04 00:36:13 +00:00
Zuul
389f27618a Merge "Ceph-Client: Cleanup for unused values" 2019-07-04 00:30:31 +00:00
Zuul
1f305d2939 Merge "Ceph-osd: update helm tests logic for osds" 2019-07-04 00:30:29 +00:00
Zuul
e45a7def59 Merge "Add OpenStack instance panel to grafana dashboard" 2019-07-04 00:09:45 +00:00
Renis Makadia
25f4f17f8e [Ceph Enhancement] Move ceph-defragosds cron job to ceph-client chart
- Move the cron manifests to ceph-client chart
- Keep the script that actually does the work in Ceph-OSD
- With this PS, ceph-defragosds will be started after the Ceph-Client chart
gets deployed. The cronjob will exec into a running OSD pod and
execute the script.

Change-Id: I6e7f7b32572308345963728f2f884c1514ca122d
2019-07-03 21:08:37 +00:00
Yi Wang
c53b1fa86d Add a configmap for mariadb ingress controller
The configmap holds the mariadb ingress controller configuration. It
makes it possible to override the default nginx configuration in the
controller.

Change-Id: I25eb8a237a6f8ad63bde725b1d4f31a928fa7c49
Signed-off-by: Yi Wang <yi.c.wang@intel.com>
2019-07-03 21:07:51 +00:00
Zuul
0c5cc1db7c Merge "Postgres PVC Soak Fix" 2019-07-03 15:50:12 +00:00
Chinasubbareddy Mallavarapu
c8dd26f1d3 Ceph-osd: update helm tests logic for osds
This is to update helm test logic to test and exit if
there are no osds up in the cluster.

This may happen when we miss the ceph-osd label on the nodes.

Change-Id: I98971106e202a9c4fd9d236f368492c6c6498ce1
2019-07-03 10:10:47 -05:00
Zuul
26ab03c56d Merge "Nagios: Add init container for generating hosts" 2019-07-03 14:59:36 +00:00
Zuul
565e1a38f4 Merge "Ignore Selenium test failures in jobs" 2019-07-03 13:28:25 +00:00
Chinasubbareddy Mallavarapu
8275ad7d2f Ceph-Client: Cleanup for unused values
This is just to cleanup client chart for provisioner references.

Change-Id: I4b2e4b0eee244d102b503bee4adbbae857863c8d
2019-07-03 12:32:34 +00:00
Zuul
4ed7bf33b3 Merge "Ceph-Client: Wait for inactive pgs in ceph cluster" 2019-07-03 04:28:45 +00:00
Zuul
2a160101d9 Merge "Libvirt: Add Ubuntu Bionic image for stein openstack" 2019-07-02 23:26:40 +00:00
Zuul
f2cbeb1496 Merge "Ceph-client: update pg validation in helm tests." 2019-07-02 23:10:49 +00:00
Pete Birley
4a8167d718 Libvirt: Add Ubuntu Bionic image for stein openstack
This PS adds a libvirt image based on Ubuntu Bionic for
use with the stein release of nova.

Change-Id: I8a0c524feadd79bc0632b3c4cff2f692b10633de
Signed-off-by: Pete Birley <pete@port.direct>
2019-07-02 20:18:07 +00:00
Zuul
5c1ea97367 Merge "Helm-toolkit: Add snippet template for kubernetes probes" 2019-07-02 19:37:39 +00:00
Steve Wilkerson
c1d9063a86 Ignore Selenium test failures in jobs
This adds the ability to tolerate failures of the selenium tests
in our jobs, as we intermittently see these tests fail. The failure
of these tests should not necessarily indicate failure of the job
overall, so this change prevents exactly that.

Change-Id: I4f97fad96f63d42fdb3bb5b8958dbed3dfd7dfc7
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-07-02 13:30:13 -05:00
Zuul
20f4274aee Merge "Nagios: Update Nagios image, check_prometheus_hosts command" 2019-07-02 17:59:43 +00:00
Chinasubbareddy Mallavarapu
897ebbc75c Ceph-client: update pg validation in helm tests.
This updates the logic to check for incomplete pgs in the ceph
cluster and proceed if there are no incomplete/inactive pgs,
rather than waiting for a healthy ceph cluster.

Change-Id: I026d6cc378053e805680c31d75fdfb40bbb636f5
2019-07-02 12:38:44 -05:00
Aaser, Douglas (da519m)
4a730bb34b Postgres PVC Soak Fix
This patch fixes an issue with Postgres HA where
the PVC which stores the database was filling up with
WAL records and not deleting them due to some
misconfigurations with Postgres. Once the PVC
would fill up, replication would fail across the node
and the database would not be able to start, crashing
the system.

Specifically, archive_mode was turned on, but was not
supplied with a function through which to archive the
logs. When WAL archiving is turned on, old WAL files
cannot be removed until the system has archived them first.
However, since we never told the system how to archive the
files, it would repeatedly fail so the WAL files would
never be cleaned up.

Also in this patch are some small housekeeping items:
- Lowered the wal_keep_segments drastically so Postgres
  can't keep as many WAL segments around to minimize the
  chance of PVC fill issues
- Turned the wal_level from 'logical' to 'hot_standby'
  to keep it consistent with the fact that Patroni uses
  streaming replication and not logical replication
- Removed the autovacuum configurations as they are not
  needed

Change-Id: Id48c3ee9976823b2bdb4395a029fe75476bdaa62
2019-07-02 17:32:46 +00:00
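The misconfiguration described in the Postgres PVC Soak Fix (archiving enabled but no way to archive) can be expressed as a small check. This is a sketch only: `wal_archiving_misconfigured` is a hypothetical helper, and the settings are assumed to be available as a plain dictionary. `archive_mode` and `archive_command` are the PostgreSQL parameter names. The chart's real configuration handling is not shown in this log.

```python
def wal_archiving_misconfigured(settings):
    """Return True when WAL archiving is enabled without an archive command.

    `settings` is assumed to be a dict of PostgreSQL parameters, for
    example {"archive_mode": "on", "archive_command": ""}.
    """
    mode = str(settings.get("archive_mode", "off")).strip().lower()
    command = str(settings.get("archive_command", "")).strip()
    # With archiving on and no command, WAL files can never be marked
    # as archived, so they are never removed and the volume fills up.
    return mode in ("on", "always") and not command


if __name__ == "__main__":
    print(wal_archiving_misconfigured({"archive_mode": "on"}))
    print(wal_archiving_misconfigured({"archive_mode": "off"}))
```

A check like this could run before deployment to flag the combination that led to the PVC filling up.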
Chinasubbareddy Mallavarapu
b69f393b54 Ceph-Client: Wait for inactive pgs in ceph cluster
This is to add wait logic for inactive pgs after adjusting pgs in
manage pool job.

Change-Id: I3353262644ae649ed9f495ac83a2567d9da263ae
2019-07-02 11:50:20 -05:00
Zuul
9559016cbb Merge "Update ldap overrides values file reference" 2019-07-02 16:48:43 +00:00
Zuul
0b45428f90 Merge "Fix templates of alert rules (ceph.rules)" 2019-07-02 16:48:42 +00:00
Steve Wilkerson
b7e2d6839c Helm-toolkit: Add snippet template for kubernetes probes
This adds a basic helm-toolkit snippet template for adding
kubernetes liveness and readiness probes to a container. This adds
flexibility by defining the probes' contents wholesale via values
overrides.

Change-Id: I0862ae59c87b8c0c4e2412030b1801bceb3e3c99
Signed-off-by: Pete Birley <pete@port.direct>
2019-07-02 11:44:11 -05:00
Steve Wilkerson
cb35bd1616 Nagios: Add init container for generating hosts
This updates the Nagios chart to include an init container for
generating the host and host group definitions Nagios requires to
function. The benefit is that Nagios does not need to constantly
attempt to update its host and host group definitions, which
currently triggers a restart of the Nagios service even in cases
where the host file hasn't changed.  With the introduction of an
init container for handling this, we can also remove the service
check definition and command definition for executing the plugin
at periodic intervals

Depends-On: https://review.opendev.org/668197

Change-Id: Id1d63d8c99850b960eb352361d7796162bd6be2f
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-07-02 15:48:12 +00:00
Steve Wilkerson
b2415bf023 Nagios: Update Nagios image, check_prometheus_hosts command
This updates the Nagios image used to the image that is built
out of openstack-helm-images instead of the image hosted in quay.

This new image includes the updated host definition plugin that
uses the kubernetes python client instead of prometheus queries,
so the check_prometheus_hosts command has also been updated to
reflect the change in required arguments

Change-Id: If3440ca9be3227fc48cd698a7d44501e6747bb1e
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-07-02 15:47:39 +00:00
Zuul
5e88d9d636 Merge "ignore errors if set zuul_site_mirror_fqdn fails" 2019-07-02 15:47:19 +00:00
Zuul
ebd59710e9 Merge "Grafana: Remove tests that query API" 2019-07-02 15:47:18 +00:00
sungil
fae650722f Fix templates of alert rules (ceph.rules)
This PS fixes templates that generate errors in alert-manager.

Change-Id: I4201cc353848a8f121c2a755a93c1b462d1ab816
2019-07-02 14:50:37 +00:00
chengli3
13c89f9721 Update ldap overrides values file reference
The ldap overrides values file has been moved to
keystone/values_overrides[1]. This patch updates the reference.

[1]
cede6c0d48 (diff-89208df3c46570cf56141a9353ce27a7)

Change-Id: Ib03bb979dc681a647abd36df77f55fd82e0d4df6
2019-07-02 11:02:06 +08:00
Zuul
3384d68c78 Merge "Ceph-Client: allow helm test to proceed with 80% of osds avialble" 2019-07-01 23:28:38 +00:00
Chinasubbareddy Mallavarapu
a20e7177c6 Ceph-osd: Fix static osd id to variable
This changes the static osd id logic to use a variable, as the
current logic has an issue.
The issue occurs only when we have file-backed journals and
block-backed data, as shown below.

ex:

 storage:
    osd:
      - data:
          type: block-logical
          location: /dev/vdb
        journal:
          type: directory
          location: /var/lib/openstack-helm/ceph/osd/journal-one
      - data:
          type: block-logical
          location: /dev/vdc
        journal:
          type: directory
          location: /var/lib/openstack-helm/ceph/osd/journal-two

Change-Id: I36d08b1b7aa5925831a64c03259098f6c4753c3e
2019-07-01 12:34:46 -05:00
Chinasubbareddy Mallavarapu
961a58dada Ceph-Client: allow helm test to proceed with 80% of osds avialble
This adjusts the helm test logic to proceed with the deployment if 80% of
osds are up and running in the cluster.

Change-Id: I128266fd374426f75928332690e275b7f0175318
2019-07-01 15:11:43 +00:00
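The 80% threshold described in the Ceph-Client change can be sketched as follows. This is a hedged Python illustration, not the chart's actual test script: `enough_osds_up` and its `min_percent` parameter are hypothetical names, and the OSD counts are assumed to have already been obtained from the cluster.

```python
def enough_osds_up(up_osds, total_osds, min_percent=80):
    """Return True when at least `min_percent` percent of OSDs are up.

    Integer arithmetic avoids floating-point rounding at the boundary.
    A cluster that reports no OSDs at all is treated as a failure.
    """
    if total_osds <= 0:
        return False
    return up_osds * 100 >= total_osds * min_percent


if __name__ == "__main__":
    # 4 of 5 OSDs is exactly 80%, so the test would proceed.
    print(enough_osds_up(4, 5))
    # 3 of 5 OSDs is 60%, so the test would not proceed.
    print(enough_osds_up(3, 5))
```

Treating a cluster with no OSDs as a failure is a design choice in this sketch, so that a missing ceph-osd label does not count as passing.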
Kostiantyn Kalynovskyi
1ef750c8bb ignore errors if set zuul_site_mirror_fqdn fails
If the zuul_site_mirror_fqdn env variable is not set, the whole job
fails instead of simply not configuring mirrors during image build.
With this patch, if set_fact fails, mirrors are simply not configured
during image build, as planned in lines 62 and 88 of this playbook.

Change-Id: I049c696c7fb0d7cadb527a9f17dd01a42a671baa
2019-07-01 16:15:37 +03:00
Zuul
24841d6dea Merge "Adding hostname to the node details" 2019-06-29 16:39:09 +00:00
John Lawrence
27ef45017b Adding hostname to the node details
Change-Id: I558c100cbb70475d55122b194ef18287c76a68b3
2019-06-28 13:20:06 +00:00
Zuul
b37405dbe6 Merge "Ceph-Clients: Explicity set prom exporter params" 2019-06-27 16:58:52 +00:00
Zuul
d187f6d234 Merge "Ceph-Mgr: update how we set config params" 2019-06-27 16:51:18 +00:00
Pete Birley
57e1e3ce40 Ceph-Clients: Explicity set prom exporter params
Occasionally the default config can result in attempts
to bind to ipv6, which fail, so we explicitly set the
host to ipv4.

Change-Id: I3c01ed0ef7c84cf779d88386c14f7c7bd2003310
Signed-off-by: Pete Birley <pete@port.direct>
2019-06-27 14:03:58 +00:00
Zuul
d5655e95ea Merge "[Kibana] Add improvements to Selenium tests for Kibana" 2019-06-27 07:07:14 +00:00