12 Commits

Author SHA1 Message Date
Radhika Pai
e9fce11161 [Update] NPD systemd-monitor lookback duration
This ps adds the lookback duration of 5m to the systemd-monitor to avoid
looking back indefinitely in journal log and causing the alert to stick around.

Change-Id: Ia32f043c0c7484d0bb92cfc4b68b506eae8e9d72
2021-03-26 15:24:10 +00:00
jinyuan
f33e27cf51 Update rbac api version for kubernetes-node-problem-detector
When using a helm3 to deploy , it fails. Helm3 no more support rbac.authorization.k8s.io/v1beta1 , but v1 can support helm2 and helm3.

Change-Id: I2760befdc20e73989bce5cc581d086de57f91383
2021-02-25 08:49:14 +00:00
Andrii Ostapenko
9d5b9a9e42 Unpin images built with osh-images
Enabling ability to automate testing and auto promotion.
Unpinning ovs, mariadb and node-problem-detector images.

Change-Id: I6256452d575d23f84f4fd5c728437b0e4e9423f3
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-10-01 18:43:13 +00:00
Andrii Ostapenko
1532958c80
Change helm-toolkit dependency version to ">= 0.1.0"
Since we introduced chart version check in gates, requirements are not
satisfied with strict check of 0.1.0

Change-Id: I15950b735b4f8566bc0018fe4f4ea9ba729235fc
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-09-24 12:19:28 -05:00
Zuul
2bfce96304 Merge "Run chart-testing on all charts" 2020-09-17 14:38:19 +00:00
radhika pai
949724ad8f [update] Node problem detector path for conntrack
The path to get the conntrack value was incorrect.
Also the logic of the script is updated to raise conntrack alert.

Change-Id: I4d3ea74396eb726458d05df3d9c9a50fec74cf05
2020-09-11 19:29:30 +00:00
Mohammed Naser
c7a45f166f Run chart-testing on all charts
Added chart lint in zuul CI to enhance the stability for charts.
Fixed some lint errors in the current charts.

Change-Id: I9df4024c7ccf8b3510e665fc07ba0f38871fcbdb
2020-09-11 18:02:38 +03:00
KHIYANI, RAHUL (rk0850)
117fae68e6 Run node-problem-detector from 0 user
This adds the runAsUser flag to node-problem-detector at pod level

Change-Id: I0db38599e037ac38b24e50bbc2228ef1fd264c4e
2020-08-11 19:36:15 +00:00
diwakar thyagaraj
e986c6f8c3 Enable Read-Only for Node-Problem Detector
Change-Id: I1f45455abcd812d2c4df186f7047949230f210fd
Signed-off-by: diwakar thyagaraj <diwakar.chitoor.thyagaraj@att.com>
2020-07-30 23:34:57 +00:00
diwakar thyagaraj
b82a146640 [FIX] Apparmor to Node-problem Detector
Change-Id: I11876e7ca9af3e37071716c34ccdb9361f98828d
Signed-off-by: diwakar thyagaraj <diwakar.chitoor.thyagaraj@att.com>
2020-07-28 21:22:48 +00:00
Pai, Radhika (rp592h)
efac80f2d2 Updated the Node Problem Detector chart
The image for the npd is updated to use from the openstackhelm images
repo rather than the k8 image . The k8 image had some security
vulnerabities.
The version for the image is updated to latest ie v0.8.2.

Added the apparmor file.

Change-Id: I4cb40d8bac0533d516d2105f9589636c81fa4111
2020-07-20 08:55:36 -05:00
Steve Wilkerson
a31bb2b049 Add node-problem-detector chart
This adds a chart for the node problem detector. This chart
will help provide additional insight into the status of the
underlying infrastructure of a deployment.

Updated the chart with new yamllint checks.

Change-Id: I21a24b67b121388107b20ab38ac7703c7a33f1c1
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2020-06-22 13:00:55 -05:00