Commit Graph

502 Commits

Author SHA1 Message Date
Kubernetes Prow Robot
06e5a875be Merge pull request #430 from wawa0210/linux-only
avoid npd pod schedule on windows node
2020-11-16 14:06:04 -08:00
Kubernetes Prow Robot
35bfe697a5 Merge pull request #484 from karan/trial-metric
Collect CPU load averages in a separate metric
2020-11-12 12:00:28 -08:00
Karan Goel
925ea7393c Collect CPU load averages in a separate metric 2020-11-09 09:41:52 -08:00
Kubernetes Prow Robot
f42281ee26 Merge pull request #459 from abansal4032/logging-improvements
Add logging levels to custom plugin logs.
v0.8.4
2020-08-28 17:05:19 -07:00
Archit Bansal
8c94d5e60c Add logging levels to custom plugin logs. 2020-08-28 12:51:50 -07:00
Kubernetes Prow Robot
7fa34545b7 Merge pull request #458 from abansal4032/logging-improvements
Log custom plugin stderr only if the status is not ok.
2020-08-27 10:41:53 -07:00
Archit Bansal
3a9370e01b Log custom plugin stderr only if the status is not ok.
Otherwise with plugins that run frequently and report ok status, the
logs are filled with unnecessary noise and significantly increases log
size.
2020-08-27 10:17:05 -07:00
Kubernetes Prow Robot
8a41d4abe3 Merge pull request #453 from vteratipally/docker_failures
Detect docker startup failures
2020-08-14 15:26:18 -07:00
vteratipally
edfd70a16c Update docker-monitor.json
fixed json format error as it doesn't allow trailing commas
2020-08-11 10:02:17 -07:00
vteratipally
fbdd9eec9a Update docker-monitor.json
making DockerContainerStartup failure as temporary
2020-08-11 09:59:46 -07:00
Kubernetes Prow Robot
860e6b0145 Merge pull request #452 from vteratipally/add_fstypes
Add more info to disk metrics
2020-08-07 13:37:57 -07:00
varsha teratipally
4ce29a95d5 removed the $ symbol as npd handles end of the line 2020-08-06 01:30:11 +00:00
varsha teratipally
50127b0512 changed labelname after code review 2020-08-06 00:43:45 +00:00
varsha teratipally
4c40b7e468 updated readme 2020-08-05 21:43:58 +00:00
varsha teratipally
95237efb4d Detect docker startup failures 2020-08-05 21:29:11 +00:00
varsha teratipally
e13210157d Add more info to disk metrics 2020-08-05 21:12:25 +00:00
Kubernetes Prow Robot
c01ea4f582 Merge pull request #450 from saintube/master
Fix typo in custom-plugin-monitor
2020-08-04 12:14:21 -07:00
Frame
9678892546 Fix typo in custom-plugin-monitor 2020-08-03 17:08:42 +08:00
Kubernetes Prow Robot
f3ab10eddb Merge pull request #442 from abansal4032/custom-plugin-logs-capture
Capture the logs from stderr of custom plugins
v0.8.3
2020-07-29 14:18:03 -07:00
Archit Bansal
6acf5b1edb Capture the logs from stderr of custom plugins. 2020-07-29 11:57:05 -07:00
Kubernetes Prow Robot
c3cf941e98 Merge pull request #441 from abansal4032/custom-plugin-log-fix
Generate new status log only on condition change
2020-07-28 09:45:48 -07:00
Archit Bansal
f80f3e0dfa Generate status generation logs from custom plugin run only on condition change. 2020-07-24 09:39:39 -07:00
Kubernetes Prow Robot
ca34880303 Merge pull request #444 from abansal4032/health-check-cooldown-fix
Fix for cooldown time in health checker plugin
2020-07-17 18:32:54 -07:00
Kubernetes Prow Robot
27f1e774ef Merge pull request #443 from abansal4032/health-check-enable-repair
Set auto-repair=true by default for health check monitors.
2020-07-17 17:48:51 -07:00
Archit Bansal
f56d0a929d Use InactiveExitTimestamp instead of ActiveEnterTimestamp for cooldown
period in health check monitor.
2020-07-16 18:53:47 -07:00
Archit Bansal
84188cc0aa Set auto-repair=true by default for health check monitors. 2020-07-15 18:57:53 -07:00
Kubernetes Prow Robot
061e977d1c Merge pull request #433 from bengadbois/add-health-checker-image
docker image: add health-checker binary
2020-06-09 10:47:20 -07:00
Ben Gadbois
32f770dd4e docker image: add health-checker binary 2020-06-09 08:25:31 -07:00
Kubernetes Prow Robot
452818cef8 Merge pull request #426 from abansal4032/health-check-monitor
Add health-check-monitor
v0.8.2
2020-05-27 18:02:02 -07:00
Archit Bansal
44dc4aa6c1 Add health-check-monitor 2020-05-27 14:08:42 -07:00
wawa0210
9dea1cf665 avoid npd pod schedule on windows node 2020-05-27 16:37:15 +08:00
Kubernetes Prow Robot
1d03b66f15 Merge pull request #424 from stpabhi/rhel-support
Add rhel support for osversion
2020-04-15 14:30:45 -07:00
Abhilash Pallerlamudi
5342a50874 Add rhel support for osversion 2020-04-15 13:19:56 -07:00
Kubernetes Prow Robot
20e0147106 Merge pull request #422 from blackwith/patch-1
update system-log-monitor and image version
2020-04-08 10:45:43 -07:00
Mathieu Collin
74554c4b26 update system-log-monitor and image version 2020-04-08 11:24:56 +02:00
Kubernetes Prow Robot
633ced6c8e Merge pull request #421 from majst01/lsblk
Install util-linux to have lsblk binary
2020-03-25 10:17:03 -07:00
Stefan Majer
70c457e5df Install util-linux to have lsblk binary 2020-03-25 11:43:12 +01:00
Kubernetes Prow Robot
c709314cd7 Merge pull request #419 from KohlsTechnology/remedy-system-docs
Document Using Descheudler As a Remedy System
2020-03-10 14:01:36 -07:00
Kubernetes Prow Robot
ab5ea72c74 Merge pull request #418 from muff1nman/namespace-option
Add namespace option for events
2020-03-10 09:49:36 -07:00
Sean Malloy
f603f26afa Document Using Descheudler As a Remedy System
In addition to using draino as a remedy system the k8s descheduler can
also be used as a remedy system.
2020-03-08 22:30:51 -05:00
Andrew DeMaria
7fd465e195 Add namespace option for events 2020-03-05 19:04:31 -07:00
Kubernetes Prow Robot
4ad6227196 Merge pull request #414 from SHLo/patch-1
fix wording
2020-02-27 14:12:38 -08:00
shlo
925d69a18d fix wording 2020-02-24 11:07:57 +08:00
Kubernetes Prow Robot
450c6c3b01 Merge pull request #410 from xueweiz/stats
Collect more CPU/disk/memory metrics
v0.8.1
2020-02-06 10:49:25 -08:00
Xuewei Zhang
8c02c6d4d2 Check metric sanity in e2e tests 2020-02-03 15:38:12 -08:00
Xuewei Zhang
83b09277f0 Collect more cpu/disk/memory metrics 2020-02-03 15:29:45 -08:00
Xuewei Zhang
9ade82734d Add github.com/prometheus/procfs library 2020-01-31 16:02:15 -08:00
Xuewei Zhang
7f9437cba0 Add github.com/shirou/gopsutil/load library 2020-01-31 15:42:57 -08:00
Kubernetes Prow Robot
aadb2b88d1 Merge pull request #405 from xueweiz/test-pr
Rent Boskos project only once per test run.
2020-01-07 13:40:19 -08:00
Kubernetes Prow Robot
140a850b63 Merge pull request #404 from xueweiz/queue
Fix disk metrics unit and queue_length calculation
2020-01-06 13:14:16 -08:00