Commit Graph

29 Commits

Author SHA1 Message Date
Andrew Garrett
a39a7c6e0f Add condition message to event message
If you're using some monitoring solution that aggregates events from
your Kubernetes cluster, having the underlying reason why a condition
triggered could be very useful, especially if you are using custom
plugin monitors.

Co-authored-by: Micah Norman <micnorman@paypal.com>
Signed-off-by: Ryan Eschinger <reschinger@paypal.com>
2022-06-08 21:42:40 +00:00
Jeremy Edwards
adc587f222 Support filelog watching in Windows. 2021-01-13 17:16:46 +00:00
Jeremy Edwards
4adec4bbc6 Introduce Windows build of Node Problem Detector 2020-12-05 23:54:52 +00:00
yuzhiquan
b458f0d028 fix: modify typo 2019-12-03 15:21:57 +08:00
Lantao Liu
be7cc78aa0 Properly close channel when monitor exits.
Signed-off-by: Lantao Liu <lantaol@google.com>
2019-10-25 14:11:39 -07:00
wenjun93
4a4ebc7097 avoid log channel closed caused endless loop 2019-10-25 11:43:49 +08:00
Andrew Stribblehill
09c498ad74 Empty LogPath will use journald's default path. 2019-08-27 01:55:30 +02:00
Zhen Wang
182a9450dd Print monitor config path in the logs 2019-07-30 11:00:47 -07:00
Xuewei Zhang
94af7de97b Report metrics from custom-plugin-monitor 2019-07-25 11:28:38 -07:00
Xuewei Zhang
fbebcf311b Report metrics from system-log-monitor 2019-07-12 14:38:21 -07:00
Xuewei Zhang
a07176073a Add existing monitors into the problem daemon registration hook. 2019-06-13 00:51:17 -07:00
Xuewei Zhang
c6c4e80c9d Remove TestGoroutineLeak unit tests.
We are seeing some flakes on these tests because some goroutine
fluctuation:
https://github.com/kubernetes/node-problem-detector/pull/275#issuecomment-499306727

Removing the tests, as it's robust to test leakage in a soak/stress
test, rather than unit test.
2019-06-06 16:27:59 -07:00
Zhen Wang
1f636381b8 Detect kubelet and container runtime frequent crashes 2018-11-26 22:41:06 -08:00
AdamDang
392ebe9c1b Typo fix in systemlogmonitor/README.md
configurtion->configuration
2018-09-25 10:13:48 +08:00
k8s-ci-robot
aabd369760 Merge pull request #151 from Random-Liu/improve-cpm
Improve cpm
2018-06-22 01:10:05 -07:00
AdamDang
e6e42175fa Typo fix: encounts->encounters
encounts->encounters
2018-06-22 14:04:45 +08:00
Lantao Liu
ee103dd4ac Generate event for condition change and support unknown status. 2018-06-21 15:29:53 -07:00
Andy Xie
10dbfef1a8 add custom problem detector plugin 2017-11-22 10:14:09 +08:00
Random-Liu
f5a7ead8d6 Clarify the limitation of log matching pattern. 2017-06-20 18:11:29 -07:00
Random-Liu
51351f91b2 Cleanup kmsg log wather. 2017-05-30 15:58:45 -07:00
Lantao Liu
be6c516cfd Merge pull request #41 from euank/kmsg-parser
logwatchers: add new kmsg-based kernel log watcher
2017-05-30 15:53:24 -07:00
Euan Kemp
73cba49db0 kmsg: update the docs to reference kmsg parser too 2017-03-09 21:38:11 -08:00
Euan Kemp
9c23921c11 logwatchers/kmsg: add initial kmsg watcher impl
This adds a logwatcher which is able to parse kernel messages directly
from the /dev/kmsg interface. This supports any modern linux distro,
while also avoiding any dependency on libraries (e.g. as journald
needs).
2017-03-09 20:40:49 -08:00
Random-Liu
02d6b89536 Fix journald plugin to only look at the current boot. 2017-03-02 13:57:38 -08:00
fate-grand-order
a756ef48f3 fix misspell "timestamp" 2017-02-21 23:01:30 +08:00
Random-Liu
889d9efbc1 Add unit test for goroutine leak. 2017-02-16 00:08:56 -08:00
Random-Liu
6170b0c87f Add multiple log monitoring support. 2017-02-15 13:15:18 -08:00
Random-Liu
dba47bdc27 Update the README.md. 2017-02-15 13:07:01 -08:00
Random-Liu
10fc831409 Change kernel specific name in code base and change syslog to filelog. 2017-02-15 13:07:01 -08:00