Kubernetes Prow Robot
2a07254f96
Merge pull request #253 from finn-no/master
...
Empty LogPath will use journald's default path.
2019-08-27 09:22:41 -07:00
Andrew Stribblehill
09c498ad74
Empty LogPath will use journald's default path.
2019-08-27 01:55:30 +02:00
Xuewei Zhang
82c2368795
Metric format fixes on host/uptime and disk/*
...
1. host/uptime, disk/io_time and disk/weighted_io should be
counter/cumulative metrics. SO we have to use the Sum aggregation method
rather than LastValue aggregation method (which will declare the metric
as gauge metric).
2. Renamed label "device" for disk/* metrics to "device_name".
This is to clarify that it is device_name (sda1) rather than device_path
(/dev/sda1)
2019-08-16 15:14:54 -07:00
Kubernetes Prow Robot
424b864291
Merge pull request #323 from xueweiz/test
...
Add a simple e2e test
2019-08-16 14:56:09 -07:00
Xuewei Zhang
f9b5e60a43
Add e2e test for NPD
...
The first test is a very simple test. It installs NPD on a VM, and then
verifies that NPD reports metric host_uptime in Prometheus format.
2019-08-16 01:33:29 -07:00
Lang Chi
4d37d6fb68
fix a spelling error
...
Signed-off-by: Lang Chi <21860405@zju.edu.cn >
2019-08-13 15:12:01 +08:00
Kubernetes Prow Robot
e280e2075a
Merge pull request #320 from wangzhen127/custom-plugin-fix
...
Don't update condition if status stays False/Unknown for custom plugin
2019-08-07 17:09:18 -07:00
Zhen Wang
30e20c6a20
Validate that permanent problem has preset default condition
2019-08-01 23:40:16 -07:00
Zhen Wang
2f5d03280a
Don't update condition if status stays False/Unknown for custom plugin
2019-08-01 23:40:16 -07:00
Zhen Wang
182a9450dd
Print monitor config path in the logs
2019-07-30 11:00:47 -07:00
Kubernetes Prow Robot
599ca532e8
Merge pull request #315 from xueweiz/metrics
...
Report metrics from custom-plugin-monitor
2019-07-25 11:58:44 -07:00
Xuewei Zhang
94af7de97b
Report metrics from custom-plugin-monitor
2019-07-25 11:28:38 -07:00
Kubernetes Prow Robot
b8ce6360d9
Merge pull request #300 from xueweiz/metrics
...
Report metrics from system-log-monitor
2019-07-12 15:17:06 -07:00
Xuewei Zhang
fbebcf311b
Report metrics from system-log-monitor
2019-07-12 14:38:21 -07:00
Yang Guo
ddb1d76178
Support waiting for kube-apiserver to be ready with timout during NPD startup
2019-07-09 10:24:25 -07:00
Xuewei Zhang
4944ac3e48
Implement host collector as part of system-stats-monitor
...
Host collector report three things today:
1. Host OS uptime (in seconds)
2. Host kernel version (as a metric label)
3. Host OS version (as a metric label)
2019-06-27 16:40:11 -07:00
Xuewei Zhang
29b0740f4c
Refactor systemstatsmonitor/metric_helper.go into a metrics package
2019-06-27 16:40:05 -07:00
Xuewei Zhang
225de07427
Correctly identify failures in problem daemon starting.
2019-06-26 17:55:11 -07:00
Xuewei Zhang
cf6624661a
Update READMEs
2019-06-13 00:51:17 -07:00
Xuewei Zhang
7ad5dec712
Add disk metrics support.
2019-06-13 00:51:17 -07:00
Xuewei Zhang
23dc265971
Add Prometheus exporter.
2019-06-13 00:51:17 -07:00
Xuewei Zhang
a07176073a
Add existing monitors into the problem daemon registration hook.
2019-06-13 00:51:17 -07:00
Xuewei Zhang
63f0e35e56
Implement dynamic problemdaemon registration and initialization.
...
Added package problemdaemon. All future problem daemons should be
registered by calling problemdaemon.register().
CLI interfaces will be automatically generated for all registered
problem daemons in the form of "--config.DAEMON_NAME"
2019-06-12 18:29:18 -07:00
Xuewei Zhang
5814195ad5
Move apiserver-reporting logic into k8s_exporter.
...
Added CLI option "enable-k8s-exporter" (default to true). Users can use
this option to enable/disable exporting to Kubernetes control plane.
This commit also removes all the apiserver-specific logic from package
problemdetector.
Future exporters (e.g. to local journald, Prometheus, other control
planes) should implement types.Exporter interface.
2019-06-12 18:29:18 -07:00
Xuewei Zhang
c6c4e80c9d
Remove TestGoroutineLeak unit tests.
...
We are seeing some flakes on these tests because some goroutine
fluctuation:
https://github.com/kubernetes/node-problem-detector/pull/275#issuecomment-499306727
Removing the tests, as it's robust to test leakage in a soak/stress
test, rather than unit test.
2019-06-06 16:27:59 -07:00
Yang Guo
468a23d09a
Run custom plugins immediately on startup
2019-06-04 09:42:34 -07:00
Andy Xie
33dffe0761
enable codnition updaet when message change for custom plugin
2018-12-11 13:14:49 +08:00
Zhen Wang
3062622d7c
More fix to custom plugin monitor condition change
2018-11-27 10:59:40 -08:00
k8s-ci-robot
d793330dcd
Merge pull request #203 from andyxning/fix_custom_plugin_monitor_condition_change
...
fix custom plugin monitor condition change
2018-11-27 10:37:42 -08:00
Zhen Wang
1f636381b8
Detect kubelet and container runtime frequent crashes
2018-11-26 22:41:06 -08:00
SataQiu
91adf37050
fix typo: NDDE -> NODE, permenantly -> permanently
2018-11-21 17:36:08 +08:00
Jason Stangroome
38330605c5
Fix the spelling of monitor in the error message
2018-11-20 14:00:30 +11:00
Andy Xie
e3b37719ec
fix custom plugin monitor condition change
2018-11-12 17:57:55 +08:00
AdamDang
392ebe9c1b
Typo fix in systemlogmonitor/README.md
...
configurtion->configuration
2018-09-25 10:13:48 +08:00
Andy Xie
89cfb5261d
bump kubernetes to 1.9
2018-07-09 14:59:51 +08:00
k8s-ci-robot
f479d09e58
Merge pull request #183 from andyxning/adjust_client-go_user-agent
...
adjust client-go User-Agent
2018-06-25 18:15:51 -07:00
Andy Xie
866ae661da
adjust client-go User-Agent
2018-06-24 10:39:28 +08:00
k8s-ci-robot
aabd369760
Merge pull request #151 from Random-Liu/improve-cpm
...
Improve cpm
2018-06-22 01:10:05 -07:00
AdamDang
e6e42175fa
Typo fix: encounts->encounters
...
encounts->encounters
2018-06-22 14:04:45 +08:00
Lantao Liu
ee103dd4ac
Generate event for condition change and support unknown status.
2018-06-21 15:29:53 -07:00
David Ashpole
bf730e9c63
add log-counter go plugin
2018-06-20 15:55:19 -07:00
Lantao Liu
9acad906ff
Merge pull request #158 from cimomo/small-fix
...
Use camelCase instead of snake_case per Golang convention
2018-02-22 22:18:17 -08:00
Tim Hockin
3468934b7d
Pushes go to staging-k8s.gcr.io
2018-02-01 20:11:55 -08:00
Kai Chen
bc08bd0b80
Use camelCase instead of snake_case per Golang convention
2018-01-22 23:42:13 +08:00
Tim Hockin
547c65ef89
Convert registry to k8s.gcr.io
2017-12-22 09:55:16 -08:00
Andy Xie
10dbfef1a8
add custom problem detector plugin
2017-11-22 10:14:09 +08:00
Cao Shufeng
b939fb575a
return an error when error happens in SetConditions()
2017-08-23 17:56:15 +08:00
Random-Liu
f5a7ead8d6
Clarify the limitation of log matching pattern.
2017-06-20 18:11:29 -07:00
Random-Liu
51351f91b2
Cleanup kmsg log wather.
2017-05-30 15:58:45 -07:00
Lantao Liu
be6c516cfd
Merge pull request #41 from euank/kmsg-parser
...
logwatchers: add new kmsg-based kernel log watcher
2017-05-30 15:53:24 -07:00