355 Commits

Author SHA1 Message Date
Kubernetes Prow Robot
599ca532e8 Merge pull request #315 from xueweiz/metrics
Report metrics from custom-plugin-monitor
v0.7.0
2019-07-25 11:58:44 -07:00
Xuewei Zhang
94af7de97b Report metrics from custom-plugin-monitor 2019-07-25 11:28:38 -07:00
Kubernetes Prow Robot
b8ce6360d9 Merge pull request #300 from xueweiz/metrics
Report metrics from system-log-monitor
2019-07-12 15:17:06 -07:00
Xuewei Zhang
fbebcf311b Report metrics from system-log-monitor 2019-07-12 14:38:21 -07:00
Kubernetes Prow Robot
dbe7cafe1e Merge pull request #308 from yguo0905/master
Support waiting for kube-apiserver to be ready with timout during NPD startup
2019-07-09 16:50:24 -07:00
Yang Guo
ddb1d76178 Support waiting for kube-apiserver to be ready with timout during NPD startup 2019-07-09 10:24:25 -07:00
Kubernetes Prow Robot
30babe906e Merge pull request #303 from xueweiz/self
Implement host collector as part of system-stats-monitor
2019-07-03 13:38:12 -07:00
Xuewei Zhang
4944ac3e48 Implement host collector as part of system-stats-monitor
Host collector report three things today:
1. Host OS uptime (in seconds)
2. Host kernel version (as a metric label)
3. Host OS version (as a metric label)
2019-06-27 16:40:11 -07:00
Xuewei Zhang
ed16a29ec2 Add github.com/cobaugh/osrelease as dependency
This done via:
GO111MODULE=on go get github.com/cobaugh/osrelease
GO111MODULE=on go mod vendor
2019-06-27 16:40:05 -07:00
Xuewei Zhang
935fab705e Add github.com/shirou/gopsutil/host to vendor
This is needed for a coming PR to measure system uptime.
I separated vendor changes out, because they are larger while easier to
review.

This done via:
GO111MODULE=on go get github.com/shirou/gopsutil/host
GO111MODULE=on go mod vendor
2019-06-27 16:40:05 -07:00
Xuewei Zhang
29b0740f4c Refactor systemstatsmonitor/metric_helper.go into a metrics package 2019-06-27 16:40:05 -07:00
Kubernetes Prow Robot
146dfd70b2 Merge pull request #299 from xueweiz/start
Correctly identify failures in problem daemon starting.
2019-06-27 10:47:22 -07:00
Xuewei Zhang
225de07427 Correctly identify failures in problem daemon starting. 2019-06-26 17:55:11 -07:00
Kubernetes Prow Robot
c95c37532b Merge pull request #292 from wangzhen127/systemd-monitor
Add systemd monitor for kubelet, docker, and containerd restart events
2019-06-20 19:14:38 -07:00
Zhen Wang
ea6a141351 Allow using custom flags in build.sh 2019-06-18 10:26:53 -07:00
Zhen Wang
b94a555dfc Add systemd monitor for kubelet, docker, and containerd restart events 2019-06-18 10:26:53 -07:00
Kubernetes Prow Robot
b667a12ee4 Merge pull request #294 from xueweiz/compile
Allow compilation time disabling for each type of Problem Daemon.
2019-06-17 16:32:15 -07:00
Xuewei Zhang
be2647a686 Allow compilation time disabling for each type of Problem Daemon. 2019-06-17 16:02:45 -07:00
Kubernetes Prow Robot
e10e6cc106 Merge pull request #293 from Random-Liu/do-not-import-plugins-unnecessarily
Do not import plugins unnecessarily.
2019-06-13 20:32:23 -07:00
Lantao Liu
d520ca89bd Build node-problem-detector from a directory.
Signed-off-by: Lantao Liu <lantaol@google.com>
2019-06-13 18:54:23 -07:00
Lantao Liu
f2d17ee77b Do not import plugins unnecessarily.
Signed-off-by: Lantao Liu <lantaol@google.com>
2019-06-13 17:57:53 -07:00
Kubernetes Prow Robot
975dc718a5 Merge pull request #275 from xueweiz/exp
node-problem-detector: report disk queue length in Prometheus format
2019-06-13 15:24:14 -07:00
Xuewei Zhang
cf6624661a Update READMEs 2019-06-13 00:51:17 -07:00
Xuewei Zhang
7ad5dec712 Add disk metrics support. 2019-06-13 00:51:17 -07:00
Xuewei Zhang
23dc265971 Add Prometheus exporter. 2019-06-13 00:51:17 -07:00
Xuewei Zhang
a07176073a Add existing monitors into the problem daemon registration hook. 2019-06-13 00:51:17 -07:00
Xuewei Zhang
63f0e35e56 Implement dynamic problemdaemon registration and initialization.
Added package problemdaemon. All future problem daemons should be
registered by calling problemdaemon.register().

CLI interfaces will be automatically generated for all registered
problem daemons in the form of "--config.DAEMON_NAME"
2019-06-12 18:29:18 -07:00
Xuewei Zhang
5814195ad5 Move apiserver-reporting logic into k8s_exporter.
Added CLI option "enable-k8s-exporter" (default to true). Users can use
this option to enable/disable exporting to Kubernetes control plane.

This commit also removes all the apiserver-specific logic from package
problemdetector.

Future exporters (e.g. to local journald, Prometheus, other control
planes) should implement types.Exporter interface.
2019-06-12 18:29:18 -07:00
Kubernetes Prow Robot
df2bc3df22 Merge pull request #290 from xueweiz/test
Remove TestGoroutineLeak unit tests.
v0.6.4
2019-06-12 18:24:15 -07:00
Xuewei Zhang
c6c4e80c9d Remove TestGoroutineLeak unit tests.
We are seeing some flakes on these tests because some goroutine
fluctuation:
https://github.com/kubernetes/node-problem-detector/pull/275#issuecomment-499306727

Removing the tests, as it's robust to test leakage in a soak/stress
test, rather than unit test.
2019-06-06 16:27:59 -07:00
Kubernetes Prow Robot
831f7fbd75 Merge pull request #289 from xueweiz/dep
Add OpenCensus and gopsutil as dependency
2019-06-05 19:38:11 -07:00
Xuewei Zhang
aa44e54c50 Add OpenCensus and gopsutil as dependency 2019-06-05 17:10:25 -07:00
Kubernetes Prow Robot
b454a42dbd Merge pull request #287 from wangzhen127/bump
Migrate from godep to modules and bump versions
2019-06-05 00:25:54 -07:00
Kubernetes Prow Robot
e2afd71715 Merge pull request #288 from yguo0905/master
Run custom plugins immediately on startup
2019-06-04 13:31:52 -07:00
Zhen Wang
fa804088e9 go mod vendor # for other packages 2019-06-04 12:15:47 -07:00
Zhen Wang
43663b2d8b bump versions of other packages 2019-06-04 12:15:47 -07:00
Zhen Wang
86e0f5da4e Update README on dependency management 2019-06-04 12:15:47 -07:00
Zhen Wang
49d7048a2e Update travis config 2019-06-04 12:15:47 -07:00
Zhen Wang
78634e8a16 go mod vendor # for k8s version bump 2019-06-04 12:15:47 -07:00
Zhen Wang
e4fd4a9e8f bump kubernetes version to 1.14.2 2019-06-04 12:15:40 -07:00
Zhen Wang
6a1beac0eb Update Makefile and remove godep 2019-06-04 11:49:25 -07:00
Yang Guo
468a23d09a Run custom plugins immediately on startup 2019-06-04 09:42:34 -07:00
Zhen Wang
d34cd0c15b go mod vendor 2019-06-04 01:07:41 -07:00
Zhen Wang
6fd01e8c43 git mod init 2019-06-04 01:07:41 -07:00
Kubernetes Prow Robot
520b65018a Merge pull request #286 from xueweiz/lib
Update dependencies in Godep for #275
2019-06-03 23:28:59 -07:00
Xuewei Zhang
ff8025a955 Update golang.org/x/sys/unix
Newer version of golang.org/x/sys/unix is required by github.com/shirou/gopsutil
2019-06-03 15:04:42 -07:00
Xuewei Zhang
d744b1cde5 Add github.com/shirou/gopsutil/disk in Godep 2019-06-03 15:04:42 -07:00
Xuewei Zhang
5fe3709d67 Update github.com/golang/protobuf/proto
OpenCensus requires a newer github.com/golang/protobuf/proto.

This is done via:
go get -u github.com/golang/protobuf/proto
godep update ./...
godep save ./...
2019-06-03 15:04:42 -07:00
Xuewei Zhang
3c0fe2289b Add OpenCensus in Godep
This is generated using below commands:
go get -u -v go.opencensus.io/...
go get -u -v contrib.go.opencensus.io/exporter/prometheus
godep update ./...
godep save ./...
2019-06-03 15:04:42 -07:00
Xuewei Zhang
adc6c0da98 Comment changes in Godeps
This change is purely git tag changes, with no real change.

This change is generated via below commands in an empty go1.9
environment:
mkdir -p $GOPATH/src/k8s.io
cd $GOPATH/src/k8s.io && go get k8s.io/node-problem-detector
cd $GOPATH/src/k8s.io/node-problem-detector
godep restore ./...
godep save ./...
2019-06-03 15:04:42 -07:00