Commit Graph

415 Commits

Author SHA1 Message Date
Xiaoyu Zhang(Tim)
7a5cecaa1c typo: delete redundant description. 2019-11-08 08:55:53 +08:00
Kubernetes Prow Robot
ad76b93208 Merge pull request #375 from Random-Liu/fix-channel-close-issue
Properly close channel when monitors exit.
v0.8.0
2019-10-25 14:37:38 -07:00
Lantao Liu
be7cc78aa0 Properly close channel when monitor exits.
Signed-off-by: Lantao Liu <lantaol@google.com>
2019-10-25 14:11:39 -07:00
Kubernetes Prow Robot
705cb01e0c Merge pull request #339 from wenjun93/logmonitor
avoid log channel closed caused endless loop
2019-10-25 11:27:39 -07:00
Kubernetes Prow Robot
bac3429522 Merge pull request #359 from gmemcc/hotfix-closed-channel
fix close of closed channel
2019-10-24 20:57:38 -07:00
wenjun93
4a4ebc7097 avoid log channel closed caused endless loop 2019-10-25 11:43:49 +08:00
Kubernetes Prow Robot
a999207a56 Merge pull request #367 from grosser/grosser/unwrap
untangle plugin runner a bit
2019-10-24 20:29:38 -07:00
Kubernetes Prow Robot
2c14eb1075 Merge pull request #373 from wojtek-t/decrease_heartbeat_frequency
Decrease default frequency of forced heartbeats to 5m
2019-10-24 10:43:12 -07:00
wojtekt
43728fb0fc Decrease default frequency of forced heartbeats to 5m 2019-10-24 10:39:01 +02:00
Michael Grosser
3be50a088a untangle plugin runner a bit
add some docs and make it clearer what is actually going on
(parallel rule execution on start and then on timer)
2019-10-10 15:46:04 -07:00
Kubernetes Prow Robot
c2d850ca10 Merge pull request #371 from rhysemmas/update-readme
Update README
2019-10-08 09:25:12 -07:00
rhysemmas
80e3428d75 Update background section 2019-10-08 14:14:53 +01:00
Kubernetes Prow Robot
a1a7234878 Merge pull request #363 from grosser/grosser/old
remove kubernetes 1.8 support
2019-09-28 19:27:37 -07:00
Kubernetes Prow Robot
850ecf1f12 Merge pull request #368 from xueweiz/problem-state
Add stackdriver exporter endpoint for problem_gauge
2019-09-26 23:41:36 -07:00
Xuewei Zhang
794300af59 Add stackdriver exporter endpoint for problem_gauge 2019-09-26 13:45:17 -07:00
Kubernetes Prow Robot
76865bda54 Merge pull request #356 from mm4tt/heartbeat-period-flag
Make heartbeatPeriod const into a flag
2019-09-26 08:57:07 -07:00
Matt Matejczyk
2e9da8569d Make heartbeatPeriod const into a flag. 2019-09-26 09:59:03 +02:00
Michael Grosser
f77e80a8c4 remove kubernetes 1.8 support 2019-09-25 16:41:13 -07:00
Alex Wong
60e048d2ce fix close of closed channel 2019-09-24 16:07:47 +08:00
Kubernetes Prow Robot
219b408222 Merge pull request #352 from xueweiz/test
Set SSH timeout to 5 minutes
2019-09-19 12:30:25 -07:00
Xuewei Zhang
ec4b615844 Set SSH timeout to 5 minutes 2019-09-19 12:01:16 -07:00
Kubernetes Prow Robot
56f42d902e Merge pull request #353 from xueweiz/family
Allow e2e test to pick up test VM image using image family
2019-09-18 23:45:00 -07:00
Xuewei Zhang
1989ab3681 Allow e2e test to pick up test VM image using image family 2019-09-18 16:09:14 -07:00
Kubernetes Prow Robot
9828ab7f06 Merge pull request #349 from xueweiz/test
Allow e2e test to rent project from Boskos
2019-09-16 12:22:38 -07:00
Kubernetes Prow Robot
5345185ec2 Merge pull request #341 from iranzo/patch-1
Update network_problem.sh
2019-09-15 01:00:37 -07:00
Kubernetes Prow Robot
9870e774d3 Merge pull request #350 from lang710/fixSpelling
fix a spelling error
2019-09-15 00:30:38 -07:00
Lang Chi
28233337fc fix a spelling error
Signed-off-by: Lang Chi <21860405@zju.edu.cn>
2019-09-15 12:31:19 +08:00
Xuewei Zhang
fb7fd239bb Add logic for renting test project from Boskos 2019-09-14 15:22:09 -07:00
Xuewei Zhang
e1939ebc03 Handle vendor change in k8s.io/apimachinery/pkg/util/clock
clock.Clock used to have Tick() method, but is now replaced with
NewTicker() method to prevent leaking. Changed NPD code to adapt to it.

See https://github.com/kubernetes/apimachinery/commit/10ebc22e for more
detail.
2019-09-14 15:22:09 -07:00
Xuewei Zhang
3fc6c7f306 Add vendor code for Boskos
Added replace statement for apache/thrift, since it has been recently
moved from git.apache.org/thrift.git to github.com/apache/thrift, and is
causing `go get` to fail.

See https://github.com/jenkins-x/jx/pull/3321 for more detail.

Commands used:
GO111MODULE=on go get k8s.io/test-infra/boskos/client
GO111MODULE=on go mod vendor
2019-09-14 15:22:09 -07:00
Kubernetes Prow Robot
aea91e385c Merge pull request #335 from xueweiz/sd
Add Stackdriver exporter
2019-09-13 23:36:39 -07:00
Xuewei Zhang
0f0e5eff0f Adding stackdriver exporter 2019-09-12 18:30:00 -07:00
Xuewei Zhang
9e789b5f99 Refactor on metrics so that names for all the views are tracked 2019-09-11 12:07:13 -07:00
Xuewei Zhang
0f2fce56e5 Change host/uptime to GAUGE metrics 2019-09-10 16:58:06 -07:00
Xuewei Zhang
42285cb8db vendor changes 2019-09-10 16:58:06 -07:00
Kubernetes Prow Robot
0fdff95f22 Merge pull request #342 from iranzo/fixtypo
Fixes typo in README
2019-09-06 01:46:56 -07:00
Pablo Iranzo Gómez
eea584e78d Fixes typo in README 2019-09-05 16:26:27 +02:00
Pablo Iranzo Gómez
fa94b42849 Use bashate recommendations on network_problem script 2019-09-05 15:46:45 +02:00
Kubernetes Prow Robot
2a07254f96 Merge pull request #253 from finn-no/master
Empty LogPath will use journald's default path.
v0.7.1
2019-08-27 09:22:41 -07:00
Andrew Stribblehill
09c498ad74 Empty LogPath will use journald's default path. 2019-08-27 01:55:30 +02:00
Kubernetes Prow Robot
6aa308db81 Merge pull request #334 from xueweiz/cumulative
Metric format fixes on host/uptime and disk/*
2019-08-19 12:27:31 -07:00
Xuewei Zhang
82c2368795 Metric format fixes on host/uptime and disk/*
1. host/uptime, disk/io_time and disk/weighted_io should be
counter/cumulative metrics. SO we have to use the Sum aggregation method
rather than LastValue aggregation method (which will declare the metric
as gauge metric).

2. Renamed label "device" for disk/* metrics to "device_name".
This is to clarify that it is device_name (sda1) rather than device_path
(/dev/sda1)
2019-08-16 15:14:54 -07:00
Kubernetes Prow Robot
424b864291 Merge pull request #323 from xueweiz/test
Add a simple e2e test
2019-08-16 14:56:09 -07:00
Xuewei Zhang
f9b5e60a43 Add e2e test for NPD
The first test is a very simple test. It installs NPD on a VM, and then
verifies that NPD reports metric host_uptime in Prometheus format.
2019-08-16 01:33:29 -07:00
Kubernetes Prow Robot
81fcdcebb8 Merge pull request #331 from pigletfly/fix-cm
Move NPD into kube-system namespace
2019-08-13 23:04:24 -07:00
pigletfly
4118c56385 Move NPD into kube-system namespace 2019-08-14 12:06:07 +08:00
Xuewei Zhang
db2dbd1eb2 vendor changes for e2e tests 2019-08-13 17:34:20 -07:00
Kubernetes Prow Robot
a442e71190 Merge pull request #325 from lang710/fixSpelling
fix a spelling error
2019-08-13 10:53:42 -07:00
Lang Chi
4d37d6fb68 fix a spelling error
Signed-off-by: Lang Chi <21860405@zju.edu.cn>
2019-08-13 15:12:01 +08:00
Kubernetes Prow Robot
e280e2075a Merge pull request #320 from wangzhen127/custom-plugin-fix
Don't update condition if status stays False/Unknown for custom plugin
2019-08-07 17:09:18 -07:00