Lantao Liu
6af7bbbe86
Merge pull request #32 from Random-Liu/remove-get-start-point
...
NPD: Remove get start point
2016-09-12 14:08:27 -07:00
Lantao Liu
aa9e268be7
Remove the function getStartPoint, because in current logic, it is not
...
needed anymore.
2016-09-12 14:04:23 -07:00
Dawn Chen
f265f79fd7
Merge pull request #31 from Random-Liu/enable-travis-test
...
NPD: Enable travis test
2016-09-12 14:03:29 -07:00
Lantao Liu
8f6ba9efa6
Add missing godep.
2016-09-09 20:53:52 -07:00
Lantao Liu
cd0f96424b
Add travis ci config.
2016-09-09 20:27:29 -07:00
Lantao Liu
a8f491c0d3
Fix unit test.
2016-09-09 20:00:18 -07:00
Dawn Chen
ea83111c80
Merge pull request #22 from Random-Liu/add-look-back
...
Kernel Monitor: Add look back support and kernel panic handling
2016-08-23 17:13:58 -07:00
Dawn Chen
92a3c35f9a
Merge pull request #30 from Random-Liu/get-node-name-from-downward-api
...
NPD: Get node name from the downward api.
v0.2.0
2016-08-23 16:57:26 -07:00
Lantao Liu
9054dab4c8
Get node name from the downward api.
2016-08-22 17:51:15 -07:00
Lantao Liu
532f933bd8
This PR:
...
1) Add lookback support in kernel monitor. After started, Kernel monitor
will check some old logs to detect problems which happened before last
node reboot.
2) Add `lookback` and `startPattern` in kernel monitor configuration.
* `lookback` specifies how long time kernel monitor should look back.
* `startPattern` specifies which log indicates the node is started.
kernel monitor will clear all current node conditions once it finds
a node start log. This makes sure that old problems won't change the
node condition.
3) Add support for kernel panic monitoring, the null pointer and divide
0 kernel panic will be surfaced as event. Usually kernel monitor will
report these events during looking back phase.
2016-08-20 19:11:26 -07:00
Lantao Liu
09af299a88
Change the wrong image path in pod yaml.
...
Change the wrong image path of pod yaml.
2016-08-11 17:03:02 -07:00
Lantao Liu
501e94bf10
Merge pull request #29 from Random-Liu/get-node-name-from-pod
...
NPD: Get node name from pod
2016-08-11 14:23:46 -07:00
Lantao Liu
5a19ac1868
Get node name from pod, this makes sure that the node
...
name should always be consistent with kubelet.
2016-08-11 14:22:29 -07:00
Dawn Chen
75c7bdb4b1
Merge pull request #20 from Random-Liu/add-kernel-monitor-readme
...
Add README.md for kernel monitor
2016-06-24 17:08:11 -07:00
Lantao Liu
acabf68e06
Add README.md for kernel monitor
2016-06-24 16:19:44 -07:00
Lantao Liu
797c7834fa
Merge pull request #18 from Random-Liu/remove-unnecessary-config
...
Remove unnecessary ENV configuration.
v0.1.0
2016-06-09 17:59:50 -07:00
Lantao Liu
dc7ed6fe06
Remove unnecessary ENV configuration.
2016-06-09 15:27:53 -07:00
Dawn Chen
f13ef4323e
Merge pull request #16 from Random-Liu/add-node-problem-detector-readme
...
Add initial README.md for node-problem-detector.
2016-06-09 08:46:45 -07:00
Lantao Liu
55af4e729f
Add initial README.md for node-problem-detector.
2016-06-08 17:38:34 -07:00
Lantao Liu
b6a6db33e6
Merge pull request #17 from girishkalele/nethealth
...
Containerize the nethealth bandwidth measurement utility
2016-06-08 14:53:34 -07:00
Girish Kalele
b687dfaafc
Containerize the nethealth bandwidth measurement utility
2016-06-07 20:51:30 -07:00
Dawn Chen
db90081e96
Merge pull request #15 from Random-Liu/add-entrypoint-in-dockerfile
...
Add entrypoint and copy kernel monitor config into docker image
2016-06-03 17:17:33 -07:00
Girish Kalele
33a43545ca
Node network health check utility - performs a quick HTTP GET test
2016-06-03 14:26:12 -07:00
Lantao Liu
1e4d6e2800
Add entrypoint and copy kernel monitor config into docker image
2016-06-03 13:34:44 -07:00
Dawn Chen
2b47e14fa4
Merge pull request #13 from Random-Liu/hack-for-unsupported-os-distro
...
Hack for unsupported OS distros.
2016-06-03 11:32:34 -07:00
Lantao Liu
29ff791f08
Hack for unsupported OS distros.
2016-06-03 01:48:26 -07:00
Dawn Chen
9ab546a2e2
Merge pull request #12 from Random-Liu/make-more-things-configurable
...
Make condition and source configurable
2016-06-02 17:42:13 -07:00
Lantao Liu
5b07afd325
1. Make source and conditions configurable.
...
2. Add multiple events and conditions support in problem interface.
2016-06-02 15:32:02 -07:00
Dawn Chen
63b4ba7206
Merge pull request #11 from Random-Liu/use-patch-instead-of-update-status
...
Use patch instead of update status
2016-06-01 16:26:07 -07:00
Lantao Liu
03b28c9b4c
Update Godeps.
2016-05-30 19:22:32 -07:00
Lantao Liu
8759e4d610
Use Patch instead of UpdateStatus.
2016-05-30 19:22:32 -07:00
Lantao Liu
891d858a10
Merge pull request #10 from freeformz/fixFSNotify
...
Vendor the "right" fsnotify and several godep fix.
2016-05-26 16:57:11 -07:00
Edward Muller
3785e2b218
These tags refer to the same commit
2016-05-26 16:01:55 -07:00
Edward Muller
82fc40476a
godep v71+ preserves permissions
2016-05-26 16:00:58 -07:00
Edward Muller
051247ffee
We don't need golang.org/x/sys anymore
...
The version of fsnotify that is vendored doesn't use it.
2016-05-26 16:00:05 -07:00
Edward Muller
28e557f301
Vendor the "right" fsnotify
...
This is the version that hpcloud/tail vendors
2016-05-26 15:55:42 -07:00
Lantao Liu
cce6b46c07
Merge pull request #8 from Random-Liu/fix-godeps
...
Manually merged. Misclick just now...
2016-05-23 12:32:24 -07:00
Lantao Liu
0b95f6dbe9
Add godep dependency golang.org/x/sys/unix
2016-05-23 11:20:43 -07:00
Lantao Liu
64a5cdf199
Merge pull request #6 from Random-Liu/update-gcr-project
...
kubelet.go
2016-05-19 15:26:57 -07:00
Lantao Liu
c405bb8bbe
Change gcr project to official google_containers
2016-05-19 14:25:57 -07:00
Dawn Chen
32d7acc54f
Merge pull request #1 from Random-Liu/add-node-problem-detector
...
Add initial version of node problem detector
2016-05-18 17:30:59 -07:00
Lantao Liu
d96d67dae3
Add Godeps
2016-05-17 15:55:59 -07:00
Lantao Liu
f0312655bd
Add first version of node-problem-detector
2016-05-17 15:55:33 -07:00
Dawn Chen
802acee7e3
Initial commit
2016-05-17 10:37:41 -07:00