Commit Graph

12 Commits

Author SHA1 Message Date
Dawn Chen
5e563930c0 Merge pull request #81 from Random-Liu/fix-kernel-monitor-issues
Fix kernel monitor issues
2017-02-10 11:17:17 -08:00
Random-Liu
d281cb8a15 Fix kernel monitor issues:
* Change `unregister_netdevice` to be an event to fix #47.
* Change `KernelPanic` to `KernelOops` because we can't handle kernel
panic currently.
* Use system boot time instead of "StartPattern" to fix #48.
2017-02-09 16:09:27 -08:00
Lantao Liu
f20b892123 Merge pull request #84 from Random-Liu/fix-transition-timestamp
Only change transition timestamp when condition is changed.
2017-02-07 10:41:51 -08:00
Dawn Chen
b66c4df364 Merge pull request #39 from Random-Liu/journald-support
Journald support
2017-02-01 12:41:51 -08:00
Random-Liu
a986976a1d Only change transition timestamp when condition is changed. 2017-01-27 14:48:28 -08:00
fate-grand-order
9ac19a240a correct spelling error in kernel_monitor.go 2017-01-22 22:21:39 +08:00
Random-Liu
c15d463ad5 Finish the journald support 2017-01-19 01:59:09 -08:00
andy xie
2606d52afb check for linux os 2016-12-22 10:30:42 +08:00
Lantao Liu
532f933bd8 This PR:
1) Add lookback support in kernel monitor. After started, Kernel monitor
will check some old logs to detect problems which happened before last
node reboot.
2) Add `lookback` and `startPattern` in kernel monitor configuration.
  * `lookback` specifies how long time kernel monitor should look back.
  * `startPattern` specifies which log indicates the node is started.
  kernel monitor will clear all current node conditions once it finds
  a node start log. This makes sure that old problems won't change the
  node condition.
3) Add support for kernel panic monitoring, the null pointer and divide
0 kernel panic will be surfaced as event. Usually kernel monitor will
report these events during looking back phase.
2016-08-20 19:11:26 -07:00
Lantao Liu
5b07afd325 1. Make source and conditions configurable.
2. Add multiple events and conditions support in problem interface.
2016-06-02 15:32:02 -07:00
Lantao Liu
8759e4d610 Use Patch instead of UpdateStatus. 2016-05-30 19:22:32 -07:00
Lantao Liu
f0312655bd Add first version of node-problem-detector 2016-05-17 15:55:33 -07:00