Commit Graph

  • f89f620909 added new line in the known_modules.json varsha teratipally 2021-01-08 22:09:58 +00:00
  • f564d9092a Merge pull request #510 from jeremyje/nopanic Kubernetes Prow Robot 2021-01-08 14:43:05 -08:00
  • 8c16b56476 Merge pull request #511 from ForestCold/master Kubernetes Prow Robot 2021-01-08 14:19:06 -08:00
  • eb38b4b598 added a new metric to retrieve os features like unknown modules varsha teratipally 2020-12-15 17:31:10 +00:00
  • 041b77bd32 Merge pull request #1 from ForestCold/Update-supported-problem-deamon-list Magic Yami 2021-01-06 14:57:38 -08:00
  • a210b30d36 Update supported problem deamon list Magic Yami 2021-01-06 14:57:05 -08:00
  • a451a892ae Use Fatal instead of panic for go tests. Jeremy Edwards 2020-12-22 03:01:51 +00:00
  • 1da1f28cef Upgrade golang.org/x/sys to prepare for Windows Service. Jeremy Edwards 2020-12-13 06:39:59 +00:00
  • 4ad49bbd84 Merge pull request #503 from vteratipally/label_fix Kubernetes Prow Robot 2020-12-08 22:04:49 -08:00
  • 4dccc1ce24 Merge pull request #493 from vteratipally/kernel_cmdline_parameters Kubernetes Prow Robot 2020-12-08 17:58:18 -08:00
  • 4085da817d renaming splitWords to tokens varsha teratipally 2020-12-08 18:34:54 +00:00
  • aadb16b3d4 Remove Dockerfile.in rewrite hack and use updated arg in Dockerfile Jeremy Edwards 2020-12-05 22:34:16 +00:00
  • 8f2a94fd7e Merge pull request #502 from jeremyje/windows Kubernetes Prow Robot 2020-12-07 22:21:11 -08:00
  • 047958a49c changing the label names as per the standards varsha teratipally 2020-12-08 02:27:22 +00:00
  • ffc46f977d add code to retrieve kernel command line parameters varsha teratipally 2020-11-15 22:54:59 +00:00
  • 4adec4bbc6 Introduce Windows build of Node Problem Detector Jeremy Edwards 2020-12-05 23:54:52 +00:00
  • bf51d6600e Merge pull request #492 from vteratipally/module_stats_branch Kubernetes Prow Robot 2020-12-03 09:51:00 -08:00
  • 1e917af560 Merge pull request #455 from ZYecho/fix_newmessage Kubernetes Prow Robot 2020-11-24 16:14:39 -08:00
  • 6956e6074d Merge pull request #500 from Random-Liu/fix-staging-bucket Kubernetes Prow Robot 2020-11-20 09:44:51 -08:00
  • ed783da499 Change default staging bucket. Lantao Liu 2020-11-20 09:08:01 -08:00
  • 2b50e4af1a add testcases for cos and ubuntu to retrieve modules varsha teratipally 2020-11-19 10:29:12 +00:00
  • 944efce3a6 add code for retrieving kernel modules varsha teratipally 2020-11-13 03:16:14 +00:00
  • 59536256e3 Merge pull request #475 from vteratipally/boot_size_disk v0.8.5 Kubernetes Prow Robot 2020-11-18 14:42:50 -08:00
  • 112d53b10a Merge pull request #497 from vteratipally/fs_types Kubernetes Prow Robot 2020-11-18 10:48:07 -08:00
  • b51cb3219f fix: print result's message when status unknown zhangyue 2020-08-18 16:34:22 +08:00
  • 0c258bb704 Update kernel-monitor.json vteratipally 2020-11-17 13:38:07 -08:00
  • 438d014389 Merge pull request #425 from jsoref/grammar Kubernetes Prow Robot 2020-11-16 21:38:04 -08:00
  • 3abcfb7063 Merge pull request #490 from karan/vendor Kubernetes Prow Robot 2020-11-16 14:06:50 -08:00
  • d8ea2538de Merge pull request #489 from abansal4032/health-check-kubelet-connection Kubernetes Prow Robot 2020-11-16 14:06:42 -08:00
  • cff4a54d6a Merge pull request #488 from vteratipally/io_errors Kubernetes Prow Robot 2020-11-16 14:06:36 -08:00
  • 5919888571 Merge pull request #485 from karan/helm-readme Kubernetes Prow Robot 2020-11-16 14:06:28 -08:00
  • 2d53c0a2a6 Merge pull request #481 from tosi3k/oom-regex-fix Kubernetes Prow Robot 2020-11-16 14:06:20 -08:00
  • 33571a312d Merge pull request #478 from neoseele/master Kubernetes Prow Robot 2020-11-16 14:06:12 -08:00
  • 06e5a875be Merge pull request #430 from wawa0210/linux-only Kubernetes Prow Robot 2020-11-16 14:06:04 -08:00
  • 1550882948 avoid duplicating the disk bytes used metrics based on fstype and mountopts varsha teratipally 2020-11-16 20:10:46 +00:00
  • 35bfe697a5 Merge pull request #484 from karan/trial-metric Kubernetes Prow Robot 2020-11-12 12:00:28 -08:00
  • db35f6a857 bump some dependencies to latest versions Karan Goel 2020-11-09 15:33:13 -08:00
  • 2513756583 Add kubelet apiserver connection fail check in health checker Archit Bansal 2020-11-05 23:31:43 -08:00
  • 925ea7393c Collect CPU load averages in a separate metric Karan Goel 2020-11-03 10:34:21 -08:00
  • f01b5e5cfe Detect I/O errors varsha teratipally 2020-11-06 03:48:33 +00:00
  • d39915d392 fix helm instructions Karan Goel 2020-11-04 11:58:14 -08:00
  • 0fb464c24a Merge pull request #459 from abansal4032/logging-improvements Kubernetes Prow Robot 2020-08-28 17:05:19 -07:00
  • 6b650e785e Adapt OOMKilling pattern to old and new Linux kernels Antoni Zawodny 2020-10-22 15:12:26 +02:00
  • 589411702a fix: node memory metrics are off by 1024 Neil 2020-10-19 17:22:35 +11:00
  • f984abbe2e catching hung task with pattern like taks airflow scheduler: some of the events related to hungtask is not identified varsha teratipally 2020-10-08 23:04:15 +00:00
  • f42281ee26 Merge pull request #459 from abansal4032/logging-improvements v0.8.4 Kubernetes Prow Robot 2020-08-28 17:05:19 -07:00
  • 8c94d5e60c Add logging levels to custom plugin logs. Archit Bansal 2020-08-27 17:07:56 -07:00
  • 7fa34545b7 Merge pull request #458 from abansal4032/logging-improvements Kubernetes Prow Robot 2020-08-27 10:41:53 -07:00
  • 3a9370e01b Log custom plugin stderr only if the status is not ok. Archit Bansal 2020-08-26 23:17:04 -07:00
  • 8a41d4abe3 Merge pull request #453 from vteratipally/docker_failures Kubernetes Prow Robot 2020-08-14 15:26:18 -07:00
  • edfd70a16c Update docker-monitor.json vteratipally 2020-08-11 10:02:17 -07:00
  • fbdd9eec9a Update docker-monitor.json vteratipally 2020-08-11 09:59:46 -07:00
  • 860e6b0145 Merge pull request #452 from vteratipally/add_fstypes Kubernetes Prow Robot 2020-08-07 13:37:57 -07:00
  • 4ce29a95d5 removed the $ symbol as npd handles end of the line varsha teratipally 2020-08-06 01:30:11 +00:00
  • 50127b0512 changed labelname after code review varsha teratipally 2020-08-06 00:43:45 +00:00
  • 4c40b7e468 updated readme varsha teratipally 2020-08-05 21:43:58 +00:00
  • 95237efb4d Detect docker startup failures varsha teratipally 2020-08-05 21:29:11 +00:00
  • e13210157d Add more info to disk metrics varsha teratipally 2020-08-05 21:12:25 +00:00
  • c01ea4f582 Merge pull request #450 from saintube/master Kubernetes Prow Robot 2020-08-04 12:14:21 -07:00
  • 9678892546 Fix typo in custom-plugin-monitor Frame 2020-08-03 17:08:42 +08:00
  • f3ab10eddb Merge pull request #442 from abansal4032/custom-plugin-logs-capture v0.8.3 Kubernetes Prow Robot 2020-07-29 14:18:03 -07:00
  • 6acf5b1edb Capture the logs from stderr of custom plugins. Archit Bansal 2020-07-16 01:08:02 -07:00
  • c3cf941e98 Merge pull request #441 from abansal4032/custom-plugin-log-fix Kubernetes Prow Robot 2020-07-28 09:45:48 -07:00
  • f80f3e0dfa Generate status generation logs from custom plugin run only on condition change. Archit Bansal 2020-07-16 00:51:38 -07:00
  • ca34880303 Merge pull request #444 from abansal4032/health-check-cooldown-fix Kubernetes Prow Robot 2020-07-17 18:32:54 -07:00
  • 27f1e774ef Merge pull request #443 from abansal4032/health-check-enable-repair Kubernetes Prow Robot 2020-07-17 17:48:51 -07:00
  • f56d0a929d Use InactiveExitTimestamp instead of ActiveEnterTimestamp for cooldown period in health check monitor. Archit Bansal 2020-07-15 19:06:07 -07:00
  • 84188cc0aa Set auto-repair=true by default for health check monitors. Archit Bansal 2020-07-15 18:57:53 -07:00
  • 061e977d1c Merge pull request #433 from bengadbois/add-health-checker-image Kubernetes Prow Robot 2020-06-09 10:47:20 -07:00
  • 32f770dd4e docker image: add health-checker binary Ben Gadbois 2020-06-09 08:25:31 -07:00
  • 452818cef8 Merge pull request #426 from abansal4032/health-check-monitor v0.8.2 Kubernetes Prow Robot 2020-05-27 18:02:02 -07:00
  • 44dc4aa6c1 Add health-check-monitor Archit Bansal 2020-05-11 14:19:56 -07:00
  • 9dea1cf665 avoid npd pod schedule on windows node wawa0210 2020-05-27 16:37:15 +08:00
  • 9b587abc13 Grammar Josh Soref 2020-05-10 11:10:38 -04:00
  • 1d03b66f15 Merge pull request #424 from stpabhi/rhel-support Kubernetes Prow Robot 2020-04-15 14:30:45 -07:00
  • 5342a50874 Add rhel support for osversion Abhilash Pallerlamudi 2020-04-15 12:14:29 -07:00
  • 20e0147106 Merge pull request #422 from blackwith/patch-1 Kubernetes Prow Robot 2020-04-08 10:45:43 -07:00
  • 74554c4b26 update system-log-monitor and image version Mathieu Collin 2020-04-08 11:24:56 +02:00
  • 633ced6c8e Merge pull request #421 from majst01/lsblk Kubernetes Prow Robot 2020-03-25 10:17:03 -07:00
  • 70c457e5df Install util-linux to have lsblk binary Stefan Majer 2020-03-25 10:36:47 +01:00
  • c709314cd7 Merge pull request #419 from KohlsTechnology/remedy-system-docs Kubernetes Prow Robot 2020-03-10 14:01:36 -07:00
  • ab5ea72c74 Merge pull request #418 from muff1nman/namespace-option Kubernetes Prow Robot 2020-03-10 09:49:36 -07:00
  • f603f26afa Document Using Descheudler As a Remedy System Sean Malloy 2020-03-08 22:30:51 -05:00
  • 7fd465e195 Add namespace option for events Andrew DeMaria 2020-03-05 19:04:31 -07:00
  • 4ad6227196 Merge pull request #414 from SHLo/patch-1 Kubernetes Prow Robot 2020-02-27 14:12:38 -08:00
  • 925d69a18d fix wording shlo 2020-02-24 11:07:57 +08:00
  • 450c6c3b01 Merge pull request #410 from xueweiz/stats v0.8.1 Kubernetes Prow Robot 2020-02-06 10:49:25 -08:00
  • 8c02c6d4d2 Check metric sanity in e2e tests Xuewei Zhang 2020-02-03 15:09:22 -08:00
  • 83b09277f0 Collect more cpu/disk/memory metrics Xuewei Zhang 2020-01-28 23:59:21 -08:00
  • 9ade82734d Add github.com/prometheus/procfs library Xuewei Zhang 2020-01-31 16:01:33 -08:00
  • 7f9437cba0 Add github.com/shirou/gopsutil/load library Xuewei Zhang 2020-01-31 15:42:22 -08:00
  • aadb2b88d1 Merge pull request #405 from xueweiz/test-pr Kubernetes Prow Robot 2020-01-07 13:40:19 -08:00
  • 140a850b63 Merge pull request #404 from xueweiz/queue Kubernetes Prow Robot 2020-01-06 13:14:16 -08:00
  • fb8304bec8 Rent Boskos project only once per test run. Xuewei Zhang 2020-01-02 18:17:54 -08:00
  • fa7a3d7df1 Fix disk metrics unit and queue_length calculation Xuewei Zhang 2020-01-02 17:01:33 -08:00
  • 0d0bba94e5 Merge pull request #402 from gmemcc/master Kubernetes Prow Robot 2019-12-18 11:57:57 -08:00
  • 5a4ac81186 Only disk_avg_queue_len is distorted on first collection Alex Wong 2019-12-12 14:39:29 +08:00
  • 3d10c892a2 Ignore first collected disk stats to prevent metric distortion Alex Wong 2019-12-11 11:13:43 +08:00
  • 7819ffda7c Merge pull request #400 from xueweiz/patch-1 Kubernetes Prow Robot 2019-12-10 11:32:07 -08:00
  • 6f27c80053 Install ginkgo executable in test/build.sh Xuewei Zhang 2019-12-06 22:09:00 -08:00