divolgin
ca51e92878
Allow memory writers
2021-09-30 18:25:52 +00:00
divolgin
6d0a57b16e
Don't panic when no data is collected
2021-09-29 21:25:28 +00:00
Jalaja Ganapathy
8a29442a2a
Remove ID from host preflight spec ( #438 )
2021-09-29 09:49:54 -07:00
divolgin
299497c0c0
Merge pull request #429 from danbudris/copyFromHostForCpNodes
...
add toleration to copy-from-host daemonset to allow collection from CP nodes
2021-09-29 09:01:14 -07:00
divolgin
0e8bedc281
Save collector data to disk directly
2021-09-29 00:15:02 +00:00
Jalaja Ganapathy
eb795c98b6
fix serializer for unique id ( #432 )
2021-09-24 14:20:37 -07:00
Jalaja Ganapathy
a0b3b3f7dc
add an unique id to each host preflights ( #431 )
...
* add an unique id to each host preflights
* auto generated files
* updated schemas for the new field id
* keeping it consistent with the rest of the spec
2021-09-24 13:29:14 -07:00
danbudris
67987a4432
add toleration to allow copy-from-host daemonset to run on CP nodes
2021-09-23 17:53:57 -04:00
Salah Aldeen Al Saleh
1bdd3db8c5
update schemas ( #428 )
...
* update schemas
* update controller-gen
2021-09-23 11:03:19 -07:00
Salah Aldeen Al Saleh
880c7dc3ea
ability to specify a list of namespaces for the cluster resources collector ( #424 )
...
* ability to specify a list of namespaces for the cluster resources collector
2021-09-23 08:02:05 -07:00
divolgin
afa08e5362
Analyzers should not return multiple results
2021-09-22 22:50:38 +00:00
Salah Aldeen Al Saleh
0c7fede7b6
check for nil analyzers ( #421 )
2021-09-21 12:12:10 -07:00
Salah Aldeen Al Saleh
465a533640
store analysis in the support bundle ( #417 )
...
* store analysis in the support bundle
2021-09-10 11:58:16 -07:00
Andrew Reed
91eb94baaa
Weave report analyzers
...
The IPAM pool analyzer checks that utilization of the pod IP subnet is
less than 85%. For example, if using 10.32.0.0/12, this analyzer will
warn if 3,482 IPs are currently allocated to pods.
The pending allocation analyzer checks that the IPAM status in the
report has no items for the PendingAllocates field. This indicates the
IPAM service is not ready according to the code in the weave status
template
e3712152d2/prog/weaver/http.go (L186) .
The weave connections analyzer checks that all connections to remote
peers are in the established state. The state will be "pending" if UDP
is blocked between nodes and will be "failed" if the weave pod on the
remote node is in a crash loop. To force a pending state for testing,
run the commands `iptables -A INPUT -p udp --dport 6784 -j REJECT` and
`iptables -A INPUT -p udp --dport 6783 -j REJECT` on a peer.
The weave connections analyzer also checks that all connections are
using the fastdp protocol. A commopn issue seen in the field on
CentOS/RHEL 7 is that some sides of a connection are using fastdp and
other sides have fallen back to sleeve. Set the WEAVE_NO_FASTDP env var
on the weave daemonset to "true" to test this analyzer.
2021-09-08 21:29:38 +00:00
Andrew Lavery
1b65d1a544
Merge pull request #413 from replicatedhq/laverya/collect-jobs-and-cronjobs
...
collect jobs and cronjobs as part of cluster-resources
2021-09-03 17:25:41 -04:00
Andrew Lavery
7fcc951c9a
collect jobs and cronjobs as part of cluster-resources
2021-09-03 15:46:03 -05:00
Dan Stough
0478a7a60f
fix: cluster-res collector fixed to one namespace
2021-09-03 19:23:44 +00:00
Salah Aldeen Al Saleh
c7af0dc593
fix openshift cluster detection ( #408 )
2021-08-24 09:51:12 -07:00
John Murphy
fd3b32293c
default result only when no other result exists ( #398 )
2021-07-28 11:19:41 -05:00
Kyle Sorensen
2977f8f0d3
Stop longhorn false positives on no results. ( #397 )
...
Longhorn analyzer no longer report positive results on no results
2021-07-28 09:37:54 -06:00
Kyle Sorensen
bf7d658313
troubleshoot enables collecting all data from a configmap ( #395 )
...
Enabled collecting all data from a ConfigMap instead of by key
2021-07-26 13:00:06 -06:00
emosbaugh
e2dbb623e3
Merge pull request #396 from replicatedhq/emosbaugh/20210726/cleanup
...
remove debug log
2021-07-26 11:38:29 -06:00
Joris 'Josh' De Winne
6349ae8aee
Adding support for inverted regex ( #370 )
2021-07-26 13:06:30 -04:00
Ethan Mosbaugh
851c91b582
remove debug log
2021-07-26 16:28:11 +00:00
emosbaugh
cf4d510413
Merge pull request #393 from replicatedhq/collect-copy-extractArchive
...
Copy collectors extractArchive property
2021-07-23 08:55:10 -06:00
John Murphy
e0f6cab5b3
Fix removes control characters from non interactive preflight runs ( #394 )
2021-07-23 09:46:36 -05:00
Ethan Mosbaugh
cf7864cd97
Copy collectors extractArchive property
2021-07-23 13:37:57 +00:00
emosbaugh
8dcfa9886d
Copy from host collector ( #391 )
...
* Copy from host collector
* namespace improvements
* better support for multiple nodes
2021-07-22 12:25:59 -07:00
John Murphy
6007f15253
fixed issue where warnings are disseminated along with passes ( #390 )
2021-07-22 08:27:39 -05:00
Andrew Lavery
ba65f92ca2
Merge pull request #389 from replicatedhq/laverya/simplify-text-analyze-code
...
simplify the text analyze code by combining with compareRegex code
2021-07-20 16:32:38 -07:00
Andrew Lavery
6a0fb2e19c
greatly improve coverage by adding regex group tests
2021-07-20 19:15:09 -04:00
Andrew Lavery
6861660460
simplify the text analyze code by combining with compareRegex code
2021-07-20 18:43:09 -04:00
kwsorensen
1ed6100ac8
Feature/validate tcp load balancer address ( #387 )
...
Load Balancer Validation part of troubleshoot pre-flight checks
2021-07-14 14:30:47 -06:00
emosbaugh
39350b5722
ConfigMap collector and secrets can be collected by selectors ( #384 )
...
* ConfigMap collector and secrets can be collected by selectors
* follow docs
* Pass context and kubernetes client to collectors
* collect tests
* analyze tests
* fix tests
* improvements
2021-07-08 16:30:26 -07:00
Andrew Reed
c95dc489a2
Accumulate all longhorn pass results
...
If there are any error or warning results then return those. Otherwise
return a single healthy pass result.
2021-07-08 18:25:10 +00:00
John Murphy
c119a16235
Fixed bugs introduced by handling multiple results in host preflights ( #383 )
...
Fixed bug caused by host preflights not handling empty when clauses, this cropped up because we now handle multiple host preflight results. Also expanded test coverage and added integration test script.
2021-07-08 11:08:53 -05:00
John Murphy
d730e6cad6
fixed tests
2021-07-06 08:42:12 -05:00
John Murphy
7e32de464a
implemented code review suggestion
2021-07-06 08:42:12 -05:00
John Murphy
ae4c07027b
host preflights can produce multiple results
2021-07-06 08:42:12 -05:00
John Murphy
eef54d0021
force timezone to upper case
2021-07-06 08:42:12 -05:00
divolgin
3382d29e96
Merge pull request #379 from replicatedhq/divolgin/apis
...
Update troubleshoot api schema
2021-07-01 11:32:24 -07:00
divolgin
7381d5086c
Update troubleshoot api schema
2021-07-01 17:24:00 +00:00
Andrew Reed
1ed8532663
Speed up replica checksum
2021-07-01 16:52:59 +00:00
Andrew Reed
3833955a58
Always include longhorn namespace
2021-07-01 15:03:28 +00:00
divolgin
52bbc0f2bf
Don't skip TLS validation on http package's default client
2021-06-30 18:22:15 +00:00
divolgin
53e8ee5495
Add a default redactor for postgres db connection string
2021-06-29 22:38:32 +00:00
Andrew Reed
cb3925a0af
Longhorn replica corruption analyzer
...
This automates the procedure from
https://longhorn.io/docs/1.1.1/advanced-resources/data-recovery/corrupted-replica/
2021-06-22 21:55:12 +00:00
Andrew Reed
e1bccd74b5
Analyze longhorn engine
2021-05-27 21:37:39 +00:00
Andrew Reed
a86f5cae7d
Collect all longhorn pod logs
2021-05-27 20:14:05 +00:00
Andrew Reed
0d5f17de3c
Analyze longhorn replica
2021-05-27 19:44:52 +00:00