Ash
dd48aadf7f
Allow filtering node resources on taint. ( #1840 )
...
* allow filtering node resources on taint
2025-09-09 14:33:51 +01:00
Ethan Mosbaugh
6e62251904
chore(deps): bump the security group with 16 updates ( #1835 )
...
* chore(deps): bump the security group with 16 updates
Bumps the security group with 16 updates:
| Package | From | To |
| --- | --- | --- |
| [github.com/shirou/gopsutil/v4](https://github.com/shirou/gopsutil ) | `4.25.7` | `4.25.8` |
| [github.com/spf13/cobra](https://github.com/spf13/cobra ) | `1.9.1` | `1.10.1` |
| [github.com/spf13/pflag](https://github.com/spf13/pflag ) | `1.0.7` | `1.0.9` |
| [github.com/stretchr/testify](https://github.com/stretchr/testify ) | `1.11.0` | `1.11.1` |
| [go.opentelemetry.io/otel](https://github.com/open-telemetry/opentelemetry-go ) | `1.37.0` | `1.38.0` |
| [go.opentelemetry.io/otel/sdk](https://github.com/open-telemetry/opentelemetry-go ) | `1.37.0` | `1.38.0` |
| [k8s.io/api](https://github.com/kubernetes/api ) | `0.33.4` | `0.34.0` |
| [k8s.io/apiextensions-apiserver](https://github.com/kubernetes/apiextensions-apiserver ) | `0.33.4` | `0.34.0` |
| [k8s.io/apimachinery](https://github.com/kubernetes/apimachinery ) | `0.33.4` | `0.34.0` |
| [k8s.io/apiserver](https://github.com/kubernetes/apiserver ) | `0.33.4` | `0.34.0` |
| [k8s.io/cli-runtime](https://github.com/kubernetes/cli-runtime ) | `0.33.4` | `0.34.0` |
| [k8s.io/client-go](https://github.com/kubernetes/client-go ) | `0.33.4` | `0.34.0` |
| [sigs.k8s.io/controller-runtime](https://github.com/kubernetes-sigs/controller-runtime ) | `0.21.0` | `0.22.0` |
| [k8s.io/kubelet](https://github.com/kubernetes/kubelet ) | `0.33.4` | `0.34.0` |
| [k8s.io/metrics](https://github.com/kubernetes/metrics ) | `0.33.4` | `0.34.0` |
| [k8s.io/utils](https://github.com/kubernetes/utils ) | `0.0.0-20241104100929-3ea5e8cea738` | `0.0.0-20250604170112-4c0f3b243397` |
Updates `github.com/shirou/gopsutil/v4` from 4.25.7 to 4.25.8
- [Release notes](https://github.com/shirou/gopsutil/releases )
- [Commits](https://github.com/shirou/gopsutil/compare/v4.25.7...v4.25.8 )
Updates `github.com/spf13/cobra` from 1.9.1 to 1.10.1
- [Release notes](https://github.com/spf13/cobra/releases )
- [Commits](https://github.com/spf13/cobra/compare/v1.9.1...v1.10.1 )
Updates `github.com/spf13/pflag` from 1.0.7 to 1.0.9
- [Release notes](https://github.com/spf13/pflag/releases )
- [Commits](https://github.com/spf13/pflag/compare/v1.0.7...v1.0.9 )
Updates `github.com/stretchr/testify` from 1.11.0 to 1.11.1
- [Release notes](https://github.com/stretchr/testify/releases )
- [Commits](https://github.com/stretchr/testify/compare/v1.11.0...v1.11.1 )
Updates `go.opentelemetry.io/otel` from 1.37.0 to 1.38.0
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.37.0...v1.38.0 )
Updates `go.opentelemetry.io/otel/sdk` from 1.37.0 to 1.38.0
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.37.0...v1.38.0 )
Updates `k8s.io/api` from 0.33.4 to 0.34.0
- [Commits](https://github.com/kubernetes/api/compare/v0.33.4...v0.34.0 )
Updates `k8s.io/apiextensions-apiserver` from 0.33.4 to 0.34.0
- [Release notes](https://github.com/kubernetes/apiextensions-apiserver/releases )
- [Commits](https://github.com/kubernetes/apiextensions-apiserver/compare/v0.33.4...v0.34.0 )
Updates `k8s.io/apimachinery` from 0.33.4 to 0.34.0
- [Commits](https://github.com/kubernetes/apimachinery/compare/v0.33.4...v0.34.0 )
Updates `k8s.io/apiserver` from 0.33.4 to 0.34.0
- [Commits](https://github.com/kubernetes/apiserver/compare/v0.33.4...v0.34.0 )
Updates `k8s.io/cli-runtime` from 0.33.4 to 0.34.0
- [Commits](https://github.com/kubernetes/cli-runtime/compare/v0.33.4...v0.34.0 )
Updates `k8s.io/client-go` from 0.33.4 to 0.34.0
- [Changelog](https://github.com/kubernetes/client-go/blob/master/CHANGELOG.md )
- [Commits](https://github.com/kubernetes/client-go/compare/v0.33.4...v0.34.0 )
Updates `sigs.k8s.io/controller-runtime` from 0.21.0 to 0.22.0
- [Release notes](https://github.com/kubernetes-sigs/controller-runtime/releases )
- [Changelog](https://github.com/kubernetes-sigs/controller-runtime/blob/main/RELEASE.md )
- [Commits](https://github.com/kubernetes-sigs/controller-runtime/compare/v0.21.0...v0.22.0 )
Updates `k8s.io/kubelet` from 0.33.4 to 0.34.0
- [Commits](https://github.com/kubernetes/kubelet/compare/v0.33.4...v0.34.0 )
Updates `k8s.io/metrics` from 0.33.4 to 0.34.0
- [Commits](https://github.com/kubernetes/metrics/compare/v0.33.4...v0.34.0 )
Updates `k8s.io/utils` from 0.0.0-20241104100929-3ea5e8cea738 to 0.0.0-20250604170112-4c0f3b243397
- [Commits](https://github.com/kubernetes/utils/commits )
---
updated-dependencies:
- dependency-name: github.com/shirou/gopsutil/v4
dependency-version: 4.25.8
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: github.com/spf13/cobra
dependency-version: 1.10.1
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: github.com/spf13/pflag
dependency-version: 1.0.9
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: github.com/stretchr/testify
dependency-version: 1.11.1
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: go.opentelemetry.io/otel
dependency-version: 1.38.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: go.opentelemetry.io/otel/sdk
dependency-version: 1.38.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/api
dependency-version: 0.34.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/apiextensions-apiserver
dependency-version: 0.34.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/apimachinery
dependency-version: 0.34.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/apiserver
dependency-version: 0.34.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/cli-runtime
dependency-version: 0.34.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/client-go
dependency-version: 0.34.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: sigs.k8s.io/controller-runtime
dependency-version: 0.22.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/kubelet
dependency-version: 0.34.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/metrics
dependency-version: 0.34.0
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/utils
dependency-version: 0.0.0-20250604170112-4c0f3b243397
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
...
Signed-off-by: dependabot[bot] <support@github.com >
* f
* f
* f
---------
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-08 09:32:56 -07:00
Diamon Wiggins
2861425293
feat(run-pod): allow image pull retries in run pod collector ( #1811 )
...
* allow image pull retries in run pod collector
* fix formatting
2025-07-25 11:30:14 -04:00
dependabot[bot]
733d401b0f
chore(deps): bump helm.sh/helm/v3 from 3.17.3 to 3.18.4 in the go_modules group ( #1805 )
2025-07-10 09:43:19 -07:00
Diamon Wiggins
989780af69
feat: allow secrets collector to retreive all key data if specified ( #1801 )
...
* allow secrets collector retreival all key data if specified
* add new line
* remove unneeded comments
2025-06-30 10:06:14 -04:00
Luke Amdor
3a457d11fc
feat: add timestamps flag to logs collector ( #1776 )
...
* feat: add timestamps falg to logs collector
Kubernetes logs can be transmitted with the captured timestamps. This is useful for containers that do not log with timestamps. So I'm exposing that as a flag.
* fix: update schemas
2025-04-17 10:51:07 -04:00
Salah Al Saleh
d5a6b19417
Add a host analyzer to check if a subnet contains an IP address ( #1735 )
...
* Add a host collector / analyzer to check if a subnet contains an IP address
2025-02-13 13:16:59 -08:00
Dexter Yan
64ee9e5596
feat(nodeResources): add GPU support ( #1708 )
...
* feat(nodeResources): add GPU support
* add resourceCapacity and sum test
* update with make schemas
* Correct tests names
Signed-off-by: Evans Mungai <evans@replicated.com >
---------
Signed-off-by: Evans Mungai <evans@replicated.com >
Co-authored-by: Evans Mungai <evans@replicated.com >
2025-01-03 15:11:10 +13:00
dependabot[bot]
3f1314c00d
chore(deps): bump the security group across 1 directory with 17 updates ( #1709 )
...
* chore(deps): bump the security group across 1 directory with 17 updates
Bumps the security group with 13 updates in the / directory:
| Package | From | To |
| --- | --- | --- |
| [github.com/cilium/ebpf](https://github.com/cilium/ebpf ) | `0.16.0` | `0.17.1` |
| [github.com/containerd/cgroups/v3](https://github.com/containerd/cgroups ) | `3.0.4` | `3.0.5` |
| [github.com/jackc/pgx/v5](https://github.com/jackc/pgx ) | `5.7.1` | `5.7.2` |
| [github.com/microsoft/go-mssqldb](https://github.com/microsoft/go-mssqldb ) | `1.7.2` | `1.8.0` |
| [go.opentelemetry.io/otel](https://github.com/open-telemetry/opentelemetry-go ) | `1.32.0` | `1.33.0` |
| [go.opentelemetry.io/otel/sdk](https://github.com/open-telemetry/opentelemetry-go ) | `1.32.0` | `1.33.0` |
| [k8s.io/api](https://github.com/kubernetes/api ) | `0.31.3` | `0.32.0` |
| [k8s.io/apiextensions-apiserver](https://github.com/kubernetes/apiextensions-apiserver ) | `0.31.3` | `0.32.0` |
| [k8s.io/cli-runtime](https://github.com/kubernetes/cli-runtime ) | `0.31.3` | `0.32.0` |
| [golang.org/x/net](https://github.com/golang/net ) | `0.31.0` | `0.33.0` |
| [helm.sh/helm/v3](https://github.com/helm/helm ) | `3.16.3` | `3.16.4` |
| [k8s.io/kubelet](https://github.com/kubernetes/kubelet ) | `0.31.3` | `0.32.0` |
| [k8s.io/metrics](https://github.com/kubernetes/metrics ) | `0.31.3` | `0.32.0` |
Updates `github.com/cilium/ebpf` from 0.16.0 to 0.17.1
- [Release notes](https://github.com/cilium/ebpf/releases )
- [Commits](https://github.com/cilium/ebpf/compare/v0.16.0...v0.17.1 )
Updates `github.com/containerd/cgroups/v3` from 3.0.4 to 3.0.5
- [Release notes](https://github.com/containerd/cgroups/releases )
- [Commits](https://github.com/containerd/cgroups/compare/v3.0.4...v3.0.5 )
Updates `github.com/jackc/pgx/v5` from 5.7.1 to 5.7.2
- [Changelog](https://github.com/jackc/pgx/blob/master/CHANGELOG.md )
- [Commits](https://github.com/jackc/pgx/compare/v5.7.1...v5.7.2 )
Updates `github.com/microsoft/go-mssqldb` from 1.7.2 to 1.8.0
- [Release notes](https://github.com/microsoft/go-mssqldb/releases )
- [Changelog](https://github.com/microsoft/go-mssqldb/blob/main/CHANGELOG.md )
- [Commits](https://github.com/microsoft/go-mssqldb/compare/v1.7.2...v1.8.0 )
Updates `go.opentelemetry.io/otel` from 1.32.0 to 1.33.0
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.32.0...v1.33.0 )
Updates `go.opentelemetry.io/otel/sdk` from 1.32.0 to 1.33.0
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.32.0...v1.33.0 )
Updates `k8s.io/api` from 0.31.3 to 0.32.0
- [Commits](https://github.com/kubernetes/api/compare/v0.31.3...v0.32.0 )
Updates `k8s.io/apiextensions-apiserver` from 0.31.3 to 0.32.0
- [Release notes](https://github.com/kubernetes/apiextensions-apiserver/releases )
- [Commits](https://github.com/kubernetes/apiextensions-apiserver/compare/v0.31.3...v0.32.0 )
Updates `k8s.io/apimachinery` from 0.31.3 to 0.32.0
- [Commits](https://github.com/kubernetes/apimachinery/compare/v0.31.3...v0.32.0 )
Updates `k8s.io/apiserver` from 0.31.3 to 0.32.0
- [Commits](https://github.com/kubernetes/apiserver/compare/v0.31.3...v0.32.0 )
Updates `k8s.io/cli-runtime` from 0.31.3 to 0.32.0
- [Commits](https://github.com/kubernetes/cli-runtime/compare/v0.31.3...v0.32.0 )
Updates `k8s.io/client-go` from 0.31.3 to 0.32.0
- [Changelog](https://github.com/kubernetes/client-go/blob/master/CHANGELOG.md )
- [Commits](https://github.com/kubernetes/client-go/compare/v0.31.3...v0.32.0 )
Updates `golang.org/x/net` from 0.31.0 to 0.33.0
- [Commits](https://github.com/golang/net/compare/v0.31.0...v0.33.0 )
Updates `helm.sh/helm/v3` from 3.16.3 to 3.16.4
- [Release notes](https://github.com/helm/helm/releases )
- [Commits](https://github.com/helm/helm/compare/v3.16.3...v3.16.4 )
Updates `k8s.io/kubelet` from 0.31.3 to 0.32.0
- [Commits](https://github.com/kubernetes/kubelet/compare/v0.31.3...v0.32.0 )
Updates `k8s.io/metrics` from 0.31.3 to 0.32.0
- [Commits](https://github.com/kubernetes/metrics/compare/v0.31.3...v0.32.0 )
Updates `k8s.io/utils` from 0.0.0-20240711033017-18e509b52bc8 to 0.0.0-20241104100929-3ea5e8cea738
- [Commits](https://github.com/kubernetes/utils/commits )
---
updated-dependencies:
- dependency-name: github.com/cilium/ebpf
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: github.com/containerd/cgroups/v3
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: github.com/jackc/pgx/v5
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: github.com/microsoft/go-mssqldb
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: go.opentelemetry.io/otel
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: go.opentelemetry.io/otel/sdk
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/api
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/apiextensions-apiserver
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/apimachinery
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/apiserver
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/cli-runtime
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/client-go
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: golang.org/x/net
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: helm.sh/helm/v3
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: k8s.io/kubelet
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/metrics
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/utils
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
...
Signed-off-by: dependabot[bot] <support@github.com >
* upgrade go version 1.23.0
* make schemas
---------
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Dexter Yan <yanshaocong@gmail.com >
2024-12-30 17:25:11 +13:00
Gerard Nguyen
a6fbf144b8
feat: container statuses analyzer ( #1698 )
...
* new schema for analyzer ClusterContainerStatues
2024-12-04 10:36:23 +11:00
Miguel Varela Ramos
8e2647077d
feat: add support for matchExpressions when filtering for nodes ( #1697 )
...
* feat: add support for matchExpressions when filtering for nodes
* fix: make generate
2024-11-30 23:15:26 +11:00
João Antunes
197f6de425
feat(host_analyzer): add host sysctl analyzer ( #1681 )
...
* feat(host_analyzer): add host sysctl analyzer
* chore: add e2e tests to support bundle collection
* chore: missing spec e2e test update
* chore: cleanup remote collector and use parse operator
* chore: update schemas
2024-11-08 18:55:24 +00:00
João Antunes
77c9968ff6
feat(host_sysctl): add host sysctl collector ( #1676 )
...
* feat(host_sysctl): add host sysctl collector
* chore: add examples
* Update pkg/collect/host_sysctl.go
Co-authored-by: Evans Mungai <evans@replicated.com >
* chore: use sysctl package vs exec calls
* chore: make linter happy
* chore: make schemas
* chore: go back to sysctl exec
* chore: make linter happy
---------
Co-authored-by: Evans Mungai <evans@replicated.com >
2024-11-07 18:18:11 +00:00
Ricardo Maraschini
e272683bce
feat: implement collector and analyser for network namespace connectivity ( #1670 )
...
* feat: implement collector and analyser for network namespace connectivity
checks if two network namespaces can talk to each other on udp and tcp.
its usage is as follows:
```yaml
apiVersion: troubleshoot.sh/v1beta2
kind: SupportBundle
metadata:
name: test
spec:
hostCollectors:
- networkNamespaceConnectivity:
collectorName: check-network-connectivity
fromCIDR: 10.0.0.0/24
toCIDR: 10.0.1.0/24
hostAnalyzers:
- networkNamespaceConnectivity:
collectorName: check-network-connectivity
outcomes:
- pass:
message: "Communication between 10.0.0.0/24 and 10.0.1.0/24 is working"
- fail:
message: "Communication between 10.0.0.0/24 and 10.0.1.0/24 isn't working"
```
if this fails then you may need to enable `forwarding` with:
```bash
sysctl -w net.ipv4.ip_forward=1
```
if it still fails then you may need to configure firewalld to allow the
traffic or simply disable it for sake of testing.
* chore: rebuild schemas
* chore: remove unused property
* chore: disable namespaces for other platforms
* chore: make sure we timeout temporary servers
* feat: analyzer now supports multi-node collection
* feat: check both udp and tcp even on failure
check both protocols even if one fails. this pr commit also introduces a
timeout that can be set by the user.
* feat: add templating to the failure outcome
allow users to dump the errors found during the analysis.
* chore: addressing pr comments
* feat: delete interface pair before namespace
even though the interface pair is deleted everyttime we delete the
namespace on my tests we better delete it before we delete the
namespace.
this comes out of a review comment where some people seem to still be
able to see the interface pair even after the namespace is deleted.
i.e. better safe than sorry.
* chore: fix typo on comment
2024-11-06 11:30:13 +01:00
ada mancini
eacff7112f
support adding a CA cert to http collector ( #1624 )
...
* add a TLS parameter for cacert
* pass a ca cert into http request
* test preflight
* make schemas
* log extra information from http request
* pass a proxy into the collector spec
* hitting a segfault; breakpoint
* accept a dir, file, or a string-literal as CA
* move tls params into get, put, post methods
* test for cert untrusted response
* make generate
* make schemas
* more test cases
* make schemas
* dont include system certs
* make generate && make schemas
* resolve gosec G402 warning
* remove old check for system certs
* ignore errcheck "return value not checked" linter errors
2024-10-23 18:15:08 -04:00
Dexter Yan
0d21eed5f8
fix(support): add missing host collectors for ParseSupportBundle ( #1656 )
...
* fix(support): add missing host collectors for ParseSupportBundle
* update
* add host ananlyers
2024-10-22 13:07:44 +13:00
Diamon Wiggins
8105fa00e9
Refactor Remote Host Collection ( #1633 )
...
* refactor remote collectors
* add remotecollect params struct
* remove commented checkrbac function
* removed unused function
* add temp comments
* refactor to not require RemoteCollect method per collector
* removed unneeded param
* removed unneeded param
* more refactor
* more refactor
* remove unneeded function
* remove debug print
* fix analyzer results
* move rbac to separate file
* be more specific with rbac function name
* fix imports
* fix node list file
* make k8s rest client config consistent with in cluster collection
* add ctx and otel tracing
* add test for allCollectedData
* move runHostCollectorsInPod to spec instead of metadata
* make generate
* fix broken references to supportbundle metadata
* add e2e tests
* update loader tests
* fix tests
* fix hostos remote collector spec
* update remoteHostCollectrs.yaml
---------
Co-authored-by: Dexter Yan <yanshaocong@gmail.com >
2024-10-09 18:38:49 +13:00
Ash
f58f02560f
Allow goldpinger / goldpinger util images to be set in collector spec ( #1635 )
...
* Add image parameter to the goldpinger collector
* Pass image directly as a function arg
Also allow util image to be set in spec
* Remove pointless util image override
* Update pkg/collect/goldpinger.go
Co-authored-by: Evans Mungai <evans@replicated.com >
* Simplify image override
---------
Co-authored-by: Evans Mungai <evans@replicated.com >
2024-10-03 13:18:36 +01:00
Evans Mungai
83f02f4705
feat: Install goldpinger daemonset if one does not exist when running goldpinger collector ( #1619 )
...
* feat: Install goldpinger if one does not exist when running goldpinger collector
- Deploy golpinger daemonset if one is not detected in the cluster
- Clean up all deployed resources
- Add delay to allow users to wait for goldpinger to perform checks
Signed-off-by: Evans Mungai <evans@replicated.com >
* Add missing test data file
Signed-off-by: Evans Mungai <evans@replicated.com >
* Better naming of create resource functions
Signed-off-by: Evans Mungai <evans@replicated.com >
---------
Signed-off-by: Evans Mungai <evans@replicated.com >
2024-09-24 17:17:14 +01:00
Dexter Yan
e97b9613a5
feat(support-bundle): add runHostCollectorsInPod in spec ( #1608 )
2024-09-20 11:57:58 -05:00
Gerard Nguyen
8823f7d99e
feat: host collector for DNS ( #1617 )
...
* add struct for host dns collector
* add miekg/dns
* add more logs
* nit
* new field names
* use Hostnames instead of Names
* misc update
* make schemas
* no error when there is no resolv.conf
* query all searches
* add summary.json file
* merge summary into result file
* query AAAA and CNAME as well
* update schema for hostnames to be required
2024-09-20 08:13:57 +10:00
Gerard Nguyen
7484b10914
feat: [sc-110727] troubleshoot: collector/analyzer for wildcard dns ( #1606 )
...
* store DNS collector in JSON output for analyze later
* fix incorrect path
* configurable dns image
* make non resolvable domain configurable
* nit update address field
* * update dns util image
* add unit test
2024-09-11 14:35:30 +10:00
dependabot[bot]
bd6c5b976d
chore(deps): bump the security group across 1 directory with 16 updates ( #1604 )
...
* chore(deps): bump the security group across 1 directory with 16 updates
Bumps the security group with 12 updates in the / directory:
| Package | From | To |
| --- | --- | --- |
| [github.com/containers/image/v5](https://github.com/containers/image ) | `5.32.1` | `5.32.2` |
| [github.com/hashicorp/go-getter](https://github.com/hashicorp/go-getter ) | `1.7.5` | `1.7.6` |
| [github.com/vmware-tanzu/velero](https://github.com/vmware-tanzu/velero ) | `1.14.0` | `1.14.1` |
| [go.opentelemetry.io/otel](https://github.com/open-telemetry/opentelemetry-go ) | `1.28.0` | `1.29.0` |
| [go.opentelemetry.io/otel/sdk](https://github.com/open-telemetry/opentelemetry-go ) | `1.28.0` | `1.29.0` |
| [k8s.io/api](https://github.com/kubernetes/api ) | `0.30.3` | `0.31.0` |
| [k8s.io/apiextensions-apiserver](https://github.com/kubernetes/apiextensions-apiserver ) | `0.30.3` | `0.31.0` |
| [k8s.io/cli-runtime](https://github.com/kubernetes/cli-runtime ) | `0.30.3` | `0.31.0` |
| [sigs.k8s.io/controller-runtime](https://github.com/kubernetes-sigs/controller-runtime ) | `0.18.5` | `0.19.0` |
| [helm.sh/helm/v3](https://github.com/helm/helm ) | `3.15.3` | `3.15.4` |
| [k8s.io/kubelet](https://github.com/kubernetes/kubelet ) | `0.30.3` | `0.31.0` |
| [k8s.io/metrics](https://github.com/kubernetes/metrics ) | `0.30.3` | `0.31.0` |
Updates `github.com/containers/image/v5` from 5.32.1 to 5.32.2
- [Release notes](https://github.com/containers/image/releases )
- [Commits](https://github.com/containers/image/compare/v5.32.1...v5.32.2 )
Updates `github.com/hashicorp/go-getter` from 1.7.5 to 1.7.6
- [Release notes](https://github.com/hashicorp/go-getter/releases )
- [Changelog](https://github.com/hashicorp/go-getter/blob/main/.goreleaser.yml )
- [Commits](https://github.com/hashicorp/go-getter/compare/v1.7.5...v1.7.6 )
Updates `github.com/vmware-tanzu/velero` from 1.14.0 to 1.14.1
- [Release notes](https://github.com/vmware-tanzu/velero/releases )
- [Changelog](https://github.com/vmware-tanzu/velero/blob/main/CHANGELOG.md )
- [Commits](https://github.com/vmware-tanzu/velero/compare/v1.14.0...v1.14.1 )
Updates `go.opentelemetry.io/otel` from 1.28.0 to 1.29.0
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.28.0...v1.29.0 )
Updates `go.opentelemetry.io/otel/sdk` from 1.28.0 to 1.29.0
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.28.0...v1.29.0 )
Updates `k8s.io/api` from 0.30.3 to 0.31.0
- [Commits](https://github.com/kubernetes/api/compare/v0.30.3...v0.31.0 )
Updates `k8s.io/apiextensions-apiserver` from 0.30.3 to 0.31.0
- [Release notes](https://github.com/kubernetes/apiextensions-apiserver/releases )
- [Commits](https://github.com/kubernetes/apiextensions-apiserver/compare/v0.30.3...v0.31.0 )
Updates `k8s.io/apimachinery` from 0.30.3 to 0.31.0
- [Commits](https://github.com/kubernetes/apimachinery/compare/v0.30.3...v0.31.0 )
Updates `k8s.io/apiserver` from 0.30.3 to 0.31.0
- [Commits](https://github.com/kubernetes/apiserver/compare/v0.30.3...v0.31.0 )
Updates `k8s.io/cli-runtime` from 0.30.3 to 0.31.0
- [Commits](https://github.com/kubernetes/cli-runtime/compare/v0.30.3...v0.31.0 )
Updates `k8s.io/client-go` from 0.30.3 to 0.31.0
- [Changelog](https://github.com/kubernetes/client-go/blob/master/CHANGELOG.md )
- [Commits](https://github.com/kubernetes/client-go/compare/v0.30.3...v0.31.0 )
Updates `sigs.k8s.io/controller-runtime` from 0.18.5 to 0.19.0
- [Release notes](https://github.com/kubernetes-sigs/controller-runtime/releases )
- [Changelog](https://github.com/kubernetes-sigs/controller-runtime/blob/main/RELEASE.md )
- [Commits](https://github.com/kubernetes-sigs/controller-runtime/compare/v0.18.5...v0.19.0 )
Updates `helm.sh/helm/v3` from 3.15.3 to 3.15.4
- [Release notes](https://github.com/helm/helm/releases )
- [Commits](https://github.com/helm/helm/compare/v3.15.3...v3.15.4 )
Updates `k8s.io/kubelet` from 0.30.3 to 0.31.0
- [Commits](https://github.com/kubernetes/kubelet/compare/v0.30.3...v0.31.0 )
Updates `k8s.io/metrics` from 0.30.3 to 0.31.0
- [Commits](https://github.com/kubernetes/metrics/compare/v0.30.3...v0.31.0 )
Updates `k8s.io/utils` from 0.0.0-20230726121419-3b25d923346b to 0.0.0-20240711033017-18e509b52bc8
- [Commits](https://github.com/kubernetes/utils/commits )
---
updated-dependencies:
- dependency-name: github.com/containers/image/v5
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: github.com/hashicorp/go-getter
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: github.com/vmware-tanzu/velero
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: go.opentelemetry.io/otel
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: go.opentelemetry.io/otel/sdk
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/api
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/apiextensions-apiserver
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/apimachinery
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/apiserver
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/cli-runtime
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/client-go
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: sigs.k8s.io/controller-runtime
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: helm.sh/helm/v3
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: k8s.io/kubelet
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/metrics
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/utils
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
...
Signed-off-by: dependabot[bot] <support@github.com >
* Run go mod tidy
Signed-off-by: Evans Mungai <evans@replicated.com >
* Update schemas and thier generator tools
* controller-gen to v0.16.2
* client-gen to v0.28.13
Signed-off-by: Evans Mungai <evans@replicated.com >
---------
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: Evans Mungai <evans@replicated.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Evans Mungai <evans@replicated.com >
2024-09-04 15:54:07 +12:00
Dexter Yan
0a2c9c74ab
feat(analyzer): allow templating for Node Resources Analyzer ( #1605 )
...
* feat(analyzer): allow templating for Node Resources Analyzer
2024-09-02 09:42:40 +12:00
Ethan Mosbaugh
1b1efa133e
feat(fio): add option to disable runtime ( #1601 )
2024-08-22 16:47:08 -07:00
Gerard Nguyen
47656a8e6f
feat: etcd collector ( #1589 )
...
* new schema for etcd collector
* add placeholder
* wip
* get supported distribution
* add exec implementation
* wait for etcd pod to be ready
* misc
* update k0s etcd certs path
* fix unit tests
* address code reviews
* update from code review
* add etcdctl version
2024-08-13 08:42:26 +10:00
Gerard Nguyen
60263caf78
feat: [sc-108732] Can't add annotations in pods executed with the runPod collector ( #1590 )
...
add new field annotations for run pod collector
2024-08-08 10:13:36 +10:00
Gerard Nguyen
a57171a918
feat: [sc-108689] troubleshoot: journald collector ( #1586 )
...
* add schema for Journald Host Collector
* implement journald host collector
* update host collector
* add --no-pager
2024-07-29 09:44:43 +10:00
Evans Mungai
01d5804977
feat: cgroups host collector ( #1581 )
...
Linux control groups host collector that detects whether the specified mountPoint is a cgroup filesystem and what version it is. The collector also collects information of the configured cgroup controllers.
Signed-off-by: Evans Mungai <evans@replicated.com >
2024-07-24 16:46:04 +01:00
Evans Mungai
1444c01725
feat: json compare host analyser ( #1582 )
...
* feat: json compore host analyser
Signed-off-by: Evans Mungai <evans@replicated.com >
* Add missing json compare host analyser file
Signed-off-by: Evans Mungai <evans@replicated.com >
* Generate schemas
Signed-off-by: Evans Mungai <evans@replicated.com >
* Fix failing tests
Signed-off-by: Evans Mungai <evans@replicated.com >
* Ensure json compare analyser always has a title
Signed-off-by: Evans Mungai <evans@replicated.com >
---------
Signed-off-by: Evans Mungai <evans@replicated.com >
2024-07-24 14:27:20 +01:00
Gerard Nguyen
4e999d6bfb
fix: [sc-106256] Add missing uri field to troubleshoot.sh types ( #1574 )
...
add uri field for top level type
2024-07-15 13:07:04 +10:00
Gerard Nguyen
edfa01c5c4
feat: [sc-106625] http analyzer for in-cluster ( #1566 )
...
* http analzyer for in-cluster
* make check-schemas
2024-06-19 12:14:37 +10:00
Gerard Nguyen
80e5fac07c
feat: New host collector and analyzer for Kernel Configs ( #1546 )
...
* new struct and update schemas
* implement Collect function
* add kernel config to collector struct
* generate kernel config analyzer schema
* implement kernel config analyzer
* fail on no match in pass outcome
* run make check-schemas
* fix failed unit test
* update from code review
* add selectedConfigs field
* run make check-schemas
2024-05-27 09:55:39 +10:00
dependabot[bot]
bd7bae6302
chore(deps): bump the security group with 9 updates ( #1549 )
...
* ---
updated-dependencies:
- dependency-name: k8s.io/api
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: k8s.io/apiextensions-apiserver
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: k8s.io/apimachinery
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: k8s.io/apiserver
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: k8s.io/cli-runtime
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: k8s.io/client-go
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: helm.sh/helm/v3
dependency-type: direct:production
update-type: version-update:semver-minor
dependency-group: security
- dependency-name: k8s.io/kubelet
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
- dependency-name: k8s.io/metrics
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: security
...
Signed-off-by: dependabot[bot] <support@github.com >
* Regenerate CR schemas
---------
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Evans Mungai <evans@replicated.com >
2024-05-21 12:07:50 +00:00
Gerard Nguyen
6b368f2221
feat: [sc-103754] Be able to detect search domain misconfiguration #1391 ( #1534 )
...
* new collector dns
* implement DNS collector
* add dns service and endpoints check
* add nil check on retrieve endpoints
2024-05-01 07:04:20 +10:00
Andrew Lavery
f18b5d754e
update k8s imports to v0.30.0 and address changed function signature ( #1528 )
...
* update k8s imports to v0.30.0 and address changed function signature
* update schemas
2024-04-24 23:05:37 +09:00
Evans Mungai
db871e6889
feat: node metrics analyser ( #1520 )
...
* feat: node metrics analyser
The analyser only checks PVC usage at the moment. More analysers
can be added on a need to have basis
* Add tests
* Fix flaky test by waiting for goldpinger pods to start
* Fix how outcomes get checked
* Fix catch all outcome condition
* Fix test
* feat: node metrics analyser
The analyser only checks PVC usage at the moment. More analysers
can be added on a need to have basis
* Add tests
* Fix flaky test by waiting for goldpinger pods to start
* Fix how outcomes get checked
* Fix catch all outcome condition
* Fix test
* Regenerate schemas
* Fix failing test
---------
Co-authored-by: Dexter Yan <yanshaocong@gmail.com >
2024-04-09 12:14:10 +01:00
Evans Mungai
123d17ab4a
feat: node metrics collector ( #1516 )
...
* feat: node metrics collector
A collector to collect node metrics served by the API server as
per the documented API https://kubernetes.io/docs/reference/instrumentation/node-metrics/
* Update CRD schemas
* Add tests
* Remove clean from build target
* Update comments
* Commit missing tests
* Remove unnecessary log in tests
2024-04-02 14:59:55 +01:00
Evans Mungai
ca5ff88964
chore: Update code generator and controller tools ( #1515 )
2024-03-28 11:52:15 +00:00
Gerard Nguyen
742e92f1ee
New Event Analyzer ( #1474 )
...
* add new Event analyzer
2024-03-07 08:31:45 +13:00
Ethan Mosbaugh
e24ca642aa
feat: sonobuoy collector ( #1469 )
2024-02-16 06:56:15 -08:00
Gerard Nguyen
5daf6d6c89
Feat: new collector run-daemonset ( #1460 )
...
* feat: new collector run-daemonset
2024-02-09 08:16:32 +13:00
Gerard Nguyen
39b371991e
feat: add timeout to run host collector ( #1435 )
...
* feat: add timeout to run host collector
* return error on invalid timeout
* return -1 on context deadline exceeded
2024-01-31 12:46:01 +00:00
Akash Shrivastava
361e12e691
feat: [ISSUE-1401]: Added helm get values option in Helm collector ( #1402 )
...
* feat: [ISSUE-1401]: Added helm get values option in Helm collector
Signed-off-by: Akash Shrivastava <akash.shrivastava@harness.io >
* Made changes in error handling
Signed-off-by: Akash Shrivastava <akash.shrivastava@harness.io >
* feat: [ISSUE-1401]: Added test cases and fixes
Signed-off-by: Akash Shrivastava <akash.shrivastava@harness.io >
* fixed test case
Signed-off-by: Akash Shrivastava <akash.shrivastava@harness.io >
---------
Signed-off-by: Akash Shrivastava <akash.shrivastava@harness.io >
2024-01-16 19:12:43 +00:00
Xav Paice
e542f4fd0a
bump k8s.io packages to v0.29.0 ( #1419 )
...
* bump k8s.io packages to v0.29.0
* update Go to 1.21
* update schemas
2024-01-08 17:24:41 +00:00
Evans Mungai
53113c0170
feat: goldpinger collector and analyser ( #1398 )
...
* feat: goldpinger analyser
Analyser to generate a report from goldpinger results
* Add goldpinger testdata
* Goldpinger collector
* Improvements after running tests
* More minor updates after further testing
* Better error message if a container fails to start
* A few more updates
* Add goldpinger e2e test
* Update schemas
* Clean up help installs in e2e tests
* Add resource limits to goldpinger pods
* Some minor improvements
* Some more changes noted when writing docs
* Update schemas
* A few more updates when testing with kURL
* Log goldpinger tests
* Tests before exit code
2023-12-12 11:02:41 +00:00
Weiyanli Chen(York)
bc4856869e
fix: missing omitempty on 2 of the new fields ( #1389 )
...
* fix: missing omitempty on 2 of the new fields
* fix: Rename TS_WORKSPACE_DIR to TS_OUTPUT_DIR
---------
Co-authored-by: Evans Mungai <evans@replicated.com >
2023-11-09 13:25:11 +00:00
Weiyanli Chen(York)
f6373f3e36
feat: save host run file output ( #1376 )
...
* feat: save cmd run output
* chore: schema changes
* chore: example hostCollector
* chore: add log messages to key actions
* fix: correctly inherit all parent env by default
* chore: do not save input file
the user invokes the input already got the input but those content could be sensitive to another user who received this bundle
* test: unit test for host run
* revert: "chore: do not save input file"
This reverts commit 6af77ad1ce .
that commit is wrong
* chore: fix log msg and example yaml
* Ensure child cmd runs in its own working dir
* Check filename for slashes not content
* Update logging
* Add using relative path files as commands
---------
Co-authored-by: Evans Mungai <evans@replicated.com >
2023-11-08 13:49:46 +00:00
Archit Sharma
7038da85b1
Velero analyzer ( #1366 )
...
* feat: add velero analyzer (#806 )
* updated schema
* analyzer without collector
* tests
* covers deprecated Restic repository type
* velero version from deployment image to check deprecated type
* read for both velero pod kinds (velero*, node-agent*)
---------
Signed-off-by: Archit Sharma <archit@pm.me >
2023-11-03 18:41:17 +05:30
Evans Mungai
514c86d891
fix: use duration strings for http collector timeout ( #1338 )
...
* fix: use duration strings for http collector timeout
This follows the same format that all other collectors use.
* Update from PR comment
2023-09-13 19:18:03 -04:00