troubleshoot

mirror of https://github.com/replicatedhq/troubleshoot.git synced 2026-02-14 18:29:53 +00:00

Author	SHA1	Message	Date
Noah Campbell	2cebe3d8f6	Support bundle upload functionality works for apps installed via Helm (#1904 ) * Gets licenseid and app slug from cluster secrets * Update upload.go * Update cluster_resources.go	2025-10-15 13:12:36 -05:00
Noah Campbell	6ffc83dc43	Updated linter (#1903 ) * moved linter to new branch * reads each yaml file separately when given multiple * split monolith lint file into more reasonably sized files * github action linter fix * lint error codes follow the rest of the codebase's standard	2025-10-14 16:25:50 -05:00
Noah Campbell	5aa088b3b6	Revert unintended commits on main	2025-10-13 15:23:31 -05:00
Noah Campbell	3f5ab9c721	doesnt harcode apiVersion line when looking and figures out which apiVersion to give if none is there	2025-10-13 15:18:43 -05:00
Noah Campbell	a5f4afb488	added lint subcommand	2025-10-13 15:18:04 -05:00
Noah Campbell	deab5e49a5	readded auto upload flag for support bundle (#1893 ) * readded auto upload flag for support bundle * updated auto upload message	2025-10-10 10:12:10 -07:00
Benjamin Yang	6c5c310eb3	Fix ollama clean (#1885 ) * fixing .json format * feat: aggregate files by resource type in Ollama agent for accurate cluster-wide analysis - Group pod/deployment/event/node files by type before analysis - Create cluster-wide summaries instead of per-file analysis - Add context about empty namespaces being normal in Kubernetes - Fixes false positives where empty namespaces were flagged as errors - Improves accuracy from ~60% to ~95% - Reduces analyzers from 21 to 12 (more efficient) - Speeds up analysis by ~30 seconds - Add cmd/analyze/main.go for building standalone analyze binary * feat: aggregate files by resource type in Ollama agent for accurate cluster-wide analysis - Group pod/deployment/event/node files by type before analysis - Create cluster-wide summaries instead of per-file analysis - Add context about empty namespaces being normal in Kubernetes - Fixes false positives where empty namespaces were flagged as errors - Improves accuracy from ~60% to ~95% - Reduces analyzers from 21 to 12 (more efficient) - Speeds up analysis by ~30 seconds - Fix event limiting condition to track included events separately - Update test to handle both aggregated and single-file analyzers - Add cmd/analyze/main.go for building standalone analyze binary * fixing error * fixing bugbot * fix bugbot errors * fix bugbot errors * bugbot errors * fixing more bugbot errors * fix: initialize namespace stats only after validating resource type - Move namespace initialization to after kind validation - Initialize for valid PodList/DeploymentList when items array exists - Initialize for valid single Pod/Deployment when kind matches - Skip initialization entirely for malformed/invalid JSON - Prevents reporting namespaces with invalid resource files * refactor: use if-else structure for clearer control flow - Restructure pod/deployment aggregation to use explicit if-else - Makes it clear that lists are processed in if block, singles in else - Functionally identical but clearer for static analysis - Resolves bugbot false positives about unreachable code	2025-10-08 16:57:00 -05:00
Marc Campbell	35759c47af	V1beta3 (#1873 ) * Change workflow branch from 'main' to 'v1beta3' * Auto updater (#1849) * added auto updater * updated docs * commit to trigger actions * Auto-collectors: foundational discovery, image metadata, CLI integrat… (#1845) * Auto-collectors: foundational discovery, image metadata, CLI integration; reset PRD markers * Address PR review feedback - Implement missing namespace exclude patterns functionality - Fix image facts collector to use empty Data field instead of static string - Correct APIVersion to use troubleshoot.sh/v1beta2 consistently * Fix bug bot issues: API parsing, EOF error, and API group corrections - Fix RBAC API parsing errors in rbac_checker.go (getAPIGroup/getAPIVersion functions) - Fix FakeReader EOF error to use standard io.EOF instead of custom error - Fix incorrect API group from troubleshoot.sh to troubleshoot.replicated.com in run.go These changes address the issues identified by the bug bot and ensure proper interface compliance and consistent API group usage. * Fix multiple bug bot issues - Fix RBAC API parsing errors in rbac_checker.go (getAPIGroup/getAPIVersion functions) - Fix FakeReader EOF error to use standard io.EOF instead of custom error - Fix incorrect API group from troubleshoot.sh to troubleshoot.replicated.com in run.go - Fix image facts collector Data field to contain structured JSON instead of static strings These changes address all issues identified by the bug bot and ensure proper interface compliance, consistent API usage, and meaningful data fields. * Update auto_discovery.go * Fix TODO comments in Auto-collector section Fixed 3 of 4 TODOs as requested in PR review: 1. pkg/collect/images/registry_client.go (line 46): - Implement custom CA certificate loading - Add x509 import and certificate parsing logic - Enables image collection from private registries with custom CAs 2. cmd/troubleshoot/cli/diff.go (line 209): - Implement bundle file count functionality - Add tar/gzip imports and getFileCountFromBundle() function - Properly counts files in support bundle archives (.gz/.tgz) 3. cmd/troubleshoot/cli/run.go (line 338): - Replace TODO with clarifying comment about RemoteCollectors usage - Confirmed RemoteCollectors are still actively used in preflights The 4th TODO (diff.go line 196) is left as-is since it's explicitly marked as Phase 4 future work (Support Bundle Differencing implementation). Addresses PR review feedback about unimplemented TODO comments. --------- Co-authored-by: Benjamin Yang <benjaminyang@Benjamins-MacBook-Pro.local> * resetting make targets and github workflows to support v1beta3 releas… (#1853) * resetting make targets and github workflows to support v1beta3 release later * removing generate * remove * removing * removing * Support bundle diff (#1855) implemented support bundle diff command * Preflight docs and template subcommands (#1847) * Added docs and template subcommands with test files * uses helm templating preflight yaml files * merge doc requirements for multiple inputs * Helm aware rendering and markdown output * v1beta3 yaml structure better mirrors beta2 * Update sample-preflight-templated.yaml * Added docs and template subcommands with test files * uses helm templating preflight yaml files * merge doc requirements for multiple inputs * Helm aware rendering and markdown output * v1beta3 yaml structure better mirrors beta2 * Update sample-preflight-templated.yaml * Added/updated documentation on subcommands * Update docs.go * commit to trigger actions * Updated yaml spec (#1851) * v1beta3 spec can be read by preflight * added test files for ease of testing * updated v1beta3 guide doc and added tests * fixed not removing tmp files from v1beta3 processing * created v1beta2 to v1beta3 converter * Updated yaml spec (#1863) * v1beta3 spec can be read by preflight * added test files for ease of testing * v1beta3 renderer fixes * fixed gitignore issue * Auto support bundle upload (#1860) * basic auto uploading support bundles * added upload command * added default vendor endpoint * added auth system from replicated cli * fixed case sensitivity issue in YAML parsing * support bundle uploads for end customers * app slug flag and detection without licenseID * moved v1beta3 examples to proper directory * does not auto update for package managers (#1850) * V1beta3 cleanup (#1869) * moving some files around * more cleanup * removing more unused * update ci for v1beta3 (#1870) * fmt: * removing unused examples * add a v1beta3 fixture * removing coverage reporting * adding brew (#1872) * Fixing testing errors (#1871) fix: resolve failing unit tests and diff consistency in v1beta3 - Fix readLinesFromReader to return lines WITH newlines (like difflib.SplitLines) - Update test expectations to match correct function behavior with newlines - This ensures consistency between streaming and non-streaming diff paths - Fix timeout test by changing from 10ms to 500ms to eliminate flaky failures Fixes TestReadLinesFromReader and Test_loadSupportBundleSpecsFromURIs_TimeoutError Resolves diff output inconsistency between code paths * Fix/exec textanalyze path clean (#1865) * created roadmap and yaml claude agent * Update roadmap.md * Fix textAnalyze analyzer to auto-match exec collector nested paths - Auto-detect exec output files (-stdout.txt, -stderr.txt, -errors.json) - Convert simple filenames to wildcard patterns automatically - Preserve existing wildcard patterns - Fixes 'No matching file' errors for exec + textAnalyze workflows --------- Co-authored-by: Noah Campbell <noah.edward.campbell@gmail.com> bump goreleaser to v2 * remove collect binary and risc binary * remove this check * add debug logging * larger runner for release * dropping goreleaser * fix syntax * fix syntax * goreleaser * larger * prerelease auto and more * publish to directory: * some more goreleaser/homebrew stuffs * removing risc * bump example * Advanced analysis clean (#1868) * created roadmap and yaml claude agent * Update roadmap.md * feat: Clean advanced analysis implementation - core agents, engine, artifacts * Remove unrelated files - keep only advanced analysis implementation * fix: Fix goroutine leak in hosted agent rate limiter - Added stop channel and stopped flag to RateLimiter struct - Modified replenishTokens to listen for stop signal and exit cleanly - Added Stop() method to gracefully shutdown rate limiter - Added Stop() method to HostedAgent to cleanup rate limiter on shutdown Fixes cursor bot issue: Rate Limiter Goroutine Leak * fix: Fix analyzer config and model validation bugs Bug 1: Analyzer Config Missing File Path - Added filePath to DeploymentStatus analyzer config in convertAnalyzerToSpec - Sets namespace-specific path (cluster-resources/deployments/{namespace}.json) - Falls back to generic path (cluster-resources/deployments.json) if no namespace - Fixes LocalAgent.analyzeDeploymentStatus backward compatibility Bug 2: HealthCheck Fails Model Validation - Changed Ollama model validation from prefix match to exact match - Prevents false positives where llama2:13b would match request for llama2:7b - Ensures agent only reports healthy when exact model is available Both fixes address cursor bot reported issues and maintain backward compatibility. * fixing lint errors * fixing lint errors * adding CLI flags * fix: resolve linting errors for CI - Remove unnecessary nil check in host_kernel_configs.go (len() for nil slices is zero) - Remove unnecessary fmt.Sprintf() calls in ceph.go for static strings - Apply go fmt formatting fixes Fixes failing lint CI check * fix: resolve CI failures in build-test workflow and Ollama tests 1. Fix GitHub Actions workflow logic error: - Replace problematic contains() expression with explicit job result checks - Properly handle failure and cancelled states for each job - Prevents false positive failures in success summary job 2. Fix Ollama agent parseLLMResponse panics: - Add proper error handling for malformed JSON in LLM responses - Return error when JSON is found but invalid (instead of silent fallback) - Add error when no meaningful content can be parsed from response - Prevents nil pointer dereference in test assertions Fixes failing build-test/success and build-test/test CI checks * fix: resolve all CI failures and cursor bot issues 1. Fix disable-ollama flag logic bug: - Remove disable-ollama from advanced analysis trigger condition - Prevents unintended advanced analysis mode when no agents registered - Allows proper fallback to legacy analysis 2. Fix diff test consistency: - Update test expectations to match function behavior (lines with newlines) - Ensures consistency between streaming and non-streaming diff paths 3. Fix Ollama agent error handling: - Add proper error return for malformed JSON in LLM responses - Add meaningful content validation for markdown parsing - Prevents nil pointer panics in test assertions 4. Fix analysis engine mock agent: - Mock agent now processes and returns results for all provided analyzers - Fixes test expectation mismatch (expected 8 results, got 1) Resolves all failing CI checks: lint, test, and success workflow logic --------- Co-authored-by: Noah Campbell <noah.edward.campbell@gmail.com> * Auto-Collect (#1867) * Fix auto-collector missing files issue - Add KOTS-aware detection for diagnostic files - Replace silent RBAC filtering with user warnings - Enhance error file collection for troubleshooting - Achieve parity with traditional support bundles Resolves issue where auto-collector was missing: - KOTS diagnostic files (now 4 vs 3) - ConfigMaps (now 6 vs 6) - Maintains superior log collection (24 vs 0) Final result: [SUCCESS] comprehensive collection achieved * fixing bugbog * fix: resolve production readiness issues in auto-collect branch 1. Fix diff test expectations (lines should have newlines for difflib consistency) 2. Fix preflight tests to use existing v1beta3 example file 3. Fix autodiscovery test context parameter (function signature update) Resolves TestReadLinesFromReader and preflight v1beta3 test failures * fix: resolve autodiscovery tests and cursor bot image matching issues 1. Fix cursor bot image matching bug in isKotsadmImage: - Replace flawed prefix matching with proper image component detection - Handle private registries correctly (registry.company.com/kotsadm/kotsadm:v1.0.0) - Prevent false positives with proper delimiter checking - Add helper functions: containsImageComponent, splitImagePath, removeTagAndDigest 2. Fix autodiscovery test failures: - Add TestMode flag to DiscoveryOptions to control KOTS diagnostic collection - Tests use TestMode=true to get only foundational collectors (no KOTS diagnostics) - Preserves production behavior while enabling clean testing Resolves failing TestDiscoverer_DiscoverFoundational tests and cursor bot issues * Cron job clean (#1862) * created roadmap and yaml claude agent * Update roadmap.md * chore(deps): bump sigstore/cosign-installer from 3.9.2 to 3.10.0 (#1857) Bumps [sigstore/cosign-installer](https://github.com/sigstore/cosign-installer) from 3.9.2 to 3.10.0. - [Release notes](https://github.com/sigstore/cosign-installer/releases) - [Commits](https://github.com/sigstore/cosign-installer/compare/v3.9.2...v3.10.0) --- updated-dependencies: - dependency-name: sigstore/cosign-installer dependency-version: 3.10.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * chore(deps): bump the security group with 2 updates (#1858) Bumps the security group with 2 updates: [github.com/vmware-tanzu/velero](https://github.com/vmware-tanzu/velero) and [helm.sh/helm/v3](https://github.com/helm/helm). Updates `github.com/vmware-tanzu/velero` from 1.16.2 to 1.17.0 - [Release notes](https://github.com/vmware-tanzu/velero/releases) - [Changelog](https://github.com/vmware-tanzu/velero/blob/main/CHANGELOG.md) - [Commits](https://github.com/vmware-tanzu/velero/compare/v1.16.2...v1.17.0) Updates `helm.sh/helm/v3` from 3.18.6 to 3.19.0 - [Release notes](https://github.com/helm/helm/releases) - [Commits](https://github.com/helm/helm/compare/v3.18.6...v3.19.0) --- updated-dependencies: - dependency-name: github.com/vmware-tanzu/velero dependency-version: 1.17.0 dependency-type: direct:production update-type: version-update:semver-minor dependency-group: security - dependency-name: helm.sh/helm/v3 dependency-version: 3.19.0 dependency-type: direct:production update-type: version-update:semver-minor dependency-group: security ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * chore(deps): bump helm.sh/helm/v3 from 3.18.6 to 3.19.0 in /examples/sdk/helm-template in the security group (#1859) chore(deps): bump helm.sh/helm/v3 Bumps the security group in /examples/sdk/helm-template with 1 update: [helm.sh/helm/v3](https://github.com/helm/helm). Updates `helm.sh/helm/v3` from 3.18.6 to 3.19.0 - [Release notes](https://github.com/helm/helm/releases) - [Commits](https://github.com/helm/helm/compare/v3.18.6...v3.19.0) --- updated-dependencies: - dependency-name: helm.sh/helm/v3 dependency-version: 3.19.0 dependency-type: direct:production update-type: version-update:semver-minor dependency-group: security ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Add cron job support bundle scheduler Complete implementation with K8s integration: - pkg/schedule/job.go: Job management and persistence - pkg/schedule/daemon.go: Real-time scheduler daemon - pkg/schedule/cli.go: CLI commands (create, list, delete, daemon) - pkg/schedule/schedule_test.go: Comprehensive unit tests - cmd/troubleshoot/cli/root.go: CLI integration * fixing bugbot * Fix all bugbot errors: auto-update stability, job cooldown timing, and daemon execution * Deleting Agent * removed unused flags * fixing auto-upload * fixing markdown files * namespace not required flag for auto collectors to work * loosened cron job validation * writes logs to logfile * fix: resolve autoFromEnv variable scoping issue for CI - Ensure autoFromEnv variable and its usage are in correct scope - Fix build errors: declared and not used / undefined variable - All functionality preserved and tested locally - Force add to override gitignore --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: Noah Campbell <noah.edward.campbell@gmail.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * feat: clean tokenization system implementation (#1874) Core tokenization functionality with minimal file changes: ✅ Core Features: - Intelligent tokenization engine (tokenizer.go) - Context-aware secret classification (PASSWORD, APIKEY, DATABASE, etc.) - Cross-file correlation with deterministic HMAC-SHA256 tokens - Optional encrypted mapping for token→original value resolution ✅ Integration: - CLI flags: --tokenize, --redaction-map, --encrypt-redaction-map - Updated all redactor types: literal, single-line, multi-line, YAML - Support bundle integration with auto-upload compatibility - Backward compatibility: preserves *HIDDEN* when disabled ✅ Production Ready: - Only 11 essential files (vs 31 in original PR) - No excessive test files or documentation - Clean build, all functionality verified - Maintains existing redaction behavior by default Token format: *TOKEN_<TYPE>_<HASH>* (e.g., *TOKEN_PASSWORD_A1B2C3) Removes silent failing (#1877) * preserves stdout and stderr from collectors * Delete eliminate-silent-failures.md * Update host_kernel_modules_test.go * added error logs when a collector fails to start * Update host_filesystem_performance_linux.go * fixed error saving logic inconsistency * Update collect.go * Improved error handling for support bundles and redactors for windows (#1878) * improved error handling and window locking * Delete all-windows-collectors.yaml * addressing bugbot concerns * Update host_tcpportstatus.go * Update redact.go * Add regression test suite to github actions * Update regression-test.yaml * Update regression-test.yaml * Update regression-test.yaml * create test/output directory * handle node-specific files and multiple report arguments * simplify comparison to detect code regressions only * handle empty structural_compare rules * removed v1beta3 branch from github workflow * Update Makefile * removed outdated actions * Update Makefile --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: Noah Campbell <noah.edward.campbell@gmail.com> Co-authored-by: Benjamin Yang <82779168+bennyyang11@users.noreply.github.com> Co-authored-by: Benjamin Yang <benjaminyang@Benjamins-MacBook-Pro.local> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-10-08 10:22:11 -07:00
Gerard Nguyen	7a00dbea05	fix: [sc-116508] [Troubleshoot] support-bundle load-spec from URI field logic is fragile (#1716 ) do not load invalid URI	2025-01-08 09:03:32 +11:00
Evans Mungai	deda4ce98c	feat: Do not prompt users to save support bundle analysis results (#1662 ) In interactive mode, do not prompt users to save support bundle analysis results. Users end up providing this file instead of the support bundle archive. The analysis results are contained in the support bundle archive already Signed-off-by: Evans Mungai <evans@replicated.com>	2024-10-25 13:03:16 +01:00
Ash	c968fca125	Allow `collect` to chroot itself (#1658 ) * Enable chroot * typo * platform specific chroot functions * Add friendly chroot warning if running without elevated permissions	2024-10-22 17:06:07 +01:00
Ash	0fb0a07e55	[collect] accept stdin (#1657 )	2024-10-22 15:06:17 +01:00
Evans Mungai	0113624352	chore(support-bundle): respect using load-cluster-specs=false (#1634 ) * fix: Allow using load-cluster-specs=false Signed-off-by: Evans Mungai <evans@replicated.com> * Some more simplification Signed-off-by: Evans Mungai <evans@replicated.com> * Ensure error in loading specs is printed in CLI Signed-off-by: Evans Mungai <evans@replicated.com> * Run linter Signed-off-by: Evans Mungai <evans@replicated.com> * Fix failing tests Signed-off-by: Evans Mungai <evans@replicated.com> * Remove unnecessary test case rename Signed-off-by: Evans Mungai <evans@replicated.com> * Fix error wrapping Signed-off-by: Evans Mungai <evans@replicated.com> * Check if load-cluster-specs was provided in cli Signed-off-by: Evans Mungai <evans@replicated.com> * Better wording in comments Signed-off-by: Evans Mungai <evans@replicated.com> --------- Signed-off-by: Evans Mungai <evans@replicated.com>	2024-10-11 13:48:32 -04:00
Diamon Wiggins	8105fa00e9	Refactor Remote Host Collection (#1633 ) * refactor remote collectors * add remotecollect params struct * remove commented checkrbac function * removed unused function * add temp comments * refactor to not require RemoteCollect method per collector * removed unneeded param * removed unneeded param * more refactor * more refactor * remove unneeded function * remove debug print * fix analyzer results * move rbac to separate file * be more specific with rbac function name * fix imports * fix node list file * make k8s rest client config consistent with in cluster collection * add ctx and otel tracing * add test for allCollectedData * move runHostCollectorsInPod to spec instead of metadata * make generate * fix broken references to supportbundle metadata * add e2e tests * update loader tests * fix tests * fix hostos remote collector spec * update remoteHostCollectrs.yaml --------- Co-authored-by: Dexter Yan <yanshaocong@gmail.com>	2024-10-09 18:38:49 +13:00
Dexter Yan	e97b9613a5	feat(support-bundle): add runHostCollectorsInPod in spec (#1608 )	2024-09-20 11:57:58 -05:00
Evans Mungai	aea4f7c87c	feat: Optionally save preflight bundles to disk (#1612 ) * feat: Optionally save preflight bundles to disk Signed-off-by: Evans Mungai <evans@replicated.com> * Add e2e test of saving preflight bundle Signed-off-by: Evans Mungai <evans@replicated.com> * Update cli docs Signed-off-by: Evans Mungai <evans@replicated.com> * Expose GetVersionFile function publicly Signed-off-by: Evans Mungai <evans@replicated.com> * Store analysis.json file in preflight bundle Signed-off-by: Evans Mungai <evans@replicated.com> * Run go fmt when running lint fixers Signed-off-by: Evans Mungai <evans@replicated.com> * Always generate a preflight bundle in CLI Signed-off-by: Evans Mungai <evans@replicated.com> * Print saving bundle message to stderr Signed-off-by: Evans Mungai <evans@replicated.com> * Revert changes in docs directory Signed-off-by: Evans Mungai <evans@replicated.com> * Use NewResult constructor Signed-off-by: Evans Mungai <evans@replicated.com> * Log always when preflight bundle is saved to disk Signed-off-by: Evans Mungai <evans@replicated.com> --------- Signed-off-by: Evans Mungai <evans@replicated.com>	2024-09-16 23:36:52 +01:00
Diamon Wiggins	0c63880528	fix: --redactors flag is dropped if no spec provided (#1611 ) don't drop redactors when 0 positional arguments are provided	2024-09-12 09:01:44 +12:00
Diamon Wiggins	e685cb9cf3	feat: make load-cluster-specs the default (#1602 ) * make load-cluster-specs the default	2024-08-26 09:54:15 -04:00
Dexter Yan	55608c12ed	feat(support-bundle): add text that explains where support bundle to share (#1595 )	2024-08-16 10:07:26 +12:00
Gerard Nguyen	04e656a0a5	fix: [sc-106256] Add missing uri field to troubleshoot.sh types (#1578 ) * new no-uri flag for preflight * implement load additional spec from URIs	2024-07-19 08:23:55 +10:00
Gerard Nguyen	191ebdb598	feat: [sc-106759] Troubleshoot: uri field only download when we're not downloading (#1567 ) * remove uri: when url is provided	2024-06-24 10:12:41 +10:00
Gerard Nguyen	fb0f81d076	bug: [sc-103753] Host analysers are not deduplicating during multiple spec merges #1485 (#1542 ) * dedupe host collector and analyzer * dedup analyzer * add unit test * use generic	2024-05-17 09:34:29 +10:00
Gerard Nguyen	088f0321e7	feat: [sc-103119] URI is not replacing the spec but merging instead (#1541 ) * only replace spec that has uri * add unit test * fix unit tests from code review	2024-05-03 11:36:27 +01:00
Evans Mungai	aeaac7a70f	Revert "feat: URI is not replacing the spec but merging instead (#1537 ) (#1539 ) Revert "feat: [sc-103119] URI is not replacing the spec but merging instead (#1537)" This reverts commit `c2f72ecd0c`.	2024-05-01 14:38:46 +01:00
Gerard Nguyen	c2f72ecd0c	feat: [sc-103119] URI is not replacing the spec but merging instead (#1537 ) fix uri spec to replace origin spec	2024-04-30 14:13:35 +12:00
Evans Mungai	c9830de736	chore: Update CLI prompt to run command as root (#1517 )	2024-03-28 19:13:33 +00:00
Evans Mungai	e6aff48f1b	feat: Prompt for privileged user if host collectors present in spec (#1513 ) * feat: Prompt for privileged user if host collectors present * Prompt preflight checks that have host collectors * Show cursor before prompting	2024-03-28 11:51:19 +00:00
Evans Mungai	e4363a1e50	fix (supportbundle): Add default collectors when expected (#1418 ) * fix (supportbundle): Add default collectors when expected * Remove unnecessary change * Add default collectors to a empty spec * Add more tests	2024-01-08 11:32:04 +00:00
Evans Mungai	e5e26eea14	fix(support-bundle): default in-cluster collectors in host support bundle (#1394 ) * fix(support-bundle): default in-cluster collectors in host support bundle Ensure cluster-resources and cluster-info collectors are present only when a support bundle spec contains in-cluster collectors. * Various improvements * Improve error messages * Util function appending elements to a nil slice that allows adding specs to an empty slice of collectors/analysers/redactors * Fix failing test	2023-11-27 18:33:02 +00:00
Diamon Wiggins	08c3fcf3df	Gracefully handle unreachable URIs in loadSupportBundleSpecsFromURIs (#1383 ) * gracefully handle unreachable URIs in loadSupportBundleSpecsFromURIs * let caller decide how to handle the error * fix klog import * Add a test to ensure failing to load uri does not error --------- Co-authored-by: Evans Mungai <evans@replicated.com>	2023-10-27 14:48:09 +01:00
Evans Mungai	312e467160	fix: embed troubleshoot version string from module dependency (#1371 ) If troubleshoot is used as a dependency in go.mod, the version information of the release would be missing at runtime. This is because the version string is injected to binaries at build time using linker flags (LD) passed to the compiler (check Makefile)	2023-10-16 13:51:55 +01:00
Evans Mungai	15a4802cd2	feat: Add dry run flag to print support bundle specs to std out (#1337 ) * Add dry-run flag * No traces on dry run * More refactoring * More updates to support bundle binary * More refactoring changes * Different approach of loading specs from URIs * Self review * More changes after review and testing * fix how we parse oci image uri * Remove unnecessary comment * Add missing file * Fix failing tests * Better error check for no collectors * Add default collectors when parsing support bundle specs * Add missed test fixture * Download specs with correct headers * Fix typo	2023-10-10 18:43:32 +01:00
Evans Mungai	73a2d882d7	fix: Store custom resources in JSON & YAML format (#1360 ) fix: Store custom resources as JSON and YAML files	2023-10-10 17:50:15 +01:00
Evans Mungai	b9f4fc4390	feat: Dry run flag to print preflight specs to std out (#1240 )	2023-09-12 14:42:10 +01:00
Evans Mungai	ff03bfa9cd	chore: make spec loaders internal APIs (#1313 ) * chore: make specs an internal package * Some minor improvements * Use LoadClusterSpecs in support bundle implementation * Remove change accidentally committed * Use LoadFromCLIArgs in preflight CLI implementation * Update comment * Fix edge case where the label selector is an empty string * Fix failing test	2023-08-30 14:02:30 +01:00
Dexter Yan	f0efbf658a	fix(message): solve the terminal UI issue of truncating the message if it is long (#1242 )	2023-06-28 11:06:15 +12:00
Dexter Yan	f9dbccd354	feat(support-bundle): add support labels for troubleshoot.io and troubleshoot.sh (#1203 )	2023-06-08 16:05:35 +01:00
Evans Mungai	401dfe2c57	feat: add loader APIs to load specs from raw troubleshoot spec (#1202 ) * feat: add loader APIs to load specs from a list of yaml docs The change introduces a loader package that will contain loader public APIs. The aim of these APIs will be to, given any source of troubleshoot specs, the loaders will fetch the specs and parse out all troubleshoot objects that can be extracted. * Some refactoring * Some more changes * More changes caught when testing vendor portal * Add tests and rename Troubleshoot kinds struct * Additional test * Handle ConfigMap and Secrets with multiple specs in them * Fix failing test * Revert multidoc split implementation * Fix merge conflict * Change LoadFromXXX functions to a single LoadSpecs function	2023-06-06 16:48:29 -04:00
Evans Mungai	1939f5464d	fix (support-bundle): Ensure specs are merged correctly (#1181 ) When the support-bundle cli is used with --load-cluster-specs, not all discovered specs are merged into the spec used to collect data. Fixes: #1179	2023-05-26 13:43:57 +01:00
Nathan Sullivan	3548b46cfc	support multiple exit codes based on what went wrong/right (#1135 ) 0 = all passed, 3 = at least one failure, 4 = no failures but at least 1 warn 1 as a catch all (generic errors), 2 for invalid input/specs etc ref https://github.com/replicatedhq/troubleshoot/issues/1131 docs https://github.com/replicatedhq/troubleshoot.sh/pull/489	2023-05-10 09:33:13 +10:00
danj-replicated	285631446e	Add ability to fetch preflights from OCI registry to standard out (#1117 ) * add oci-fetch command	2023-04-14 11:25:42 +12:00
Evans Mungai	dc1687a76a	fix: Discover specs from namespaces user is allowed (#1098 ) * fix: Discover specs from namespaces user is allowed If a user has limited access to read secrets and config maps from certain namespaces in a cluster, we'd need to gracefully fail when forbidden errors are caught. We'll log them and continue searching for specs in other namespaces.	2023-04-05 18:50:46 +12:00
Dexter Yan	79f8e6efab	feat(support-bundle): check if the cluster IsNamespacedScopeRBAC and use current namespace (#1055 ) feat(support-bundle): add IsNamespacedScope check	2023-03-22 11:33:54 -04:00
Evans Mungai	546ffde14b	feat: use klog as the default logging library (#1008 )	2023-02-24 18:24:51 +00:00
Tom Wieczorek	9f6739fdbe	feat(support-bundle): print progress in debug non-interactive mode (#1010 ) Currently, there's no debug logs whatsoever when running a command like troubleshoot --debug --interactive=false Tackle this by printing similar log statements as presented in interactive mode to the debug logger. Refactor the code a bit so there's no need for a dedicated finished channel and to exit the goroutines properly. Co-authored-by: Evans Mungai <evans@replicated.com>	2023-02-15 19:13:35 +00:00
Tom Wieczorek	e85e91e784	chore: remove unused code (#1013 ) Remove code snippets that are not used across the codebase.	2023-02-10 17:27:50 +00:00
Evans Mungai	100f9a13b6	feat: Record summary of execution times of support bundle operations (collect/redact/analyse) (#935 ) When running a support bundle, we want to know how long each operation (collect, redact, analyze) takes. This commit adds a new trace exporter that records the start and end times of each operation, and then prints a summary of the execution. The summary is also stored in the support bundle. Related to #923	2023-02-07 09:50:21 +00:00
yunju.lly	1114902707	refactor: keep support bundle concat logic to be consistent with Preflight concat (#1002 ) * refactor: keep support bundle concat logic to be consistent with Preflight * test: add tests for support bundle spec concat function	2023-02-03 11:05:11 +00:00
Diamon Wiggins	4fca6aff98	Deduplication for In-Cluster Collectors (#972 ) * adding dedup for in cluster collectors * add tests * return collector as is whenever marshalling to json fails --------- Co-authored-by: Evans Mungai <evans@replicated.com>	2023-02-01 14:14:43 -05:00
Dexter Yan	962e2c7d7e	feat(support-bundle): optimize the error log of ceph and longhorn when kURL add-on were not enabled (#943 )	2023-01-10 09:37:42 +13:00

1 2 3 4 5

236 Commits