mirror of https://github.com/replicatedhq/troubleshoot.git synced 2026-02-14 18:29:53 +00:00

Files

Benjamin Yang a9d2180dd6 102 redactor newline corruption clean (#1947 )

* fix: prevent redactors from corrupting binary files (#102)

Redactors were adding newlines to files without them, corrupting binary
files during support bundle collection (51 bytes → 53 bytes).

Created LineReader to track original newline state and only restore
newlines when they were present in the original file.

- Added pkg/redact/line_reader.go
- Refactored single_line.go, multi_line.go, literal.go
- Added 48 tests, all passing
- Verified: binary files now preserved byte-for-byte

Fixes #102


* fix: handle empty lines correctly in MultiLineRedactor

- Check line1 == nil instead of len(line1) == 0 for empty file detection
- Fixes edge case where file containing only '\n' would be dropped
- Addresses bugbot finding about empty line handling


* fix: handle empty lines correctly in MultiLineRedactor

- Check line1 != nil instead of len(line1) > 0 in both locations
- Fixes edge case where empty trailing lines would be dropped
- Fix test isolation in literal_test.go (move ResetRedactionList to parent)
- Addresses bugbot findings about empty line handling

* fmt

* chore: update regression baselines from run 20107431959

* adding defense

* fix: propagate non-EOF errors in all early return paths

Ensure non-EOF errors (like buffer overflow) are properly propagated
to caller in both pre-loop early returns. Addresses bugbot finding.

* fix: use unique test names to prevent redaction list pollution

Use t.Name() instead of hardcoded 'test' to ensure each test
has unique redactor name, preventing parallel test interference

---------

Co-authored-by: hedge-sparrow <sparrow@spooky.academy>

2025-12-10 16:55:54 -06:00

baselines

102 redactor newline corruption clean (#1947 )

2025-12-10 16:55:54 -06:00

e2e

fix(collect): cluster resource errors json file has wrong name (#1936 )

2025-11-28 10:17:03 +13:00

.gitignore

V1beta3 (#1873 )

2025-10-08 10:22:11 -07:00

README.md

V1beta3 (#1873 )

2025-10-08 10:22:11 -07:00

validate-preflight-e2e.sh

feat: Optionally save preflight bundles to disk (#1612 )

2024-09-16 23:36:52 +01:00

validate-support-bundle-e2e.sh

fix: --redactors flag is dropped if no spec provided (#1611 )

2024-09-12 09:01:44 +12:00

README.md

Regression Test Suite

This directory contains the regression test infrastructure for validating preflight and support bundle collectors.

Overview

The regression test suite:

Provisions an ephemeral k3s cluster via Replicated Actions
Runs multiple preflight and support bundle specs
Compares output bundles against known-good baselines
Reports regressions (missing files, changed outputs)

Directory Structure

test/
├── README.md                    # This file
├── baselines/                   # Known-good baseline bundles
│   ├── preflight-v1beta3/
│   │   └── baseline.tar.gz
│   ├── preflight-v1beta2/
│   │   └── baseline.tar.gz
│   ├── supportbundle/
│   │   └── baseline.tar.gz
│   └── metadata.json            # Baseline metadata (git sha, date, k8s version)
└── output/                      # Test run outputs (gitignored)
    ├── preflight-v1beta3-bundle.tar.gz
    ├── preflight-v1beta2-bundle.tar.gz
    ├── supportbundle.tar.gz
    └── diff-report-*.json

Specs Under Test

Spec	File	Values	Description
Preflight v1beta3	`examples/preflight/complex-v1beta3.yaml`	`examples/preflight/values-complex-full.yaml`	Templated v1beta3 with ~30 analyzers
Preflight v1beta2	`examples/preflight/all-analyzers-v1beta2.yaml`	N/A	Legacy v1beta2 format with all analyzer types
Support Bundle	`examples/collect/host/all-kubernetes-collectors.yaml`	N/A	Comprehensive collector suite

Running Tests

Via GitHub Actions (Recommended)

The regression test workflow runs automatically on:

Push to main or v1beta3 branches
Pull requests
Manual trigger via workflow_dispatch

Manual trigger:

gh workflow run regression-test.yaml

Locally (Manual)

# 1. Build binaries
make bin/preflight bin/support-bundle

# 2. Create k3s cluster (use your preferred method)
k3d cluster create test-cluster --wait

# 3. Run specs
./bin/preflight examples/preflight/complex-v1beta3.yaml \
  --values examples/preflight/values-complex-full.yaml \
  --interactive=false

./bin/preflight examples/preflight/all-analyzers-v1beta2.yaml \
  --interactive=false

./bin/support-bundle examples/collect/host/all-kubernetes-collectors.yaml \
  --interactive=false

# 4. Compare bundles (if baselines exist)
python3 scripts/compare_bundles.py \
  --baseline test/baselines/preflight-v1beta3/baseline.tar.gz \
  --current preflightbundle-*.tar.gz \
  --rules scripts/compare_rules.yaml \
  --report test/output/diff-report.json \
  --spec-type preflight

# 5. Clean up
k3d cluster delete test-cluster

Creating Initial Baselines

If baselines don't exist yet (first time setup):

Run workflow to generate bundles:
```
gh workflow run regression-test.yaml
```

Download artifacts:

gh run download <run-id> --name regression-test-results-<run-id>-1

Inspect bundles manually:

tar -tzf preflight-v1beta3-bundle.tar.gz | head -20
tar -xzf preflight-v1beta3-bundle.tar.gz
# Verify contents look correct

Copy as baselines and commit:

mkdir -p test/baselines/{preflight-v1beta3,preflight-v1beta2,supportbundle}

cp preflight-v1beta3-bundle.tar.gz test/baselines/preflight-v1beta3/baseline.tar.gz
cp preflight-v1beta2-bundle.tar.gz test/baselines/preflight-v1beta2/baseline.tar.gz
cp supportbundle.tar.gz test/baselines/supportbundle/baseline.tar.gz

git add test/baselines/
git commit -m "chore: add initial regression test baselines"
git push

Updating Baselines

When legitimate changes occur (new collectors, changed output format):

Option 1: Automatic Update (Workflow Input)

gh workflow run regression-test.yaml -f update_baselines=true

This will:

Run tests
Copy new bundles as baselines
Commit and push updated baselines

** Use with caution!** Only use this after verifying changes are intentional.

Option 2: Manual Update

# Download artifacts from a successful run
gh run download <run-id> --name regression-test-results-<run-id>-1

# Replace baselines
cp preflight-v1beta3-bundle.tar.gz test/baselines/preflight-v1beta3/baseline.tar.gz
cp preflight-v1beta2-bundle.tar.gz test/baselines/preflight-v1beta2/baseline.tar.gz
cp supportbundle.tar.gz test/baselines/supportbundle/baseline.tar.gz

# Commit
git add test/baselines/
git commit -m "chore: update regression baselines - reason for change"
git push

Comparison Strategy

The comparison uses a 3-tier approach:

1. Exact Match (2 files)

Files compared byte-for-byte:

static-data.txt/static-data - static data collector
version.yaml - spec version
Data collector files (files/example.yaml, config/replicas.txt)

2. Structural Comparison (8 files)

Compare specific fields only, ignore variable values:

Database collectors (postgres/*.json, mysql/*.json, etc.) - Compare isConnected boolean
DNS (dns/debug.json) - Verify service exists, queries succeed
Registry (registry/*.json) - Compare exists per image
HTTP (http*.json) - Compare status code only

3. Non-Empty Check (Everything Else)

For highly variable outputs:

cluster-resources - UIDs, timestamps, resourceVersions vary
node-metrics - All metric values constantly change
logs - Timestamps in every line
run/exec collectors - Random pod names, variable output
And more...

Strategy: Verify file exists, is non-empty, and (for JSON) is valid JSON.

Understanding Test Results

Passing Test

All expected files present
Exact match files identical
Structural comparison fields match
All files non-empty and valid

Failing Test - Regressions Detected

Files missing:

⚠ Missing in current: postgres/postgres-example.json

→ Collector stopped producing output (regression)

Structural mismatch:

❌ postgres/postgres-example.json: database connection status changed: true -> false

→ Collector behavior changed (potential regression)

Empty file:

❌ dns/debug.json: File is empty

→ Collector failed to collect data (regression)

ℹ️ New Files (Not a Failure)

ℹ New file in current: newcollector/output.json

→ New collector added (expected when adding features)

Troubleshooting

Workflow fails: "No baseline found"

First time setup - baselines need to be created (see above).

Many "structural mismatch" failures

Check if cluster state changed:

Different k8s version?
Different installed components?
Resources created/deleted?

Comparison fails with Python error

Ensure dependencies installed:

pip install pyyaml deepdiff

Cluster creation times out

Check Replicated Actions limits:

# View cluster status
gh api /repos/replicatedhq/compatibility-actions/...

Configuration Files

`scripts/compare_rules.yaml`

Defines comparison strategy per file pattern.

Add new rule:

preflight:
  structural_compare:
    "mycollector/*.json": "my_comparator_function"

Then implement _compare_my_comparator_function() in scripts/compare_bundles.py.

`scripts/compare_bundles.py`

Comparison engine - implements comparison logic.

Add new comparator:

def _compare_my_comparator_function(self, baseline: Dict, current: Dict) -> bool:
    """Compare mycollector output."""
    # Your comparison logic
    return baseline["field"] == current["field"]

`.github/workflows/regression-test.yaml`

GitHub Actions workflow definition.

Tips

Start simple: Begin with baselines for v1beta2 only, add v1beta3 later
Iterate on rules: Add structural comparisons as you discover false positives
Review diffs: Always inspect diff reports before updating baselines
Document changes: In baseline update commits, explain why output changed
Monitor runtime: Workflow should complete in < 20 minutes

README.md Unescape Escape

Regression Test Suite

Overview

Directory Structure

Specs Under Test

Running Tests

Via GitHub Actions (Recommended)

Locally (Manual)

Creating Initial Baselines

Updating Baselines

Option 1: Automatic Update (Workflow Input)

Option 2: Manual Update

Comparison Strategy

1. Exact Match (2 files)

2. Structural Comparison (8 files)

3. Non-Empty Check (Everything Else)

Understanding Test Results

Passing Test

Failing Test - Regressions Detected

ℹ️ New Files (Not a Failure)

Troubleshooting

Workflow fails: "No baseline found"

Many "structural mismatch" failures

Comparison fails with Python error

Cluster creation times out

Configuration Files

scripts/compare_rules.yaml

scripts/compare_bundles.py

.github/workflows/regression-test.yaml

Tips

Related Documentation

README.md

`scripts/compare_rules.yaml`

`scripts/compare_bundles.py`

`.github/workflows/regression-test.yaml`