Yashasvi Yadav
62f500fb2e
feat: add GCP zone outage rollback support ( #1200 )
...
Add rollback functionality for GCP zone outage scenarios following the
established rollback pattern (Service Hijacking, PVC, Syn Flood).
- Add @set_rollback_context_decorator to run()
- Set rollback callable before stopping nodes with base64/JSON encoded data
- Add rollback_gcp_zone_outage() static method with per-node error handling
- Fix missing poll_interval argument in starmap calls
- Add unit tests for rollback and run methods
Closes #915
Signed-off-by: YASHASVIYADAV30 <yashasviydv30@gmail.com >
Co-authored-by: Paige Patton <64206430+paigerube14@users.noreply.github.com >
2026-03-26 14:42:45 -04:00
Arpit Raj
ec241d35d6
fix: improve logging reliability and code quality ( #1199 )
...
- Fix typo 'wating' -> 'waiting' in scenario wait log message
- Replace print() with logging.debug() for pod metrics in prometheus client
- Replace star import with explicit imports in utils/__init__.py
- Remove unnecessary global declaration in main()
- Log VM status exceptions at ERROR level with exception details
Include unit tests in tests/test_logging_and_code_quality.py covering all fixes.
Signed-off-by: 1PoPTRoN <vrxn.arp1traj@gmail.com >
Co-authored-by: Paige Patton <64206430+paigerube14@users.noreply.github.com >
2026-03-26 13:08:56 -04:00
Arpit Raj
59e10d5a99
fix: bind exception variable in except handlers to prevent NameError ( #1198 )
...
Signed-off-by: 1PoPTRoN <vrxn.arp1traj@gmail.com >
Co-authored-by: Paige Patton <64206430+paigerube14@users.noreply.github.com >
2026-03-26 09:43:37 -04:00
Paige Patton
c8aa959df2
controller -> detailed ( #1201 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-03-26 08:47:06 -04:00
Paige Patton
3db5e1abbe
no rebuild image ( #1197 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
v5.0.2
2026-03-20 12:54:45 -04:00
Paige Patton
1e699c6cc9
different quay users ( #1196 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-03-20 17:30:42 +01:00
Paige Patton
0ebda3e101
test multi platform ( #1194 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-03-20 11:09:33 -04:00
Tullio Sebastiani
8a5be0dd2f
Resiliency Score krknctl compatibility fixes ( #1195 )
...
* added console log of the resiliency score when mode is "detailed"
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* base image krknctl input
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
resiliency score flag
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* removed json print in run_krkn.py
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* unit test fix
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
---------
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
2026-03-20 11:09:07 -04:00
Tullio Sebastiani
62dadfe25c
Resiliency Score krknctl compatibility fixes ( #1195 )
...
* added console log of the resiliency score when mode is "detailed"
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* base image krknctl input
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
resiliency score flag
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* removed json print in run_krkn.py
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* unit test fix
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
---------
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
2026-03-20 11:08:56 -04:00
Paige Patton
cb368a2f5c
adding tests coverage for resiliency scoring ( #1161 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
v5.0.1
2026-03-19 14:16:51 -04:00
Paige Patton
bb636cd3a9
Custom weight to resiliency ( #1173 )
...
* feat(resiliency): implement comprehensive resiliency scoring system
- Added resiliency scoring engine
- Implemented scenario-wise scoring with telemetry
- Added configurable SLOs and detailed reporting
Signed-off-by: Abhinav Sharma <abhinavs1920bpl@gmail.com >
Signed-off-by: Paige Patton <prubenda@redhat.com >
* fix: check prometheus url after openshift prometheus check
Signed-off-by: Abhinav Sharma <abhinavs1920bpl@gmail.com >
Signed-off-by: Paige Patton <prubenda@redhat.com >
* custom weight
Signed-off-by: Paige Patton <prubenda@redhat.com >
---------
Signed-off-by: Abhinav Sharma <abhinavs1920bpl@gmail.com >
Signed-off-by: Paige Patton <prubenda@redhat.com >
Co-authored-by: Abhinav Sharma <abhinavs1920bpl@gmail.com >
2026-03-19 13:14:08 -04:00
Arpit Raj
f241b2b62f
fix: prevent script injection in require-docs workflow ( #1187 )
...
- replace shell interpolation of PR body with jq + $GITHUB_EVENT_PATH
- replace shell interpolation of branch name with actions/github-script
- remove unused actions/checkout step
- add 27 unit tests covering checkbox detection, docs PR search, and
security regression checks to prevent re-introduction of the bug
Signed-off-by: Arpit Raj <vrxn.arp1traj@gmail.com >
Co-authored-by: Paige Patton <64206430+paigerube14@users.noreply.github.com >
2026-03-17 09:37:35 -04:00
Paige Patton
2a60a519cd
adding run tag ( #1179 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-03-16 16:18:50 -04:00
Naga Ravi Chaitanya Elluri
31756e6d9b
Add Beta features governance policy ( #1185 )
...
Introduce documentation defining Beta feature expectations, lifecycle,
user guidance, and promotion criteria to GA. This helps users understand
that Beta features are experimental and intended for early feedback.
Signed-off-by: Naga Ravi Chaitanya Elluri <nelluri@redhat.com >
2026-03-12 23:39:14 -04:00
Paige Patton
8c9bce6987
sed change ( #1186 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-03-11 12:34:12 -04:00
Arpit Raj
5608482f1b
fix: use sorted() instead of .sort() for key validation ( #1182 ) ( #1184 )
...
Signed-off-by: Arpit Raj <vrxn.arp1traj@gmail.com >
2026-03-10 10:58:12 -04:00
Darshan Jain
a14d3955a6
feat(ci): add pytest-based CI test framework v2 with ephemeral namespace isolation ( #1172 ) ( #1171 )
...
* feat: add pytest-based CI test framework v2 with ephemeral namespace isolation
Signed-off-by: ddjain <darjain@redhat.com >
* feat(ci): add tests_v2 pytest functional test framework
Signed-off-by: ddjain <darjain@redhat.com >
Co-authored-by: Cursor <cursoragent@cursor.com >
* feat: improve naming convention
Signed-off-by: ddjain <darjain@redhat.com >
* improve local setup script.
Signed-off-by: ddjain <darjain@redhat.com >
* added CI job for v2 test
Signed-off-by: ddjain <darjain@redhat.com >
* disabled broken test
Signed-off-by: ddjain <darjain@redhat.com >
* improved CI pipeline execution time
Signed-off-by: ddjain <darjain@redhat.com >
* chore: remove unwanted/generated files from PR
Signed-off-by: ddjain <darjain@redhat.com >
* clean up gitignore file
Signed-off-by: ddjain <darjain@redhat.com >
* fix copilot comments
Signed-off-by: ddjain <darjain@redhat.com >
* fixed copilot suggestion
Signed-off-by: ddjain <darjain@redhat.com >
* uncommented out test upload stage
Signed-off-by: ddjain <darjain@redhat.com >
* exclude CI/tests_v2 from test coverage reporting
Signed-off-by: ddjain <darjain@redhat.com >
* uploading style.css to fix broken report artifacts
Signed-off-by: ddjain <darjain@redhat.com >
* added openshift supported labels in namespace creatation api
Signed-off-by: ddjain <darjain@redhat.com >
---------
Signed-off-by: ddjain <darjain@redhat.com >
Co-authored-by: Cursor <cursoragent@cursor.com >
2026-03-06 08:44:07 -05:00
Arpit Raj
f655ec1a73
fix: accumulate failed scenarios across all scenario types instead of overwriting ( #1178 )
...
Signed-off-by: Arpit Raj <vrxn.arp1traj@gmail.com >
2026-03-05 14:06:56 -05:00
Paige Patton
dfc350ac03
adding set run tag ( #1174 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-02-27 15:05:05 -05:00
Paige Patton
c474b810b2
updating to use krkn-lib virt functions ( #989 )
...
Assisted By: Claude Code
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-02-27 14:45:31 -05:00
Paige Patton
072e8d0e87
changing pod ( #1175 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-02-27 14:40:49 -05:00
Nesar Kavri
aee61061ac
Fix: make entrypoint fail fast if setup-ssh.sh fails ( #1170 )
...
Signed-off-by: Nesar976 <kavrinesar@gmail.com >
2026-02-27 14:18:01 -05:00
Paige Patton
544cac8bbb
merge ( #710 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-02-27 14:10:08 -05:00
SurbhiAgarwal
49b1affdb8
Improve error message clarity for setuptools version requirement ( #1162 )
...
Fixes #1143 - Updated error message to clearly state that version 38.3 or newer is required
Signed-off-by: Surbhi <agarwalsurbhi1807@gmail.com >
Co-authored-by: Paige Patton <64206430+paigerube14@users.noreply.github.com >
2026-02-24 10:59:22 -05:00
Darshan Jain
c1dd43fe87
DevConf Pune 2026 feedback ( #1169 )
...
Signed-off-by: ddjain <darjain@redhat.com >
2026-02-23 19:54:06 +05:30
Ashish Mahajan
8dad2a3996
fix: use per-URL status_code in HealthChecker telemetry ( #1091 )
...
Signed-off-by: AR21SM <mahajanashishar21sm@gmail.com >
Co-authored-by: Paige Patton <64206430+paigerube14@users.noreply.github.com >
2026-02-19 09:25:03 -05:00
Tullio Sebastiani
cebc60f5a8
Network chaos NG porting - pod network chaos node network chaos ( #991 )
...
* fix ibm
Signed-off-by: Paige Patton <prubenda@redhat.com >
* type hint fix
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* pod network chaos plugin structure + utils method refactoring
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* Pod network chaos plugin
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* Node network chaos plugin
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* default config files
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* config.yaml
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* all field optional
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* minor fixes
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* minor nit on config
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* utils unit tests
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* PodNetworkChaos unit tests
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* NodeNetworkChaos unit test
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* PodNetworkChaos functional test
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* NodeNetworkChaso functional test
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* added funtests to the gh action
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* unit test fix
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* changed test order + resource rename
* functional tests fix
smallchange
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
fix requirements
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* changed pod test target
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
* added kind port mapping and removed portforwarding
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
fix
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
test fixes
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
test fixes
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
---------
Signed-off-by: Paige Patton <prubenda@redhat.com >
Signed-off-by: Tullio Sebastiani <tsebasti@redhat.com >
Co-authored-by: Paige Patton <prubenda@redhat.com >
2026-02-18 16:20:16 +01:00
Darshan Jain
2065443622
collect ERROR and CRITICAL logs and send to elastic search ( #1147 ) ( #1150 )
...
* collect ERROR and CRITICAL logs and send to elastic search
Signed-off-by: ddjain <darjain@redhat.com >
* bump up krkn-lib to 6.0.3
Signed-off-by: ddjain <darjain@redhat.com >
---------
Signed-off-by: ddjain <darjain@redhat.com >
2026-02-18 18:26:14 +05:30
Ashish Mahajan
b6ef7fa052
fix: use list comprehension to avoid skipping nodes during exclusion ( #1059 )
...
Fixes #1058
Signed-off-by: AR21SM <mahajanashishar21sm@gmail.com >
Co-authored-by: Paige Patton <64206430+paigerube14@users.noreply.github.com >
2026-02-17 15:20:10 -05:00
Paige Patton
4f305e78aa
remove chaos ai
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-02-11 13:44:13 -05:00
dependabot[bot]
b17e933134
Bump pillow from 10.3.0 to 12.1.1 in /utils/chaos_ai ( #1157 )
...
Bumps [pillow](https://github.com/python-pillow/Pillow ) from 10.3.0 to 12.1.1.
- [Release notes](https://github.com/python-pillow/Pillow/releases )
- [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst )
- [Commits](https://github.com/python-pillow/Pillow/compare/10.3.0...12.1.1 )
---
updated-dependencies:
- dependency-name: pillow
dependency-version: 12.1.1
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-11 10:08:42 -05:00
Paige Patton
beea484597
adding vm ware tests ( #1133 )
...
Signed-off-by: Paige Patton <paigepatton@Paiges-MacBook-Air.local >
Signed-off-by: Paige Patton <prubenda@redhat.com >
Co-authored-by: Paige Patton <paigepatton@Paiges-MacBook-Air.local >
2026-02-10 16:24:26 -05:00
Paige Patton
0222b0f161
fix ibm ( #1155 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-02-10 10:09:28 -05:00
Ashish Mahajan
f7e674d5ad
docs: fix typos in logs, comments, and documentation ( #1079 )
...
Signed-off-by: AR21SM <mahajanashishar21sm@gmail.com >
2026-02-09 09:48:51 -05:00
Ashish Mahajan
7aea12ce6c
fix(VirtChecker): handle empty VMI interfaces list ( #1072 )
...
Signed-off-by: AR21SM <mahajanashishar21sm@gmail.com >
Co-authored-by: Paige Patton <64206430+paigerube14@users.noreply.github.com >
2026-02-09 08:29:48 -05:00
Darshan Jain
625e1e90cf
feat: add color-coded console logging ( #1122 ) ( #1146 )
...
Functional & Unit Tests / Functional & Unit Tests (push) Failing after 2m16s
Functional & Unit Tests / Generate Coverage Badge (push) Has been skipped
Manage Stale Issues and Pull Requests / Mark and Close Stale Issues and PRs (push) Successful in 24s
Signed-off-by: ddjain <darjain@redhat.com >
2026-02-05 14:27:52 +05:30
dependabot[bot]
a9f1ce8f1b
Bump pillow from 10.2.0 to 10.3.0 in /utils/chaos_ai ( #1149 )
...
Functional & Unit Tests / Functional & Unit Tests (push) Failing after 34m28s
Functional & Unit Tests / Generate Coverage Badge (push) Has been skipped
Manage Stale Issues and Pull Requests / Mark and Close Stale Issues and PRs (push) Successful in 5s
Bumps [pillow](https://github.com/python-pillow/Pillow ) from 10.2.0 to 10.3.0.
- [Release notes](https://github.com/python-pillow/Pillow/releases )
- [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst )
- [Commits](https://github.com/python-pillow/Pillow/compare/10.2.0...10.3.0 )
---
updated-dependencies:
- dependency-name: pillow
dependency-version: 10.3.0
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-02 13:47:47 -05:00
Paige Patton
66e364e293
wheel updates ( #1148 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-02-02 13:46:22 -05:00
Paige Patton
898ce76648
adding python3.11 updates ( #1012 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-02-02 12:00:33 -05:00
Chaudary Farhan Saleem
4a0f4e7cab
fix: correct spelling typos in log messages ( #1145 )
...
- Fix 'wating' - 'waiting' (2 occurrences)
- Fix 'successfuly' - 'successfully' (12 occurrences)
- Fix 'orginal' - 'original' (1 occurrence)
Improves professionalism of log output and code comments.
Signed-off-by: farhann_saleem <chaudaryfarhann@gmail.com >
2026-02-02 09:23:44 -05:00
Darshan Jain
819191866d
Add CLAUDE.md for AI-assisted development ( #1141 )
...
Functional & Unit Tests / Functional & Unit Tests (push) Failing after 1m38s
Functional & Unit Tests / Generate Coverage Badge (push) Has been skipped
Manage Stale Issues and Pull Requests / Mark and Close Stale Issues and PRs (push) Successful in 6s
Signed-off-by: ddjain <darjain@redhat.com >
2026-01-31 23:41:49 +05:30
Paige Patton
37ca4bbce7
removing unneeded requirement ( #1066 )
...
Functional & Unit Tests / Functional & Unit Tests (push) Failing after 2m50s
Functional & Unit Tests / Generate Coverage Badge (push) Has been skipped
Manage Stale Issues and Pull Requests / Mark and Close Stale Issues and PRs (push) Successful in 4s
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-01-20 13:33:28 -05:00
Ashish Mahajan
b9dd4e40d3
fix(hogs): correct off-by-one error in random node selection ( #1112 )
...
Signed-off-by: AR21SM <mahajanashishar21sm@gmail.com >
2026-01-20 11:00:50 -05:00
AR21SM
3fd249bb88
Add stale PR management to workflow
...
Functional & Unit Tests / Functional & Unit Tests (push) Failing after 2m11s
Functional & Unit Tests / Generate Coverage Badge (push) Has been skipped
Manage Stale Issues and Pull Requests / Mark and Close Stale Issues and PRs (push) Successful in 5s
Signed-off-by: AR21SM <mahajanashishar21sm@gmail.com >
2026-01-19 15:10:49 -05:00
Naga Ravi Chaitanya Elluri
773107245c
Add contribution guidelines reference to the PR template ( #1108 )
...
Signed-off-by: Naga Ravi Chaitanya Elluri <nelluri@redhat.com >
2026-01-19 14:30:04 -05:00
Paige Patton
05bc201528
adding chaos_ai deprecation ( #1106 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-01-19 13:14:04 -05:00
Ashish Mahajan
9a316550e1
fix: add missing 'as e' to capture exception in TimeActionsScenarioPlugin ( #1057 )
...
Signed-off-by: AR21SM <mahajanashishar21sm@gmail.com >
Co-authored-by: Paige Patton <64206430+paigerube14@users.noreply.github.com >
2026-01-19 09:37:30 -05:00
Ashish Mahajan
9c261e2599
feat(ci): add stale issues automation workflow ( #1055 )
...
Functional & Unit Tests / Functional & Unit Tests (push) Failing after 9m42s
Functional & Unit Tests / Generate Coverage Badge (push) Has been skipped
Close Stale Issues / Mark and Close Stale Issues (push) Successful in 9s
Signed-off-by: AR21SM <mahajanashishar21sm@gmail.com >
2026-01-17 10:13:49 -05:00
Paige Patton
0cc82dc65d
add service hijacking to add to file not overwrite ( #1067 )
...
Functional & Unit Tests / Functional & Unit Tests (push) Failing after 5m41s
Functional & Unit Tests / Generate Coverage Badge (push) Has been skipped
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-01-16 14:24:03 -05:00
Paige Patton
269e21e9eb
adding telemety ( #1064 )
...
Signed-off-by: Paige Patton <prubenda@redhat.com >
2026-01-16 13:53:48 -05:00