Tullio Sebastiani
72b46f8393
temporarily removed io-hog scenario ( #433 )
...
* temporarily removed io-hog scenario
* removed litmus documentation & config
v1.3.1
2023-06-05 11:03:44 -04:00
José Castillo Lema
a7938e58d2
Allow kraken to run with environment variables instead of kubeconfig file ( #429 )
...
* Include check for inside k8s scenario
* Include check for inside k8s scenario (2)
* Include check for inside k8s scenario (3)
* Include check for inside k8s scenario (4)
2023-06-01 14:43:01 -04:00
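The commit above lets Kraken run from inside a cluster without a kubeconfig file. A minimal sketch of such a check follows, assuming the standard `KUBERNETES_SERVICE_HOST` and `KUBERNETES_SERVICE_PORT` variables that Kubernetes injects into pods. The function names `running_inside_cluster` and `choose_auth_source` are hypothetical and are not taken from Kraken's code.

```python
import os
from typing import Mapping, Optional


def running_inside_cluster(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True when both standard in-cluster variables are set and non-empty."""
    env = os.environ if env is None else env
    return bool(env.get("KUBERNETES_SERVICE_HOST")) and bool(
        env.get("KUBERNETES_SERVICE_PORT")
    )


def choose_auth_source(
    kubeconfig_path: str, env: Optional[Mapping[str, str]] = None
) -> str:
    """Hypothetical helper: prefer an existing kubeconfig file, else the pod environment."""
    if os.path.isfile(os.path.expanduser(kubeconfig_path)):
        return "kubeconfig"
    if running_inside_cluster(env):
        return "in-cluster"
    raise RuntimeError("no kubeconfig file and no in-cluster environment found")
```

Passing the environment as a parameter keeps the check easy to exercise outside a cluster.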
Naga Ravi Chaitanya Elluri
9858f96c78
Change the severity of the etcd leader election check to warning
...
This is the first step towards having only metrics that track the overall
health and performance of the component/cluster. For instance, leader
elections are expected during etcd disruption scenarios, so the critical
category should instead track etcd leader availability and fsync latency
rather than leader elections.
2023-05-31 11:50:20 -04:00
Tullio Sebastiani
c91e8db928
Added Tullio Sebastiani to the maintainers list
2023-05-25 06:18:33 -04:00
Naga Ravi Chaitanya Elluri
54ea98be9c
Add enhancements being planned as part of the roadmap ( #425 )
2023-05-24 14:36:59 -04:00
Pradeep Surisetty
9748622e4f
Add maintainers details
2023-05-24 10:38:53 -04:00
Pradeep Surisetty
47f93b39c2
Add Code of Conduct
2023-05-22 13:25:52 -04:00
Tullio Sebastiani
aa715bf566
bump Dockerfile to release v1.3.0
2023-05-15 12:50:44 -04:00
Tullio Sebastiani
b9c08a45db
extracted the namespace as scenario input ( #419 )
...
fixed sub-workflow and input
Co-authored-by: Naga Ravi Chaitanya Elluri <nelluri@redhat.com >
v1.3.0
2023-05-15 18:24:23 +02:00
Naga Ravi Chaitanya Elluri
d9f4607aa6
Add blogs and update roadmap
2023-05-15 11:50:16 -04:00
yogananth-subramanian
8806781a4f
Pod network outage Chaos scenario
...
The pod network outage chaos scenario blocks traffic at the pod level irrespective of the network policy used.
With the current network policies, it is not possible to explicitly block ports which are enabled
by an allowed network policy rule. This chaos scenario addresses this issue by using OVS flow rules
to block ports related to the pod. It supports OpenShiftSDN and OVNKubernetes based networks.
The example config below blocks access to the OpenShift console.
````
- id: pod_network_outage
  config:
    namespace: openshift-console
    direction:
      - ingress
    ingress_ports:
      - 8443
    label_selector: 'component=ui'
````
2023-05-15 10:43:58 -04:00
Tullio Sebastiani
83b811bee4
Arcaflow stress-ng hogs with parallelism support ( #418 )
...
* kubeconfig management for arcaflow + hogs scenario refactoring
* kubeconfig authentication parsing refactored to support arcaflow kubernetes deployer
* reimplemented all the hog scenarios to allow multiple parallel containers of the same scenarios
(eg. to stress two or more nodes in the same run simultaneously)
* updated documentation
* removed sysbench scenarios
* recovered cpu hogs
* updated requirements.txt
* updated config.yaml
* added gitleaks file for test fixtures
* imported sys and logging
* removed config_arcaflow.yaml
* updated readme
* refactored arcaflow documentation entrypoint
2023-05-15 09:45:16 -04:00
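The parallelism described above (stressing two or more nodes in the same run) can be illustrated with a small sketch. It assumes a thread pool and a stand-in scenario function. `run_in_parallel` and `fake_cpu_hog` are hypothetical names, and this is not how the arcaflow workflows are implemented.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List


def run_in_parallel(scenario: Callable[[str], str], nodes: List[str]) -> Dict[str, str]:
    """Run the same scenario against every node at once and collect results by node."""
    if not nodes:
        return {}
    with ThreadPoolExecutor(max_workers=len(nodes)) as pool:
        # map() yields results in the same order as the input nodes.
        results = pool.map(scenario, nodes)
        return dict(zip(nodes, results))


def fake_cpu_hog(node: str) -> str:
    # Stand-in for a real stress workload; it only reports the target node.
    return f"cpu hog finished on {node}"
```

For example, `run_in_parallel(fake_cpu_hog, ["node-a", "node-b"])` starts both runs before waiting for either to finish.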
Paige Rubendall
16ea18c718
Ibm plugin node scenario ( #417 )
...
* Node scenarios for ibmcloud
* adding openshift check info
2023-05-09 12:07:38 -04:00
Naga Ravi Chaitanya Elluri
1ab94754e3
Add missing parameters supported by container scenarios ( #415 )
...
Also renames retry_wait to expected_recovery_time to make it clear that
Kraken will exit with code 1 if the container doesn't recover within the
expected time.
Fixes https://github.com/redhat-chaos/krkn/issues/414
2023-05-05 13:02:07 -04:00
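The behaviour attached to `expected_recovery_time` can be sketched as a polling loop that returns exit code 1 once the deadline passes. This is an illustration only: `wait_for_recovery`, its `is_recovered` callback, and `poll_interval` are hypothetical and do not come from Kraken's code.

```python
import time
from typing import Callable


def wait_for_recovery(
    is_recovered: Callable[[], bool],
    expected_recovery_time: float,
    poll_interval: float = 0.05,
) -> int:
    """Poll until recovered or the deadline passes; return a process exit code."""
    deadline = time.monotonic() + expected_recovery_time
    while True:
        if is_recovered():
            return 0  # recovered in time
        if time.monotonic() >= deadline:
            return 1  # not recovered within the expected time
        time.sleep(poll_interval)
```

A caller would pass the returned value to `sys.exit` so that a slow recovery fails the run.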
Tullio Sebastiani
278b2bafd7
Kraken is pointing to a buggy kill-pod plugin implementation ( #416 )
2023-05-04 18:19:54 +02:00
Naga Ravi Chaitanya Elluri
bc863fa01f
Add support to check for critical alerts
...
This commit lets users opt in to checking for critical alerts firing in
the cluster after chaos, at the end of each scenario. A chaos scenario is
considered failed if the cluster is unhealthy, in which case the user can
start debugging to fix and harden the affected areas.
Fixes https://github.com/redhat-chaos/krkn/issues/410
2023-05-03 16:14:13 -04:00
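A rough sketch of the opt-in check described above, assuming alerts arrive as dictionaries carrying the `alertname`, `alertstate` and `severity` labels of the Prometheus `ALERTS` series. The helper names and the `check_critical_alerts` flag are hypothetical and not confirmed by the commit.

```python
from typing import Dict, List


def critical_alerts(alerts: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Keep only alerts that are firing with severity 'critical'."""
    return [
        alert
        for alert in alerts
        if alert.get("severity") == "critical" and alert.get("alertstate") == "firing"
    ]


def scenario_passed(alerts: List[Dict[str, str]], check_critical_alerts: bool) -> bool:
    """A scenario fails only when the check is enabled and a critical alert fires."""
    if not check_critical_alerts:
        return True
    return not critical_alerts(alerts)
```

Keeping the filter separate from the pass/fail decision makes it possible to log the offending alerts before failing.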
Naga Ravi Chaitanya Elluri
900ca74d80
Reorganize the content from https://github.com/startx-lab ( #346 )
...
Moving the content around installing kraken using helm to the
chaos in practice section of the guide to showcase how startx-lab
is deploying and leveraging Kraken.
2023-04-24 13:51:49 -04:00
Tullio Sebastiani
82b8df4e85
kill-pod plugin dependency pointing to specific commit
...
switched to redhat-chaos repo
2023-04-20 08:26:51 -04:00
Tullio Sebastiani
691be66b0a
kubeconfig_path in new_client_from_config
...
added clients in the same context of the config
2023-04-19 14:12:46 -04:00
Tullio Sebastiani
019b036f9f
renamed trigger word from /test to funtest ( #401 )
...
added quotes
renamed trigger to funtest
Co-authored-by: Naga Ravi Chaitanya Elluri <nelluri@redhat.com >
2023-04-10 09:30:53 -04:00
Paige Rubendall
13fa711c9b
adding privileged namespace ( #399 )
...
Co-authored-by: Naga Ravi Chaitanya Elluri <nelluri@redhat.com >
2023-04-06 16:18:57 -04:00
Naga Ravi Chaitanya Elluri
17f61625e4
Exit on critical alert failures
...
This commit captures and exits on a non-zero return code, i.e. when
critical alerts are fired
Fixes https://github.com/redhat-chaos/krkn/issues/396
2023-03-27 12:43:57 -04:00
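Capturing a non-zero return code and exiting on it might look like the following sketch. It assumes the check runs as a child process. `run_check` is a hypothetical name, and the commit does not say how Kraken invokes the check.

```python
import subprocess
import sys
from typing import List


def run_check(command: List[str]) -> int:
    """Run a check command and return its exit code without raising on failure."""
    completed = subprocess.run(command, check=False)
    return completed.returncode


# Typical use by a caller:
#     code = run_check([sys.executable, "check_alerts.py"])  # hypothetical script
#     if code != 0:
#         sys.exit(code)
```

Passing the child's code through unchanged lets a CI job distinguish a failed check from other errors.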
Tullio Sebastiani
3627b5ba88
cpu hog scenario + basic arcaflow documentation ( #391 )
...
typo
typo
updated documentation
fixed workflow map issue
2023-03-15 16:52:20 +01:00
Tullio Sebastiani
fee4f7d2bf
arcaflow integration ( #384 )
...
arcaflow library version
Co-authored-by: Tullio Sebastiani <tsebasti@redhat.com >
2023-03-08 12:01:03 +01:00
Tullio Sebastiani
0534e03c48
removed useless step that was failing ( #389 )
...
removed only old namespace file cat
Co-authored-by: Tullio Sebastiani <tsebasti@redhat.com >
2023-02-23 16:28:09 +01:00
Tullio Sebastiani
bb9a19ab71
removed blocking event check
2023-02-22 09:41:52 -05:00
Tullio Sebastiani
c5b9554de5
check user's authorization before running functional tests
...
check users authorization before running functional tests
removed useless checkout
step rename
typo in trigger
2023-02-21 12:38:34 -05:00
dependabot[bot]
e5f97434d3
Bump werkzeug from 2.0.3 to 2.2.3 ( #385 )
...
Bumps [werkzeug](https://github.com/pallets/werkzeug ) from 2.0.3 to 2.2.3.
- [Release notes](https://github.com/pallets/werkzeug/releases )
- [Changelog](https://github.com/pallets/werkzeug/blob/main/CHANGES.rst )
- [Commits](https://github.com/pallets/werkzeug/compare/2.0.3...2.2.3 )
---
updated-dependencies:
- dependency-name: werkzeug
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Naga Ravi Chaitanya Elluri <nelluri@redhat.com >
2023-02-20 14:34:31 -05:00
Tullio Sebastiani
8b18fa8a35
Github Action + functional tests (no *hog tests) ( #382 )
...
* Github Action + functional tests (no *hog tests)
* changed the trigger keyword to /test
* removed deprecated kill_pod scenario + added namespace to app_outage (new kill_pod)
* #365 : renamed ingress_namespace scenario to network_diagnostics
* requested team filter added
---------
Co-authored-by: Tullio Sebastiani <tullio.sebastiani@x3solutions.it >
2023-02-16 09:42:33 +01:00
Paige Rubendall
93686ca736
new quay image reference
2023-01-31 17:21:45 -05:00
Naga Ravi Chaitanya Elluri
64f4c234e9
Add prom token creation step
...
This enables compatibility with all OpenShift versions.
Reference PR by Paige in Cerberus: https://github.com/redhat-chaos/cerberus/pull/190 .
2023-01-31 12:36:09 -05:00
Naga Ravi Chaitanya Elluri
915cc5db94
Bump release version to v1.2.0
2023-01-19 12:03:46 -05:00
José Castillo Lema
493a8a245f
Docker provider for node actions ( #369 )
...
* Docker provider for node actions
* Adjusted dependencies and imports
* Update config_kind.yaml
Signed-off-by: José Castillo Lema <josecastillolema@gmail.com >
v1.2.0
2023-01-10 14:36:18 -05:00
José Castillo Lema
d76ab31155
OCM/ACM integration ( #370 )
...
* OCM support for ManagedClusters
* Updated docs and general adjustments
* Improved docs
* Improved docs2
* Removed io packet import
Signed-off-by: José Castillo Lema <josecastillolema@gmail.com >
* Removed time from imports
Signed-off-by: José Castillo Lema <josecastillolema@gmail.com >
* Removed duplicate logging import
Signed-off-by: José Castillo Lema <josecastillolema@gmail.com >
* Removed sys import
Signed-off-by: José Castillo Lema <josecastillolema@gmail.com >
* Update run.py
Signed-off-by: José Castillo Lema <josecastillolema@gmail.com >
Signed-off-by: José Castillo Lema <josecastillolema@gmail.com >
2023-01-10 08:58:17 -05:00
dependabot[bot]
bed40b0c6a
Bump setuptools from 63.4.1 to 65.5.1
...
Bumps [setuptools](https://github.com/pypa/setuptools ) from 63.4.1 to 65.5.1.
- [Release notes](https://github.com/pypa/setuptools/releases )
- [Changelog](https://github.com/pypa/setuptools/blob/main/CHANGES.rst )
- [Commits](https://github.com/pypa/setuptools/compare/v63.4.1...v65.5.1 )
---
updated-dependencies:
- dependency-name: setuptools
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
2023-01-04 18:42:52 +05:30
Paige Rubendall
3c5c3c5665
Giving more details on configuration ( #371 )
...
* giving more details on configuration
* adding few changes
2022-12-08 11:18:42 +05:30
Tullio Sebastiani
cf7bc28a2d
updated k8s/Openshift installation documentation ( #359 )
...
* Added some bits and pieces to the krkn k8s installation to make it easier
* updated k8s/Oc installation documentation
* gitignore
* doc reorg
* fixed numbering + removed italic
Co-authored-by: Tullio Sebastiani <tullio.sebastiani@x3solutions.it >
2022-11-30 23:02:17 +05:30
Paige Rubendall
4035f2724b
Adding wait duration for pods ( #368 )
...
* adding wait duration for pods
* adding kube apiserver with plugin schema
2022-11-18 07:43:26 +05:30
Naga Ravi Chaitanya Elluri
6b17dbdbb3
Allow users to set the listening address
...
This commit provides an option for the user to set the listening address
for the signal. This also fixes a security vulnerability.
Fixes https://github.com/redhat-chaos/krkn/issues/307
2022-11-08 15:59:57 -05:00
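To illustrate why a configurable listening address matters, the sketch below binds an HTTP listener to a caller-supplied address and defaults to loopback so it is not exposed on every interface. `SignalHandler` and `make_signal_server` are hypothetical names, and the response body is a placeholder rather than Kraken's actual signal protocol.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer


class SignalHandler(BaseHTTPRequestHandler):
    """Placeholder handler that answers every GET with a fixed body."""

    def do_GET(self):
        self.send_response(200)
        self.end_headers()
        self.wfile.write(b"RUN")


def make_signal_server(address: str = "127.0.0.1", port: int = 0) -> HTTPServer:
    """Bind only to the given address; port 0 lets the OS pick a free port."""
    return HTTPServer((address, port), SignalHandler)
```

Binding to `0.0.0.0` instead would accept connections on all interfaces, which is the exposure a configurable address lets users avoid.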
Naga Ravi Chaitanya Elluri
1c207538b6
Use run dir instead of tmp
...
This commit also logs a message to handle the exception during the
node checks.
Fixes https://github.com/redhat-chaos/krkn/issues/356 , https://github.com/redhat-chaos/krkn/issues/357
2022-11-08 15:46:08 -05:00
Naga Ravi Chaitanya Elluri
6ccc16a0ab
Use autoescape=True to mitigate XSS vulnerabilities
...
Fixes https://github.com/redhat-chaos/krkn/issues/354
2022-11-08 14:34:06 -05:00
Naga Ravi Chaitanya Elluri
b9d5a7af4d
Use safe loader for Yaml
...
This fixes security vulnerabilities: for example, the safe loader raises
an exception when opening a YAML file that contains code.
Fixes https://github.com/redhat-chaos/krkn/issues/352
2022-11-08 13:35:06 -05:00
Sandro Bonazzola
1c4a51cbfa
refactor: use arcaflow plugin
...
Signed-off-by: Sandro Bonazzola <sbonazzo@redhat.com >
2022-10-18 16:43:33 +02:00
Christophe LARUE
68c02135d3
Add helm and tekton examples
2022-10-18 09:41:24 -04:00
Naga Ravi Chaitanya Elluri
61700c0dc5
Bump release version to v1.1.1
2022-10-14 12:47:17 -04:00
Paige Rubendall
da749339f7
Adding scenarios sub folders to container creation ( #337 )
...
* adding scenarios sub folders to container creation
* adding req
* trying other package installations
* more specific versions
* removing vsphere
* adding wheel
* put vmware back
Fixes : #335
Signed-off-by: Sandro Bonazzola <sbonazzo@redhat.com >
Co-authored-by: Sandro Bonazzola <sbonazzo@redhat.com >
v1.1.1
2022-10-14 09:51:04 +02:00
Sandro Bonazzola
66eb541bfb
Docker: take main as 1.1.0 is now broken
...
Signed-off-by: Sandro Bonazzola <sbonazzo@redhat.com >
2022-10-14 08:51:04 +02:00
Sandro Bonazzola
6589e50743
require recent aliyun-python-sdk
...
This reduces the time pip needs to figure out which version to
install.
Signed-off-by: Sandro Bonazzola <sbonazzo@redhat.com >
2022-10-14 08:51:04 +02:00
Sandro Bonazzola
026fbd9987
test: check for control-plane label
...
previously the test was looking for master label.
Recent Kubernetes uses the control-plane label instead.
Signed-off-by: Sandro Bonazzola <sbonazzo@redhat.com >
2022-10-14 08:51:04 +02:00
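A label check that tolerates both conventions could be sketched as follows, using the well-known `node-role.kubernetes.io/control-plane` label and the older `node-role.kubernetes.io/master` label. The helper `is_control_plane` is hypothetical and is not the test's actual code.

```python
from typing import Mapping

CONTROL_PLANE_LABEL = "node-role.kubernetes.io/control-plane"
LEGACY_MASTER_LABEL = "node-role.kubernetes.io/master"


def is_control_plane(labels: Mapping[str, str]) -> bool:
    """Return True if the node carries the current or the legacy role label."""
    # Role labels are usually set with an empty value, so test for key presence.
    return CONTROL_PLANE_LABEL in labels or LEGACY_MASTER_LABEL in labels
```

Accepting both keys keeps such a check working on clusters that still carry the legacy label.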
Sandro Bonazzola
4357ce5386
adjust vmware requirement to latest tag
...
Require latest tag rather than main branch as main branch is broken.
Signed-off-by: Sandro Bonazzola <sbonazzo@redhat.com >
2022-10-14 08:51:04 +02:00