github/krkn - krkn - Gitea: Git with a nice cup of tea

mirror of https://github.com/krkn-chaos/krkn.git synced 2026-04-15 06:57:28 +00:00

Author	SHA1	Message	Date
Sahil Shah	82db2fca75	Removing Litmus Scenario	2023-11-16 09:50:04 -05:00
Naga Ravi Chaitanya Elluri	afe8d817a9	Print telemetry data location to stdout This commit also deprecates litmus integration.	2023-11-13 10:01:17 -05:00
Tullio Sebastiani	7a966a71d0	krkn integration of telemetry events collection (#523 ) * function package refactoring in krkn-lib * cluster events collection flag * krkn-lib version bump requirements * dockerfile bump	2023-10-31 14:31:33 -04:00
Tullio Sebastiani	27fabfd4af	OCP/K8S functionalities and packages splitting in `krkn-lib` (#507 ) * krkn-lib ocp/k8s split adaptation * library reference updated * requirements update * rebase with main + fix	2023-10-30 17:31:48 +01:00
jtydlack	ff469579e9	Use function get_yaml_item_value Enables using default even though the value was loaded as None.	2023-10-24 14:55:49 -04:00
Paige Rubendall	f7f1b2dfb0	Service disruption (#494 ) * adding service disruption * fixing kil services * service log changes * remvoing extra logging * adding daemon set * adding service disruption name changes * cerberus config back * bad string	2023-10-06 12:51:10 -04:00
Tullio Sebastiani	61356fd70b	Added log telemetry piece to Krkn (#500 ) * config * log collection and upload dictionary key fix * escape regex in config.yaml * bump krkn-lib version * updated funtest github cli command * update krkn-lib version to 1.3.2 * fixed requirements.txt	2023-10-06 10:08:46 -04:00
Tullio Sebastiani	782d04c1b1	Prints the telemetry json after sending it to the webservice (#479 ) * prints telemetry json after sending it to the service deserialized base64 parameters * json output even if telemetry collection is disabled.	2023-09-25 12:00:08 -04:00
Tullio Sebastiani	f868000ebd	Switched from krkn_lib_kubernetes to krkn_lib v1.0.0 (#469 ) * changed all the references to krkn_lib_kubernetes to the new krkn_lib changed all the references * added krkn-lib pointer in documentation	2023-08-22 12:41:40 -04:00
Tullio Sebastiani	39c0152b7b	Krkn telemetry integration (#435 ) * adapted config.yaml to the new feature * temporarly pointing requirement.txt to the lib feature branch * run_kraken.py + arcaflow scenarios refactoring typo * plugin scenario * node scenarios return failed scenarios * container scenarios fix * time scenarios * cluster shutdown scenarios * namespace scenarios * zone outage scenarios * app outage scenarios * pvc scenarios * network chaos scenarios * run_kraken.py adaptation to telemetry * prometheus telemetry upload + config.yaml some fixes typos and logs max retries in config telemetry id with run_uuid safe_logger * catch send_telemetry exception * scenario collection bug fixes * telemetry enabled check * telemetry run tag * requirements pointing to main + archive_size * requirements.txt and config.yaml update * added telemetry config to common config * fixed scenario array elements for telemetry	2023-08-10 14:42:53 -04:00
José Castillo Lema	570631ebfc	Widen except (#457 ) Signed-off-by: José Castillo Lema <josecastillolema@gmail.com>	2023-07-26 18:53:52 +02:00
Naga Ravi Chaitanya Elluri	ce409ea6fb	Update kube-burner dependency version to 1.7.0	2023-06-15 11:55:17 -04:00
Tullio Sebastiani	68dc17bc44	krkn-lib-kubernetes refactoring proposal (#400 ) * run_kraken.py updated + renamed kubernetes library folder unstaged files kubecli marker * container scenarios updated * node scenarios updated typo injected kubecli * managed cluster scenarios updated * time scenarios updated * litmus scenarios updated * cluster scenarios updated * namespace scenarios updated * pvc scenarios updated * network chaos scenarios updated * common_managed_cluster functions updated * switched draft library to official one * regression on rebase	2023-06-13 10:02:35 -04:00
Naga Ravi Chaitanya Elluri	572eeefaf4	Minor fixes This commit fixes few typos and duplicate logs	2023-06-12 21:05:27 -04:00
José Castillo Lema	a7938e58d2	Allow kraken to run with environment variables instead of kubeconfig file (#429 ) * Include check for inside k8s scenario * Include check for inside k8s scenario (2) * Include check for inside k8s scenario (3) * Include check for inside k8s scenario (4)	2023-06-01 14:43:01 -04:00
yogananth-subramanian	8806781a4f	Pod network outage Chaos scenario Pod network outage chaos scenario blocks traffic at pod level irrespective of the network policy used. With the current network policies, it is not possible to explicitly block ports which are enabled by allowed network policy rule. This chaos scenario addresses this issue by using OVS flow rules to block ports related to the pod. It supports OpenShiftSDN and OVNKubernetes based networks. Below example config blocks access to openshift console. ```` - id: pod_network_outage config: namespace: openshift-console direction: - ingress ingress_ports: - 8443 label_selector: 'component=ui' ````	2023-05-15 10:43:58 -04:00
Naga Ravi Chaitanya Elluri	bc863fa01f	Add support to check for critical alerts This commit enables users to opt in to check for critical alerts firing in the cluster post chaos at the end of each scenario. Chaos scenario is considered as failed if the cluster is unhealthy in which case user can start debugging to fix and harden respective areas. Fixes https://github.com/redhat-chaos/krkn/issues/410	2023-05-03 16:14:13 -04:00
Tullio Sebastiani	fee4f7d2bf	arcaflow integration (#384 ) arcaflow library version Co-authored-by: Tullio Sebastiani <tsebasti@redhat.com>	2023-03-08 12:01:03 +01:00
José Castillo Lema	d76ab31155	OCM/ACM integration (#370 ) * OCM support for ManagedClusters * Updated docs and general adjustments * Improved docs * Improved docs2 * Removed io packet import Signed-off-by: José Castillo Lema <josecastillolema@gmail.com> * Removed time from imports Signed-off-by: José Castillo Lema <josecastillolema@gmail.com> * Removed duplicate logging import Signed-off-by: José Castillo Lema <josecastillolema@gmail.com> * Removed sys import Signed-off-by: José Castillo Lema <josecastillolema@gmail.com> * Update run.py Signed-off-by: José Castillo Lema <josecastillolema@gmail.com> Signed-off-by: José Castillo Lema <josecastillolema@gmail.com>	2023-01-10 08:58:17 -05:00
Paige Rubendall	4035f2724b	Adding wait duration for pods (#368 ) * adding wait duration for pods * adding kube apiserver with plugin schema	2022-11-18 07:43:26 +05:30
Naga Ravi Chaitanya Elluri	6b17dbdbb3	Allow users to set the listening address This commit provides an option for the user to set the listening address for the signal. This also fixes a security vulnerability. Fixes https://github.com/redhat-chaos/krkn/issues/307	2022-11-08 15:59:57 -05:00
Sandro Bonazzola	80829fcafe	run_kraken.py: resolve ~ with kubeconfig as we default to ~ for kubeconfig, we need to be able to read it. Signed-off-by: Sandro Bonazzola <sbonazzo@redhat.com>	2022-09-13 12:01:16 +02:00
Paige Rubendall	9de6c7350e	adding stringio for security reasons	2022-09-12 11:14:08 -04:00
Sandro Bonazzola	155269fd9d	pycodestyle fixes: run_kraken.py Other than plain style changes, introduced constants `KUBE_BURNER_URL` and `KUBE_BURNER_VERSION` solving the problem of having a too long string and at the same time make it easier to bump the requirement on Kube Burner. Signed-off-by: Sandro Bonazzola <sbonazzo@redhat.com>	2022-09-05 10:25:59 +02:00
Naga Ravi Chaitanya Elluri	412d718985	Fix code alignment	2022-08-25 11:32:19 -04:00
Naga Ravi Chaitanya Elluri	6c75d3dddb	Add option to skip litmus installation This commit adds an option for the user to pick whether to install litmus or not depending on their use case. One use case is disconnected environments where litmus is pre-installed insted of reaching out to the internet.	2022-08-23 14:09:10 -04:00
Janos Bonic	ccd902565e	Fixes #265 : Replace Powerfulseal and introduce Wolkenwalze SDK for plugin system	2022-08-02 16:25:03 +01:00
Naga Ravi Chaitanya Elluri	9208f39e06	Add support to run on Kubernetes This commit: - Leverages distribution flag in the config set by the user to skip things not supported on OpenShift to be able to run scenarios on Kubernetes. - Adds sample config and scenario files that work on Kubernetes.	2022-06-01 07:27:06 -05:00
Naga Ravi Chaitanya Elluri	90e1f20d50	Add support for setting performance-dashboards on Kubernetes	2022-05-25 09:45:39 -04:00
Sanja Bonic	f52f16ade8	Starts fixing small parts of issue #185	2022-05-18 20:01:59 +02:00
Adolfo Aguirrezabal	3adf5847b2	Add option to avoid litmus uninstall before chaos run (#242 ) * Adds option to avoid litmus uninstall before chaos run * Add new option to the config files	2022-05-05 09:02:25 -04:00
Paige Rubendall	7f60701444	adding alibaba node scenario start	2022-04-01 16:46:29 -04:00
yogananth-subramanian	50dd9873c1	Node egress traffic shaping Patch adds a scenario to create variations in egress traffic of a Node's interface using the tc and Netem.	2021-12-16 12:54:53 -05:00
Alejandro Gullón	baa812b7f0	Added new scenario to fill up a given volumen (#182 ) * Added new scenario to fill up a given volumen * fixing small issues and style * adding PVC as input param instead of pod name * small fix * get container name and volumen name replace oc with kubectl commands * adding yaml file to create a pv, pvc and pod to run pvc_scenario * adding support to match both string for describe command when looking for pod_name * added support to find the pvc from a given pod * small fix * small fix	2021-11-24 12:18:49 -05:00
Naga Ravi Chaitanya Elluri	674eb74a75	Expose setting the signal in the config This commit enables users to start Kraken to act as listener by setting the signal to PAUSE in the config to get the cluster to a desired test or run any setup before injecting chaos by setting the signal to RUN. This helps in cases where we have test cases that need to coordinate the chaos at a desired time depending on the state of the cluster/test run.	2021-10-26 09:05:25 -04:00
Paige Rubendall	6b865fc573	Adding server set up for kraken	2021-10-25 08:58:46 -04:00
Naga Ravi Chaitanya Elluri	cdf3bc03d2	Add support to block traffic to an application This commit enables users to simulate a downtime of an application by blocking the traffic for the specified duration to see how it/other components communicating with it behave in case of downtime.	2021-10-01 10:13:40 -04:00
Paige Rubendall	22df024312	adding validation that namespace becomes active	2021-09-28 09:58:55 -04:00
Naga Ravi Chaitanya Elluri	f36da323e7	Prioritize filtering on namespace to improve performance This will avoid querying all namespaces for pods matching the label_selector if defined as shown in the sample scenario config. This commit also prints a pointer to the report generated at the end of the run.	2021-09-22 15:03:39 -04:00
Naga Ravi Chaitanya Elluri	036e51a6b1	Delete litmus crd's during the cleanup This commit will ensure that the litmus resources installed on the cluster get cleaned up and also creates the chaosengine in the specified namespace.	2021-09-16 16:30:21 -04:00
Paige Rubendall	a9056ddf43	adding litmus logging	2021-09-08 17:11:49 -04:00
Naga Ravi Chaitanya Elluri	5da0b259c5	Run all the litmus resources in a single namespace - This eases the usage and debuggability by running the fault injection pods in the same namespace as other resources of litmus. This will also ease the deletion process and ensure that there are no leftover objects on the cluster. - This commit also enables users to use the same rbac template for all the litmus scenarios without having to pull in a specic one for each of the scenarios.	2021-09-08 16:37:07 -04:00
Naga Ravi Chaitanya Elluri	6456eec76a	Add zone outage scenarios This commit adds support to create zone outage in AWS by denying both ingress and egress traffic to the instances belonging to a particular subnet belonging to the zone by tweaking the network acl. This creates an outage of all the nodes in the zone - both master and workers.	2021-08-17 11:43:13 -04:00
Naga Ravi Chaitanya Elluri	716057eab6	Monitor user application availability during chaos Current Kraken integration with Cerberus monitors the cluster as well as the application health post chaos and pass/fails if they are not healthy after chaos. This commit adds ability to monitor the user application health during the chaos and fails the run in case of downtime as it's potentially a downtime in case of customers environment as well. It is especially useful in case of control plane failure scenarios including API server, Etcd, Ingress etc.	2021-07-27 13:15:57 -04:00
Naga Ravi Chaitanya Elluri	c0b9cb46da	Improve error handling This commit: - Adds timeout to avoid operations hanging for long durations. - Improves exception handling and exits wherever needed. - Sets KUBECONFIG env var globoally to access the cluster.	2021-07-21 12:48:06 -04:00
Paige Rubendall	f051c1c30f	Merge pull request #120 from paigerube14/container_kill Container kill	2021-07-15 15:07:58 -04:00
prubenda	76efac8f9b	Adding delete of namespaces	2021-07-13 13:31:45 -04:00
prubenda	46a1823291	Adding killing of specific containers in pods	2021-07-08 17:10:48 -04:00
Naga Ravi Chaitanya Elluri	e195922504	Document pip version and add more logging	2021-07-07 09:49:52 -04:00
prubenda	5456fce924	Adding getting started docs	2021-06-23 13:58:43 -04:00

1 2

79 Commits