Commit Graph

94 Commits

Author SHA1 Message Date
Naga Ravi Chaitanya Elluri
590edff63b Avoid namespace context switch
There are cases where the kubeconfig can be read only like when running
Kraken as a kubernetes deployment. This commit fixes the instances to
use -n flag instead of a namespace context switch.
2021-07-27 11:31:32 -04:00
Naga Ravi Chaitanya Elluri
e9f5961986 [Docs] Add instructions on how to mount custom scenarios 2021-07-26 09:57:11 -04:00
koflerm
304f606b2b Use jsonpath to retrieve pod nodename (#129) 2021-07-23 20:08:06 -04:00
Naga Ravi Chaitanya Elluri
c0b9cb46da Improve error handling
This commit:
- Adds timeout to avoid operations hanging for long durations.
- Improves exception handling and exits wherever needed.
- Sets KUBECONFIG env var globoally to access the cluster.
2021-07-21 12:48:06 -04:00
Paige Rubendall
f051c1c30f Merge pull request #120 from paigerube14/container_kill
Container kill
2021-07-15 15:07:58 -04:00
prubenda
76efac8f9b Adding delete of namespaces 2021-07-13 13:31:45 -04:00
prubenda
46a1823291 Adding killing of specific containers in pods 2021-07-08 17:10:48 -04:00
Naga Ravi Chaitanya Elluri
b75b6e0042 Increase the granularity of cerberus checks
This commit modifies the wait time from 60 seconds to 3 seconds between
each of the requests to the API to capture the components state at a more
granular level by default.
2021-07-08 16:59:33 -04:00
Naga Ravi Chaitanya Elluri
d7ba19c382 Automate the infrastruture pieces
This commit:
- Adds support to automate the infrastructure pieces leveraged by Kraken
  including Cerberus and Elasticsearch
- Adds a Kraken config that can be used to discover all the infra pieces
  automatically without having to tweak the configuration.
2021-07-07 15:52:26 -04:00
Naga Ravi Chaitanya Elluri
e195922504 Document pip version and add more logging 2021-07-07 09:49:52 -04:00
Jared O'Connell
9b83dbcf04 Baremetal Node Support (#74)
* Support for baremtal node scenarious

* Finished baremetal support

* Added documentation for baremetal

* Clarify limitations of implementation in documentation

* Add baremetal support to new run.py file

* Allow use on newer machines

Some older machines require lanplus instead of lan

* Setup to allow per-device user, pass, and bmc address

Also set min version for a dependency

* Fix linting issues

* More linting issue fixes

* More linter issues

* Account for linter standard non-conformity

* Added baremetal warning

Co-authored-by: jaredoconnell <jocnnel@redhat.com>
2021-07-02 17:31:40 -04:00
Paige Rubendall
0afcd22f66 Merge pull request #115 from chaitanyaenr/workflow
Update the workflow
2021-06-23 14:00:04 -04:00
prubenda
5456fce924 Adding getting started docs 2021-06-23 13:58:43 -04:00
Naga Ravi Chaitanya Elluri
d1ae298692 Update the workflow
This commit modifies the workflow diagram to add pieces that are
leveraged to determine pass/fail of the chaos scnearios.
2021-06-23 12:41:38 -04:00
prubenda
41bf815f98 Adding shut down scenario for gcp, az, aws, openstack 2021-06-23 09:00:58 -04:00
Naga Ravi Chaitanya Elluri
e30a4243f6 Add support to alerting on metrics evaluation
This commit enables alerting in Kraken based on the Prometheus queries defined
by the user and modifies the return code of the run to determine pass/fail for
the run.
2021-06-22 15:22:37 -04:00
Naga Ravi Chaitanya Elluri
7e8f0450d6 Add support to scrape and index metrics
This commit:
- Enables Kraken to leverage kube-burner to scrape metrics from
  Prometheus and index them into Elasticsearch. This way we can
  take a look at the metrics in Grafana long term even after the
  cluster is terminated.
- Enables separation of operations based on distribution with
  OpenShift as the default option. One of the use cases is to
  capture Prometheus instance details as it's installed by default
  while it's optional for Kubernetes.
2021-06-21 14:55:50 -04:00
Naga Ravi Chaitanya Elluri
871eb3d74e Avoid circular dependencies
This commit deletes unneeded imports and fixes the circular dependency
issues.
2021-06-17 11:18:34 -04:00
Robert O'Brien
56de5c76a9 Added selinux label to the docker run install command 2021-06-17 08:30:58 -04:00
Naga Ravi Chaitanya Elluri
5c2453b07e Refactor code base
This commit:
- Refactors the code base to be more modular by moving functions
  into respective modules to make it lean and reusable.
- Uses black to reformat the code to follow PEP 8 practices.
2021-06-14 17:41:10 -04:00
Ryan Drew
8d9faf7033 Correct license from MIT to Apache 2.0 2021-06-08 18:52:42 -04:00
Naga Ravi Chaitanya Elluri
e671754e66 Add dependencies needed for performance monitoring 2021-05-06 17:26:09 -04:00
Paige Rubendall
190cf5d462 Blank node name error message (#97)
* adding contribute doc

* Fixing blank node name param printing off incorrect data
2021-05-06 10:13:17 -04:00
Amit Sagtani
d00d6ec69e Install pre-commit and use GitHub Actions (#94)
* added pre-commit and code-cleaning

* removed tox and TravisCI
2021-05-05 09:53:45 -04:00
Naga Ravi Chaitanya Elluri
70b14956c7 Docs: Add pointer to the litmus based scenarios 2021-05-03 10:01:07 -04:00
prubenda
a14f4c2561 Adding wait in litmus scenario before cerberus integration 2021-04-20 14:45:31 -04:00
Naga Ravi Chaitanya Elluri
93ec3bc14b Fix dependency issues
This commit installs setuptools and tox as part of the requirements.
2021-04-16 08:25:12 -04:00
Barry
90e881672e Modifying the Dockerfile to add the needed pip3 update 2021-04-14 13:14:48 -04:00
Mike Fiedler
5470612309 Merge pull request #86 from paigerube14/apiserver_labels
Apiserver labels
2021-04-13 16:04:29 -04:00
prubenda
e1320b0d7b adding update to openshift apiserver label for pod scenario 2021-04-13 10:42:59 -04:00
prubenda
4c2d1b989c adding contribute doc 2021-04-08 19:22:37 -04:00
Naga Ravi Chaitanya Elluri
db42f054ba Add pointer to the new blog
This commit:
- Adds a pointer to a new blog which emphasizes the importance
  of making chaos part of Perf/Scale test runs.
- Bumps up the allowed max-line-length for the linters.
2021-03-22 19:37:20 -04:00
Mike Fiedler
6dc06c1c57 Merge pull request #40 from paigerube14/az_nodes
Az nodes
2021-03-17 17:50:24 -04:00
prubenda
c7bb32f633 Adding azure to node scenarios 2021-03-17 17:41:07 -04:00
Mike Fiedler
5456b7204a Merge pull request #75 from chaitanyaenr/typo
Fix the link in the docs
2021-03-16 09:10:48 -04:00
Naga Ravi Chaitanya Elluri
576227189d Fix the link in the docs 2021-03-16 09:00:49 -04:00
Mike Fiedler
49cb37e28d Merge pull request #73 from paigerube14/contribute_doc
adding contribute doc
2021-02-25 12:15:14 -05:00
prubenda
387d6921a6 adding contribute doc 2021-02-25 10:33:55 -05:00
Pravin Dsilva
807d96ae9c Dockerfile for ppc64le
Signed-off-by: Pravin Dsilva <pravin.d-silva@ibm.com>
2021-02-17 12:29:27 -05:00
Mike Fiedler
bfe9448c6a Merge pull request #68 from pravin-dsilva/bastion_scenario
Add node level chaos scenarios for bastion node
2021-02-16 15:31:23 -05:00
Pravin Dsilva
918b5fb6d3 Add node level chaos scenarios for bastion node
Signed-off-by: Pravin Dsilva <pravin.d-silva@ibm.com>
2021-02-16 09:04:55 -08:00
Naga Ravi Chaitanya Elluri
a7e28ca490 Add support to deploy performance dashboards
This commit enables performance monitoring on the cluster when
running Kraken to be able to observe how cluster reacts to failures
as it's important to make sure the cluster is healthy in terms of
both recovery as well as performance.
2021-02-10 16:06:55 -05:00
mjulie
a42adf89e8 Add pod scenarios for custom app
Signed-off-by: mjulie <mjulie@in.ibm.com>
2021-02-04 11:24:05 -05:00
mjulie
488aa826e4 Add pod scenarios for custom app
Signed-off-by: mjulie <mjulie@in.ibm.com>
2021-02-04 11:24:05 -05:00
mjulie
9df350a189 Add pod scenarios for custom app
Signed-off-by: mjulie <mjulie@in.ibm.com>
2021-02-04 11:24:05 -05:00
Mike Fiedler
a7b69797a3 Merge pull request #66 from chaitanyaenr/demo
Add pointers to helpful resources around Kraken
2021-02-03 09:35:39 -05:00
Naga Ravi Chaitanya Elluri
12201a32c7 Add pointers to helpful resources around Kraken 2021-02-02 21:17:49 -05:00
Mike Fiedler
ca44f53d62 Merge pull request #61 from arcprabh/arc_openstack_cloud
Enable support for Openstack cloud.
2021-02-02 13:11:43 -05:00
arcprabh
8dd18af161 Enable support for Openstack cloud.
Signed-off-by: arcprabh <arcprabh@in.ibm.com>

Incorporated first round of review comments

Signed-off-by: arcprabh <arcprabh@in.ibm.com>

Resolve multiple node name issue for single ip

Signed-off-by: arcprabh <arcprabh@in.ibm.com>
2021-02-02 20:47:30 +05:30
Mike Fiedler
183313804d Merge pull request #33 from paigerube14/litmus_1
Litmus integration with node cpu hog experiment
2020-12-03 12:48:26 -05:00