In this section, we set up Portworx to provide dynamic provisioning of persistent volumes. Then we use it to deploy a PostgreSQL Stateful set. Finally, we simulate a node failure and observe the failover.
Highly available Persistent Volumes

- How can we achieve true durability?

- How can we store data that would survive the loss of a node?

--

- We need to use Persistent Volumes backed by highly available storage systems

- There are many ways to achieve that:

  - leveraging our cloud's storage APIs

  - using NAS/SAN systems or file servers

  - distributed storage systems

--

- We are going to see one distributed storage system in action
Our test scenario

- We will set up a distributed storage system on our cluster

- We will use it to deploy a SQL database (PostgreSQL)

- We will insert some test data in the database

- We will disrupt the node running the database

- We will see how it recovers
Portworx

- Portworx is a commercial persistent storage solution for containers

- It works with Kubernetes, but also Mesos, Swarm ...

- It provides hyper-converged storage

  (= storage is provided by regular compute nodes)

- We're going to use it here because it can be deployed on any Kubernetes cluster

  (it doesn't require any particular infrastructure)

- We don't endorse or support Portworx in any particular way

  (but we appreciate that it's super easy to install!)
Portworx requirements

- Kubernetes cluster ✔️

- Optional key/value store (etcd or Consul) ❌

- At least one available block device ❌
The key-value store

- In the current version of Portworx (1.4) it is recommended to use etcd or Consul

- But Portworx also has beta support for an embedded key/value store

- For simplicity, we are going to use the latter option

  (but we could use the Consul service that we deployed earlier, too)
One available block device

- Block device = disk or partition on a disk

- We can see block devices with `lsblk`

  (or `cat /proc/partitions` if we're old school like that!)

- If we don't have a spare disk or partition, we can use a loop device

- A loop device is a block device actually backed by a file

- These are frequently used to mount ISO (CD/DVD) images or VM disk images
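For instance, either of the two commands mentioned above lists the block devices visible on the current node:

```bash
# Show block devices with their size, type, and mount point
lsblk

# A more spartan view of the same devices
cat /proc/partitions
```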
Setting up a loop device

- We are going to create a 10 GB (empty) file on each node

- Then make a loop device from it, to be used by Portworx

.exercise[

- Create a 10 GB file on each node:

  ```bash
  for N in $(seq 1 5); do ssh node$N sudo truncate --size 10G /portworx.blk; done
  ```

  (If SSH asks to confirm host keys, enter `yes` each time.)

- Associate the file to a loop device on each node:

  ```bash
  for N in $(seq 1 5); do ssh node$N sudo losetup /dev/loop0 /portworx.blk; done
  ```

]
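To double-check that each node now exposes the loop device, we can query it with `lsblk` (this assumes the same `node1` ... `node5` naming used above):

```bash
# Each node should report a 10G loop0 block device
for N in $(seq 1 5); do ssh node$N lsblk /dev/loop0; done
```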
Installing Portworx

- To install Portworx, we need to go to https://install.portworx.com/

- This website will ask us a bunch of questions about our cluster

- Then, it will generate a YAML file that we should apply to our cluster

--

- Or, we can just apply that YAML file directly (it's in `k8s/portworx.yaml`)

.exercise[

- Install Portworx:

  ```bash
  kubectl apply -f ~/container.training/k8s/portworx.yaml
  ```

]
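Before moving on, we can check that the Portworx pods come up on the nodes (they run in the `kube-system` namespace, with the `name=portworx` label used again in the troubleshooting slides at the end of this section):

```bash
# The Portworx pods should eventually reach the Running state
# (the first start can take a few minutes)
kubectl -n kube-system get pods -l name=portworx -o wide
```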
class: extra-details

Generating a custom YAML file

If you want to generate a YAML file tailored to your own needs, the easiest way is to use https://install.portworx.com/.

FYI, this is how we obtained the YAML file used earlier:

```bash
KBVER=$(kubectl version -o json | jq -r .serverVersion.gitVersion)
BLKDEV=/dev/loop0
curl "https://install.portworx.com/1.4/?kbver=$KBVER&b=true&s=$BLKDEV&c=px-workshop&stork=true&lh=true"
```

If you want to use an external key/value store, add one of the following to the URL:

```
&k=etcd://`XXX`:2379
&k=consul://`XXX`:8500
```

... where `XXX` is the name or address of your etcd or Consul server.
Dynamic provisioning of persistent volumes

- We are going to run PostgreSQL in a Stateful set

- The Stateful set will specify a `volumeClaimTemplate`

- That `volumeClaimTemplate` will create Persistent Volume Claims

- Kubernetes' dynamic provisioning will satisfy these Persistent Volume Claims

  (by creating Persistent Volumes and binding them to the claims)

- The Persistent Volumes are then available for the PostgreSQL pods
Storage Classes

- It's possible that multiple storage systems are available

- Or, that a storage system offers multiple tiers of storage

  (SSD vs. magnetic; mirrored or not; etc.)

- We need to tell Kubernetes which system and tier to use

- This is achieved by creating a Storage Class

- A `volumeClaimTemplate` can indicate which Storage Class to use

- It is also possible to mark a Storage Class as "default"

  (it will be used if a `volumeClaimTemplate` doesn't specify one)
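For reference, this is roughly what selecting a specific Storage Class looks like in a claim. This is a minimal sketch (the claim name and size are arbitrary; the class name matches the one we create on the next slides):

```yaml
# Hypothetical stand-alone Persistent Volume Claim pinned to a Storage Class
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: my-claim
spec:
  storageClassName: portworx-replicated
  accessModes: ["ReadWriteOnce"]
  resources:
    requests:
      storage: 1Gi
```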
Our default Storage Class

This is our Storage Class (in `k8s/storage-class.yaml`):

```yaml
kind: StorageClass
apiVersion: storage.k8s.io/v1beta1
metadata:
  name: portworx-replicated
  annotations:
    storageclass.kubernetes.io/is-default-class: "true"
provisioner: kubernetes.io/portworx-volume
parameters:
  repl: "2"
  priority_io: "high"
```

- It says "use Portworx to create volumes"

- It tells Portworx "keep 2 replicas of these volumes"

- It marks the Storage Class as being the default one
Creating our Storage Class

- Let's apply that YAML file!

.exercise[

- Create the Storage Class:

  ```bash
  kubectl apply -f ~/container.training/k8s/storage-class.yaml
  ```

- Check that it is now available:

  ```bash
  kubectl get sc
  ```

]

It should show as `portworx-replicated (default)`.
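To see the details behind that one-line summary (the provisioner, the `repl: "2"` parameter, and the default-class annotation), we can also describe the Storage Class:

```bash
# Show the provisioner, parameters, and annotations of our Storage Class
kubectl describe sc portworx-replicated
```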
Our Postgres Stateful set

- The next slide shows `k8s/postgres.yaml`

- It defines a Stateful set

- With a `volumeClaimTemplate` requesting a 1 GB volume

- That volume will be mounted to `/var/lib/postgresql`

- There is another little detail: we enable the `stork` scheduler

- The `stork` scheduler is optional (it's specific to Portworx)

- It helps the Kubernetes scheduler to colocate the pod with its volume

  (see this blog post for more details about that)
.small[

```yaml
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: postgres
spec:
  selector:
    matchLabels:
      app: postgres
  serviceName: postgres
  template:
    metadata:
      labels:
        app: postgres
    spec:
      schedulerName: stork
      containers:
      - name: postgres
        image: postgres:10.5
        volumeMounts:
        - mountPath: /var/lib/postgresql
          name: postgres
  volumeClaimTemplates:
  - metadata:
      name: postgres
    spec:
      accessModes: ["ReadWriteOnce"]
      resources:
        requests:
          storage: 1Gi
```

]
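Note that the Stateful set declares `serviceName: postgres`: Kubernetes expects a headless Service of that name to govern the pods' stable network identity. It is not shown on the slide; a minimal sketch (hypothetical, not copied from `k8s/postgres.yaml`) would look like this:

```yaml
# Hypothetical headless Service matching the Stateful set above
apiVersion: v1
kind: Service
metadata:
  name: postgres
spec:
  clusterIP: None        # headless: no virtual IP, just DNS records per pod
  selector:
    app: postgres
  ports:
  - name: postgres
    port: 5432
```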
Creating the Stateful set

- Before applying the YAML, watch what's going on with `kubectl get events -w`

.exercise[

- Apply that YAML:

  ```bash
  kubectl apply -f ~/container.training/k8s/postgres.yaml
  ```

]
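We can also watch dynamic provisioning do its job: the `volumeClaimTemplate` generates a claim named after the template and the pod (so it should be `postgres-postgres-0`), which gets bound to a freshly created Persistent Volume:

```bash
# The claim should go from Pending to Bound once Portworx provisions the volume
kubectl get pvc

# The matching Persistent Volume is created automatically
kubectl get pv
```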
Testing our PostgreSQL pod

- We will use `kubectl exec` to get a shell in the pod

- Good to know: we need to use the `postgres` user in the pod

.exercise[

- Get a shell in the pod, as the `postgres` user:

  ```bash
  kubectl exec -ti postgres-0 su postgres
  ```

- Check that default databases have been created correctly:

  ```bash
  psql -l
  ```

]

(This should show us 3 lines: postgres, template0, and template1.)
Inserting data in PostgreSQL

- We will create a database and populate it with `pgbench`

.exercise[

- Create a database named `demo`:

  ```bash
  createdb demo
  ```

- Populate it with `pgbench`:

  ```bash
  pgbench -i -s 10 demo
  ```

]

- The `-i` flag means "create tables"

- The `-s 10` flag means "create 10 x 100,000 rows"
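If you are curious about what `pgbench -i` actually created, you can list its tables from the same shell (it creates four of them; `pgbench_accounts` is by far the largest):

```bash
# List the tables created by pgbench in the demo database
psql demo -c '\dt'
```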
Checking how much data we have now

- The `pgbench` tool inserts rows in table `pgbench_accounts`

.exercise[

- Check that the `demo` base exists:

  ```bash
  psql -l
  ```

- Check how many rows we have in `pgbench_accounts`:

  ```bash
  psql demo -c "select count(*) from pgbench_accounts"
  ```

]

(We should see a count of 1,000,000 rows.)
Find out which node is hosting the database

- We can find that information with `kubectl get pods -o wide`

.exercise[

- Check the node running the database:

  ```bash
  kubectl get pod postgres-0 -o wide
  ```

]

We are going to disrupt that node.

--

By "disrupt" we mean: "disconnect it from the network".
Disconnect the node

- We will use `iptables` to block all traffic exiting the node

  (except SSH traffic, so we can repair the node later if needed)

.exercise[

- SSH to the node to disrupt:

  ```bash
  ssh `nodeX`
  ```

- Allow SSH traffic leaving the node, but block all other traffic:

  ```bash
  sudo iptables -I OUTPUT -p tcp --sport 22 -j ACCEPT
  sudo iptables -I OUTPUT 2 -j DROP
  ```

]
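To make sure the two rules landed in the intended order (the SSH `ACCEPT` must come before the blanket `DROP`), we can list the OUTPUT chain with rule numbers:

```bash
# Rule 1 should be the ACCEPT for source port 22, rule 2 the DROP
sudo iptables -L OUTPUT --line-numbers
```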
Check that the node is disconnected

.exercise[

- Check that the node can't communicate with other nodes:

  ```bash
  ping -c 3 node1
  ```

- Logout to go back on `node1`

- Watch the events unfolding with `kubectl get events -w` and `kubectl get pods -w`

]

- It will take some time for Kubernetes to mark the node as unhealthy

- Then it will attempt to reschedule the pod to another node

- In about a minute, our pod should be up and running again
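We can also keep an eye on the node objects themselves; after a short while, the disconnected node should be reported as `NotReady`:

```bash
# Watch node conditions change as Kubernetes detects the failure
kubectl get nodes -w
```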
Check that our data is still available

- We are going to reconnect to the (new) pod and check

.exercise[

- Get a shell on the pod:

  ```bash
  kubectl exec -ti postgres-0 su postgres
  ```

- Check the number of rows in the `pgbench_accounts` table:

  ```bash
  psql demo -c "select count(*) from pgbench_accounts"
  ```

]
Double-check that the pod has really moved

- Just to make sure the system is not bluffing!

.exercise[

- Look on which node the pod is now running:

  ```bash
  kubectl get pod postgres-0 -o wide
  ```

]
Re-enable the node

- Let's fix the node that we disconnected from the network

.exercise[

- SSH to the node:

  ```bash
  ssh `nodeX`
  ```

- Remove the iptables rule blocking traffic:

  ```bash
  sudo iptables -D OUTPUT 2
  ```

]
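A quick way to confirm that connectivity is restored is to repeat the earlier ping from that node before logging out:

```bash
# From the previously disconnected node: this should get replies again
ping -c 3 node1
```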
class: extra-details

A few words about this PostgreSQL setup

- In a real deployment, you would want to set a password

- This can be done by creating a `secret`:

  ```bash
  kubectl create secret generic postgres \
          --from-literal=password=$(base64 /dev/urandom | head -c16)
  ```

- And then passing that secret to the container:

  ```yaml
  env:
  - name: POSTGRES_PASSWORD
    valueFrom:
      secretKeyRef:
        name: postgres
        key: password
  ```
class: extra-details

Troubleshooting Portworx

- If we need to see what's going on with Portworx:

  ```bash
  PXPOD=$(kubectl -n kube-system get pod -l name=portworx -o json | jq -r .items[0].metadata.name)
  kubectl -n kube-system exec $PXPOD -- /opt/pwx/bin/pxctl status
  ```

- We can also connect to Lighthouse (a web UI)

  - check the port with `kubectl -n kube-system get svc px-lighthouse`

  - connect to that port

  - the default login/password is `admin/Password1`

  - then specify `portworx-service` as the endpoint
class: extra-details

Removing Portworx

- Portworx provides a storage driver

- It needs to place itself "above" the Kubelet

  (it installs itself straight on the nodes)

- To remove it, we need to do more than just delete its Kubernetes resources

- It is done by applying a special label:

  ```bash
  kubectl label nodes --all px/enabled=remove --overwrite
  ```

- Then removing a bunch of local files:

  ```bash
  sudo chattr -i /etc/pwx/.private.json
  sudo rm -rf /etc/pwx /opt/pwx
  ```

  (on each node where Portworx was running)