github/container.training

Fork 0

mirror of https://github.com/jpetazzo/container.training.git synced 2026-02-14 17:49:59 +00:00

Files

Jérôme Petazzoni f37d8112f8 🔧 Mention container engine levels

2025-09-11 16:21:27 +02:00

5.5 KiB

Raw Permalink Blame History

Docker Engine and other container engines

We are going to cover the architecture of the Docker Engine.
We will also present other container engines.

class: pic

Docker Engine external architecture

The Engine is a daemon (service running in the background).
All interaction is done through a REST API exposed over a socket.
On Linux, the default socket is a UNIX socket: /var/run/docker.sock.
We can also use a TCP socket, with optional mutual TLS authentication.
The docker CLI communicates with the Engine over the socket.

Note: strictly speaking, the Docker API is not fully REST.

Some operations (e.g. dealing with interactive containers and log streaming) don't fit the REST model.

class: pic

Docker Engine internal architecture

Up to Docker 1.10: the Docker Engine is one single monolithic binary.
Starting with Docker 1.11, the Engine is split into multiple parts:
- dockerd (REST API, auth, networking, storage)
- containerd (container lifecycle, controlled over a gRPC API)
- containerd-shim (per-container; does almost nothing but allows to restart the Engine without restarting the containers)
- runc (per-container; does the actual heavy lifting to start the container)
Some features (like image and snapshot management) are progressively being pushed from dockerd to containerd.

For more details, check this short presentation by Phil Estes.

Other container engines

The following list is not exhaustive.

Furthermore, we limited the scope to Linux containers.

We can also find containers (or things that look like containers) on other platforms like Windows, macOS, Solaris, FreeBSD ...

LXC

The venerable ancestor (first released in 2008).
Docker initially relied on it to execute containers.
No daemon; no central API.
Each container is managed by a lxc-start process.
Each lxc-start process exposes a custom API over a local UNIX socket, allowing to interact with the container.
No notion of image (container filesystems had be managed manually).
Networking had to be set up manually.

LXD

Re-uses LXC code (through liblxc).
Builds on top of LXC to offer a more modern experience.
Daemon exposing a REST API.
Can run containers and virtual machines.
Can manage images, snapshots, migrations, networking, storage.
"offers a user experience similar to virtual machines but using Linux containers instead."
Driven by Canonical.

Incus

Community-driven fork of LXD.
Relatively recent announced in August 2023 so time will tell what the notable differences will be.

CRI-O

Designed to be used with Kubernetes as a simple, basic runtime.
Compares to containerd.
Daemon exposing a gRPC interface.
Controlled using the CRI API (Container Runtime Interface defined by Kubernetes).
Needs an underlying OCI runtime (e.g. runc).
Handles storage, images, networking (through CNI plugins).

We're not aware of anyone using it directly (i.e. outside of Kubernetes).

systemd

"init" system (PID 1) in most modern Linux distributions.
Offers tools like systemd-nspawn and machinectl to manage containers.
systemd-nspawn is "In many ways it is similar to chroot(1), but more powerful".
machinectl can interact with VMs and containers managed by systemd.
Exposes a DBUS API.
Basic image support (tar archives and raw disk images).
Network has to be set up manually.

Kata containers

OCI-compliant runtime.
Fusion of two projects: Intel Clear Containers and Hyper runV.
Run each container in a lightweight virtual machine.
Requires running on bare metal or with nested virtualization.

gVisor

OCI-compliant runtime.
Implements a subset of the Linux kernel system calls.
Written in go, uses a smaller subset of system calls.
Can be heavily sandboxed.
Can run in two modes:
- KVM (requires bare metal or nested virtualization),
- ptrace (no requirement, but slower).

Others

Micro VMs: Firecracker, Edera...
crun (runc rewritten in C)
youki (runc rewritten in Rust)

To Docker Or Not To Docker

The Docker Engine is very developer-centric:
- easy to install
- easy to use
- no manual setup
- first-class image build and transfer
As a result, it is a fantastic tool in development environments.
On Kubernetes clusters, containerd or CRI-O are better choices.
On Kubernetes clusters, the container engine is an implementation detail.

Different levels

Directly use namespaces, cgroups, capabilities with custom code or scripts

useful for troubleshooting/debugging and for educative purposes; e.g. pipework
Use low-level engines like runc, crun, youki

useful when building custom architectures; e.g. a brand new orchestrator
Use low-level APIs like CRI or containerd grpc API

useful to achieve high-level features like Docker, but without Docker; e.g. ctr, nerdctl
Use high-level APIs like Docker and Kubernetes

that's what most people will do

5.5 KiB Raw Permalink Blame History

Docker Engine and other container engines

Docker Engine external architecture

Docker Engine external architecture

Docker Engine internal architecture

Docker Engine internal architecture

Other container engines

LXC

LXD

Incus

CRI-O

systemd

Kata containers

gVisor

Others

To Docker Or Not To Docker

Different levels

5.5 KiB

Raw Permalink Blame History