This post is a clone from BretFisher/udemy-docker-mastery with some minor modifications.
What is Docker? Three Major Innovations
Docker was not the inventor of containers, but it made them easier to use with the three steps to creating and running them:
- Images: It aids you in packaging an application (with all its dependencies).
- Registries: It helps to distribute that app around to all the places you need to run it.
- Containers: It runs that app in a highly reproducible way.
Docker calls this the "Build, Ship and Run" lifecycle.
These are the basics that all other container technology is built on. Kubernetes, Swarm, Helm and most the cloud native tooling assumes you're using these three innovations.
We’ll learn a lot more about these three innovations in this course, so for now, I'll give you the basics. Note that these three innovations now have standard specifications and are governed by the Open Container Initiative (OCI), which is part of the Linux Foundation.
The Docker Image: Universal app packaging
- It’s called a "Docker image", the standards name "OCI Image", or just "image" for short.
- Docker uses a list instructions, called a Dockerfile, which is similar to a shell script, and it layers those instructions on top of each other until it has everything you need to run the application, including all its system dependencies.
- Including the dependencies is a key differentiator between Docker and many previous packaging systems. It helps prevent the (only) "works on my machine" problem of two different environments having slightly different sets of dependencies.
- If it was a Python app you wanted to build, then the image would contain the app itself, all the Python dependencies the app needs.
- They key distinction is it also include the exact Python version and system libraries to correctly run Python.
- Everything except the OS kernel and hardware drivers is included. Even metadata on how to start the app, default environment variables, and what ports it listens on are included.
The Registry: Easy app distribution
TL;DR - An image registry is largely just a HTTP-based storage system that sends things through HTTP, but it stores them on hard drives
- Called a "Docker registry", or just "registry" for short.
- This innovation was the key to connecting our building of images on one machine to running our containers on another.
- Now that we have built an image, ran it on our local machine, how do we get it on all the other machines?
- How can I be sure that the rest of my team, my CI testing, and all my servers run the exact same image?
- The registry is a HTTP-based package manager that works like apt, yum, npm, and other package managers.
- You can push an image to it, and then pull an image somewhere else.
- The registry protocol is efficient. It only pushes and pulls the changed parts (layers), and stores the image in the machines local cache for fast running of new containers.
Think of the images and the registry as the universal package manager for modern computing, where we may want to build, download, and run any app on any system. This includes building and running on Linux, macOS, Windows, in the cloud, in your datacenter, on a mainframe, or a tiny Raspberry Pi. They all work with Docker.
How Docker registries work
An example if you pull images from google registry, build it locally with another name and push it to your company internal Docker images registry.
docker pull gcr.io/google-containers/alpine-with-bash:1.0
docker build -t registry.mycompany.io:5000/myimage:awesome .
docker push registry.mycompany.io:5000/myimage:awesome
- What happens when we execute
docker run alpine
? - If the Engine needs to pull the
alpine
image, it expands it intolibrary/alpine
library/alpine
is expanded intoindex.docker.io/library/alpine
- The Engine communicates with
index.docker.io
to retrievelibrary/alpine:latest
- To use something else than
index.docker.io
, we specify it in the image name
Things to consider when picking a registry
So, here's a couple of tips.
- Look at its build system. See if it has a way to automate the building of images for you like DockerHub does.
- Multi-user auth and Role-Based Access Control (RBAC)
- Storage features
- There's replication between registries so that you can have a global system of registries if you want to run your own.
- There's caching of different registries so that you can have downstream registries and master registries
- Garbage collection of old images and image layers
The Docker Container: Easy app running
- Called a "Docker container", the standards name "OCI Container", or just "container" for short. It's not called a "docker", or "dockers". Docker is many things.
- Docker will launch your container image into a new running container and use the command you specified in the Dockerfile to start it.
- It uses two Linux Kernel features, called namespaces and cgroups (control groups), to isolate your app so it can’t see the rest of the host by default. To the app, the only files it sees are the ones in the container image. It sees no other processes outside the container and even gets its own virtual ethernet adapter and private IP.
- Now let me stress, It’s not virtualization. It’s application isolation, similar to chroot, FreeBSD jails, or Solaris zones, which all came before Docker.
- You can start many of these containers from the same container image, on the same system, and they’ll all be isolated from each other. File changes in one container don’t affect the files in another.
Practical
Multiple CMD commands
In principle, there should only be one CMD command in your Dockerfile. When CMD is used multiple times, only the last instance is executed.
The CMD command specifies the instruction that is to be executed when a Docker container starts. This CMD command is not really necessary for the container to work, as the echo command can be called in a RUN statement as well.