Sharing Our Passion for Technology
& Continuous Learning
<   Back to Blog

A Hands-On Tour of Kubernetes: Part 4 - Deployments and Replication

kubernetes logo on a blue background

Deployments

So far we’ve been deploying pods directly. This has been a great way to gain familiarity with Kubernetes, but typically, we’ll rely on some other workload resource to create our pods for us. In this section, we’ll look at the Deployment resource.

Creating a deployment isn’t too different from creating a pod directly. We specify a name/namespace along with a container image. However, when we create a deployment, we can also use --replicas to tell it how many pod replicas to create. All the pods created by a deployment use the same image, but they are given a unique, randomly-generated name.

Let’s test this out. We’ll start by creating a namespace for ourselves

$ kubectl create namespace galaxy
namespace/galaxy created

Now, let’s create our first deployment. In this case, we’ll specify that we want three replicas:

$ kubectl create deployment star --image=jmalloc/echo-server:0.3.6 --replicas=3 --namespace=galaxy
deployment.apps/star created

Like pods, namespaces, and services, we can use kubectl get to list our deployments:

$ kubectl get deployments --namespace=galaxy
NAME   READY   UP-TO-DATE   AVAILABLE   AGE
star   3/3     3            3           6s

This output shows that our deployment, star, created the three pods as requested. Let’s list those three pods:

$ kubectl get pods --namespace=galaxy
NAME                    READY   STATUS    RESTARTS   AGE
star-796b6dcc77-q4lhd   1/1     Running   0          4m57s
star-796b6dcc77-j87bk   1/1     Running   0          4m57s
star-796b6dcc77-kvtfk   1/1     Running   0          4m57s

Before, when we managed pods directly, deleting a pod meant it was truly deleted from the cluster. In comparison, deployments will make sure there are the correct number of pods. If we delete a pod that is part of a deployment, the deployment will automatically create a new pod.

$ kubectl delete pod star-796b6dcc77-q4lhd --namespace=galaxy
pod "star-796b6dcc77-q4lhd" deleted

$ kubectl get pods --namespace=galaxy
NAME                    READY   STATUS    RESTARTS   AGE
star-796b6dcc77-j87bk   1/1     Running   0          7m46s
star-796b6dcc77-kvtfk   1/1     Running   0          7m46s
star-796b6dcc77-7qlht   1/1     Running   0          5s

We’re not stuck with three replicas though. We can scale our deployment to adjust the replicas count. Let’s scale up our replica count:

$ kubectl scale deployment star --replicas=5 --namespace=galaxy
deployment.apps/star scaled

$ kubectl get pods --namespace=galaxy
NAME                    READY   STATUS    RESTARTS   AGE
star-796b6dcc77-j87bk   1/1     Running   0          11m
star-796b6dcc77-kvtfk   1/1     Running   0          11m
star-796b6dcc77-7qlht   1/1     Running   0          3m58s
star-796b6dcc77-lqst5   1/1     Running   0          5s
star-796b6dcc77-57qfg   1/1     Running   0          5s

And now, let’s scale down our replica count:

$ kubectl scale deployment star --replicas=2 --namespace=galaxy
deployment.apps/star scaled

$ kubectl get pods --namespace=galaxy
NAME                    READY   STATUS    RESTARTS   AGE
star-796b6dcc77-j87bk   1/1     Running   0          12m
star-796b6dcc77-kvtfk   1/1     Running   0          12m

Deleting a deployment will remove all the pods it created:

$ kubectl delete deployment star --namespace=galaxy
deployment.apps "star" deleted

$ kubectl get pods --namespace=galaxy
No resources found in galaxy namespace.

Deployments and services play nicely together.

Let’s recreate our deployment and look at the labels:

$ kubectl create deployment star --image=jmalloc/echo-server:0.3.6 --replicas=3 --namespace=galaxy
deployment.apps/star created

$ kubectl get pods --show-labels --namespace=galaxy
NAME                    READY   STATUS    RESTARTS   AGE   LABELS
star-796b6dcc77-hkzpm   1/1     Running   0          17s   app=star,pod-template-hash=796b6dcc77
star-796b6dcc77-7bmqd   1/1     Running   0          17s   app=star,pod-template-hash=796b6dcc77
star-796b6dcc77-rq5kv   1/1     Running   0          17s   app=star,pod-template-hash=796b6dcc77

Notice how all the pods have an app=star label. A Deployment will attach the same labels to all its pods. This plays nicely with Services. We can use the expose command to create a Service, using app=star as the selector.

$ kubectl expose deployment/star --name=star --port=8080 --selector=app=star --namespace=galaxy
service/star exposed

Create our wget pod

$ kubectl run spaceship --image=alpine:3.19 --namespace=galaxy --command -- sleep infinite
pod/spaceship created

Now let’s call our service from the pod:

$ kubectl exec pod/spaceship --namespace=galaxy -- wget -q -O- http://star:8080
Request served by star-796b6dcc77-hkzpm

HTTP/1.1 GET /

Host: star:8080
User-Agent: Wget
Connection: close

As we learned previously, the Service will load balance requests across the pods with matching labels. Since a Deployment creates pods with the same labels, our requests will be load-balanced across them.

$ kubectl exec pod/spaceship --namespace=galaxy -- wget -q -O- http://star:8080
Request served by star-796b6dcc77-rq5kv

HTTP/1.1 GET /

Host: star:8080
User-Agent: Wget
Connection: close

(Note that the hostname is different. You may need to execute the request several times to see the different hostnames)

It is possible to scale to 0 replicas, by the way. Don’t think of this as a lambda, however. Pods won’t automatically spin up when a request is received.

$ kubectl scale deployment star --replicas=0 --namespace=galaxy
deployment.apps/star scaled

$ kubectl get pods --namespace=galaxy
NAME        READY   STATUS    RESTARTS   AGE
spaceship   1/1     Running   0          16m

$ kubectl exec pod/spaceship --namespace=galaxy -- wget -q -O- http://star:8080
wget: can't connect to remote host (10.43.215.214): Connection refused
command terminated with exit code 1

Our request to the Service fails because no pods running.


Deployments also make it easy to update our application image. Great for deploying new versions of our app.

Let’s create a new deployment for ourselves to see how this works.

$ kubectl create deployment planet --replicas=3 --image=nginx:1.21 --namespace=galaxy
deployment.apps/planet created

$ kubectl get pods --namespace=galaxy
NAME                      READY   STATUS    RESTARTS   AGE
spaceship                 1/1     Running   0          17m
planet-7dbd5686dd-wgnhh   1/1     Running   0          15s
planet-7dbd5686dd-92pfj   1/1     Running   0          15s
planet-7dbd5686dd-8l2xs   1/1     Running   0          15s

Let’s say we are ready to deploy a new version of our app. We can do:

$ kubectl set image deployment/planet nginx=nginx:1.22 --namespace=galaxy
deployment.apps/planet image updated

$ kubectl get pods --namespace=galaxy
NAME                      READY   STATUS    RESTARTS   AGE
spaceship                 1/1     Running   0          20m
planet-78c595dc85-vqw8j   1/1     Running   0          38s
planet-78c595dc85-nkjsr   1/1     Running   0          27s
planet-78c595dc85-jv8cx   1/1     Running   0          25s

Note: the first part of nginx=<image> (the name to the left of the equals) refers to the container name in the pod. The container name can be verified by looking at the details of the pod or deployment using kubectl describe. With how we’ve been creating resources, the container name matches the name of the container image without the tag or repository information.

Depending on how quickly you run the commands, you may see extra pods with a ContainerCreating or Terminating status. For example:

NAME                      READY   STATUS              RESTARTS   AGE
spaceship                 1/1     Running             0          19m
planet-7dbd5686dd-wgnhh   1/1     Running             0          2m37s
planet-7dbd5686dd-92pfj   1/1     Running             0          2m37s
planet-7dbd5686dd-8l2xs   1/1     Running             0          2m37s
planet-78c595dc85-vqw8j   0/1     ContainerCreating   0          8s

This is because the Deployment uses a RollingUpdate rollout strategy. Kubernetes incrementally replaces instances of your old version with the new one. This ensures that at least one instance of your application is always running, leading to zero downtime.

Let’s clean up before moving on:

$ kubectl delete namespace galaxy

Demo Application

So far, we’ve been using existing images, but how do we deploy our application?

This demo application is a simple web application, using Python 3.9, with a frontend and backend that displays a few of my favorite things. We will create Docker images and use a manifest file to deploy them to a Kubernetes cluster.

Backend

We’ll start by constructing a backend API that facilitates interaction with our frontend. The backend will be a simple API that will allow us to create and retrieve our favorite things. The backend will be built using FastAPI and will be packaged as a Docker image.

Our backend in the file backend.py will have two endpoints:

  • GET /things - returns a list of things
  • POST /things - creates a new thing
import fastapi
import pydantic

class Thing(pydantic.BaseModel):
    name: str

app = fastapi.FastAPI()

things: list[Thing] = [
    {"name": "raindrops on roses"},
    {"name": "whiskers on kittens"},
    {"name": "bright copper kettles"},
    {"name": "warm woolen mittens"},
]

@app.get("/things", response_model=list[Thing])
def get_things():
    return things

@app.post("/things", status_code=201)
def create_thing(thing: Thing):
    things.append(thing)
    return thing

Let’s add a Dockerfile backend.Dockerfile to build our backend image.

FROM python:3.9

WORKDIR /app
COPY requirements.txt .
RUN pip install -r requirements.txt

COPY backend.py .

EXPOSE 8000
ENTRYPOINT ["uvicorn", "backend:app", "--host", "0.0.0.0", "--port", "8000"]

Frontend

Let’s build a frontend that will interact with our backend API. Create a file called frontend.py that contains the following:

import fastapi
import fastapi.templating
import pydantic_settings
import requests

class Settings(pydantic_settings.BaseSettings):
    api_host: str = "localhost"
    api_port: int = 8000

settings = Settings()
app = fastapi.FastAPI()
templates = fastapi.templating.Jinja2Templates(directory="templates")

api_url = f"http://{settings.api_host}:{settings.api_port}"

@app.get("/")
def index(request: fastapi.Request):
    print(api_url)
    r = requests.get(f"{api_url}/things")
    r.raise_for_status()
    return templates.TemplateResponse("index.html", {"request": request, "things": r.json()})

@app.post("/new")
def new(request: fastapi.Request, name: str = fastapi.Form()):
    r = requests.post(f"{api_url}/things", json={"name": name})
    r.raise_for_status()
    return templates.TemplateResponse("new.html", {"request": request, "name": name})

We will also need to create a templates folder to hold our HTML files. Inside the templates folder, we will create two files, index.html and new.html, with the following content.

The Jinja template file for index.html

<!DOCTYPE html>
<html>
<body>

<script src="https://unpkg.com/htmx.org@1.7.0" integrity="sha384-EzBXYPt0/T6gxNp0nuPtLkmRpmDBbjg6WmCUZRLXBBwYYmwAUxzlSGej0ARHX0Bo" crossorigin="anonymous" defer></script>

<h1>Favorite Things</h1>

<div id="things">
{% for thing in things %}
    <p>{{ thing.name }}</p>
{% endfor %}
</div>

<form hx-post="new" hx-target="#things" hx-swap="beforeend">
    <input type="text" name="name">
    <input type="submit">
</form>

</body>
</html>

The Jinja template file for new.html

<p>{{ name }}</p>

Let’s create a Dockerfile frontend.Dockerfile that we will use to build our frontend image.

FROM python:3.9

WORKDIR /app
COPY requirements.txt .
RUN pip install -r requirements.txt

COPY templates/ templates/
COPY frontend.py .

EXPOSE 8000
ENTRYPOINT ["uvicorn", "frontend:app", "--host", "0.0.0.0", "--port", "3000"]

Before we can build our images, we need to create a requirements.txt file to hold our python dependencies. The contents of the file will be as below:

annotated-types==0.6.0
anyio==4.2.0
certifi==2023.11.17
charset-normalizer==3.3.2
click==8.1.7
fastapi==0.109.0
h11==0.14.0
idna==3.6
Jinja2==3.1.3
MarkupSafe==2.1.4
pydantic==2.5.3
pydantic-settings==2.1.0
pydantic_core==2.14.6
python-dotenv==1.0.1
python-multipart==0.0.6
requests==2.31.0
sniffio==1.3.0
starlette==0.35.1
typing_extensions==4.9.0
urllib3==2.1.0
uvicorn==0.27.0

We then create a manifest file demo.yaml that will hold our Kubernetes resources. The file will contain the following manifest files:

apiVersion: v1
kind: Namespace
metadata:
  name: demo

---

apiVersion: apps/v1
kind: Deployment
metadata:
  name: backend
  namespace: demo
spec:
  replicas: 1
  selector:
    matchLabels:
      app: backend
  template:
    metadata:
      labels:
        app: backend
    spec:
      containers:
      - image: backend:1
        name: backend
        imagePullPolicy: Never

---

apiVersion: v1
kind: Service
metadata:
  name: backend
  namespace: demo
spec:
  selector:
    app: backend
  ports:
  - port: 8000
    protocol: TCP
    targetPort: 8000

---

apiVersion: apps/v1
kind: Deployment
metadata:
  name: frontend
  namespace: demo
spec:
  replicas: 1
  selector:
    matchLabels:
      app: frontend
  template:
    metadata:
      labels:
        app: frontend
    spec:
      containers:
      - image: frontend:1
        name: frontend
        imagePullPolicy: Never
        env:
        - name: API_HOST
          value: backend

---

apiVersion: v1
kind: Service
metadata:
  name: frontend
  namespace: demo
spec:
  selector:
    app: frontend
  ports:
  - port: 3000
    protocol: TCP
    targetPort: 3000

Notice imagePullPolicy is set to Never. This is because we are building our images locally. If we were to push our images to a registry, we would set this to Always or ifNotPresent.

The folder structure will look like this:

demo
  - templates
    - index.html
    - new.html
  - backend.Dockerfile
  - backend.py
  - demo.yaml
  - frontend.Dockerfile
  - frontend.py
  - requirements.txt

Now let’s build our Docker images and apply the manifest file.

Build and tag the docker image for the backend.

docker build . -f backend.Dockerfile -t backend:1

Build and tag the docker image for the frontend.

docker build . -f frontend.Dockerfile -t frontend:1

Now we can deploy our application to our cluster with kubectl apply.

kubectl apply -f demo.yaml

Then, we need to port-forward the frontend so that we can see it in our browser.

kubectl port-forward svc/frontend 3000:3000 --namespace demo

When you visit http://localhost:3000 in your browser, you should see the following page:

A screenshot of a webpage titled ‘Favorite Things’ with a list of four items: ‘raindrops on roses’, ‘whiskers on kittens’, ‘bright copper kettles’, and ‘warm woolen mittens’. Below the list is a ‘Submit’ button.”

Updating our application

Let’s say we want to update our application to a new version. We can do this by creating a new Docker image and updating our manifest to use the new image version.

Let’s update the index.html with a new title.

...
<h1>My Favorite Things</h1>
...

We then build and tag the frontend image version to 2:

docker build . -f frontend.Dockerfile -t frontend:2

Then, we update the frontend deployment in the manifest to use the new image version.

...
apiVersion: apps/v1
kind: Deployment
metadata:
  name: frontend
  namespace: demo
spec:
  replicas: 1
  selector:
    matchLabels:
      app: frontend
  template:
    metadata:
      labels:
        app: frontend
    spec:
      containers:
      - image: frontend:2
        name: frontend
        imagePullPolicy: Never
...

Finally, we can apply the changes to our cluster.

$ kubectl apply -f demo.yaml

Running kubectl get pods -n demo quickly enough will show that the frontend pod is being updated with a RollingUpdate rollout strategy.

$ kubectl get pods --namespace demo
NAME                        READY   STATUS              RESTARTS   AGE
backend-5df5d9554c-5zzp8    1/1     Running             0          2d22h
frontend-77b7dbdfff-vq79m   1/1     Running             0          26s
frontend-dc8bc69c4-pcc7g    0/1     ContainerCreating   0          1s
$ kubectl get pods --namespace demo
NAME                        READY   STATUS        RESTARTS   AGE
backend-5df5d9554c-5zzp8    1/1     Running       0          2d22h
frontend-dc8bc69c4-pcc7g    1/1     Running       0          2s
frontend-77b7dbdfff-vq79m   0/1     Terminating   0          27s

Once the update is complete, we can refresh our browser to see the new version of our application.

Scaling

Let’s say we want to scale our backend to 3 replicas. We can update our manifest to use 3 replicas and then apply the changes to our cluster.

...
apiVersion: apps/v1
kind: Deployment
metadata:
  name: backend
  namespace: demo
spec:
  replicas: 3
  selector:
    matchLabels:
      app: backend
  template:
    metadata:
      labels:
        app: backend
    spec:
      containers:
      - image: backend:1
        name: backend
        imagePullPolicy: Never
...

Then, apply the changes to our cluster.


$ kubectl apply -f demo.yaml

Now we can check the status of our pods:

$ kubectl get pods --namespace=demo -o wide --selector=app=backend
NAME                       READY   STATUS    RESTARTS   AGE   IP            NODE                   NOMINATED NODE   READINESS GATES
backend-5df5d9554c-8v9pq   1/1     Running   3          41m   10.42.0.251   lima-rancher-desktop   <none>           <none>
backend-5df5d9554c-ngctc   1/1     Running   3          41m   10.42.0.254   lima-rancher-desktop   <none>           <none>
backend-5df5d9554c-n5mvm   1/1     Running   3          41m   10.42.0.252   lima-rancher-desktop   <none>           <none>

In this post, we looked at how deployments can be used to manage pod replicas. We also saw how deployments and services can be used to expose our pods. Our demo application demonstrated how to deploy our own applications, and how to update them with zero downtime.

What Next?

Familiarize yourself with more workload types:

Other Concepts: