Deploying an application involves packaging your code and making it available on servers so that users can access it. In this post, I’ll walk through how to build a Go (Golang) application into a Docker image and deploy it to a Kubernetes cluster. We’ll cover creating an optimized Dockerfile, setting up Kubernetes Deployment and Service manifests, and understanding important configurations like GOMAXPROCS and Kubernetes resource limits.

What Is Deployment?

Deployment is the process of distributing an application so that it can be run on a target environment, such as servers or cloud platforms. It involves packaging the application, setting up the necessary infrastructure, and configuring it to run reliably and efficiently.

Building a Go Application into a Docker Image

What Is Docker and Dockerfile?

Docker is a platform that allows developers to package applications into containers—standardized units that contain everything the software needs to run, including libraries, dependencies, and configuration files.

A Dockerfile is a script containing a set of instructions to build a Docker image. It specifies the base image, application code, environment variables, and commands to run.

Creating the Dockerfile

Below is a Dockerfile for building and deploying a Go application:

# syntax=docker/dockerfile:1

# Stage 1: Build
FROM golang:alpine AS build

ENV TZ=Asia/Ulaanbaatar
ENV GO111MODULE=on
ENV GOPROXY=https://goproxy.io,direct

# Install necessary packages
RUN apk add --no-cache bash ca-certificates git gcc g++ libc-dev make tzdata

# Set the working directory
WORKDIR /go/src/app

# Copy go.mod and go.sum files and download dependencies
COPY go.mod go.sum ./
RUN go mod download

# Copy the source code
COPY . .

# Build the Go application
RUN CGO_ENABLED=0 GOOS=linux go build -a -o main main.go

# Stage 2: Deploy
FROM busybox:stable-musl

ENV TZ=Asia/Ulaanbaatar

# Copy necessary files from the build stage
COPY --from=build /go/src/app/main /home/
COPY --from=build /go/src/app/template /home/template

# Set the working directory
WORKDIR /home

# Expose the application port
EXPOSE 4000

# Set the maximum number of open file descriptors
RUN ulimit -n 10000

# Start the application
ENTRYPOINT ["/home/main"]

Explanation of the Dockerfile

Multi-Stage Build

The Dockerfile uses a multi-stage build to optimize the final image size:

  1. Build Stage (FROM golang:alpine AS build):
    • Uses the official Go Alpine image for building the application.
    • Sets up environment variables for time zone and Go modules.
    • Installs necessary packages like Git and compilers.
    • Copies go.mod and go.sum files and runs go mod download to cache dependencies.
    • Copies the entire source code and builds the application binary.
  2. Deployment Stage (FROM busybox:stable-musl):
    • Uses a minimal base image (BusyBox) to reduce the final image size.
    • Copies only the necessary files from the build stage: the binary and any required configuration or asset files.
    • Sets up the working directory and exposes the application port.

Why Use Multi-Stage Builds?

  • Smaller Image Size: By excluding development tools and source code, the final image is much smaller.
  • Security: Reduces the attack surface by including only what’s necessary to run the application.
  • Portability: The resulting image can run anywhere since it contains the statically compiled binary.

Caching Dependencies

  • Caching go.mod and go.sum: By copying go.mod and go.sum before the source code and running go mod download, Docker can cache the dependencies layer. This means that unless the module files change, Docker will use the cached layer, speeding up subsequent builds.

Setting Environment Variables

  • GO111MODULE=on: Ensures Go modules are used.
  • GOPROXY=https://goproxy.io,direct: Sets the Go proxy to speed up module downloads.

Setting ulimit

  • ulimit -n 10000: Increases the maximum number of open file descriptors. This is important for applications that handle many concurrent connections or file operations.

Exposing Ports and Entry Point

  • EXPOSE 4000: Informs Docker that the container listens on port 4000.
  • ENTRYPOINT ["/home/main"]: Specifies the command to run when the container starts.

Deploying to Kubernetes

To run the application on Kubernetes, we need to create Deployment and Service manifests and handle any necessary configurations like image pulling and resource limits.

Kubernetes Service Manifest

The Service exposes the application internally within the cluster.

apiVersion: v1
kind: Service
metadata:
  name: astring-backend-service
  namespace: astring
spec:
  selector:
    app: astring-backend
  ports:
    - protocol: TCP
      port: 4000
      targetPort: 4000

Kubernetes Deployment Manifest

The Deployment manages the application pods and ensures the desired number of replicas are running.

apiVersion: apps/v1
kind: Deployment
metadata:
  name: astring-backend
  namespace: astring
spec:
  replicas: 3
  selector:
    matchLabels:
      app: astring-backend
  template:
    metadata:
      labels:
        app: astring-backend
    spec:
      containers:
        - name: astring-backend-container
          image: registry.gitlab.com/<repo>:latest
          ports:
            - containerPort: 4000
          resources:
            requests:
              memory: "256Mi"
              cpu: "500m"
            limits:
              memory: "512Mi"
              cpu: "500m"
          env:
            - name: GOMAXPROCS
              value: "1"
      imagePullSecrets:
        - name: gitlab-registry-secret

Explanation of the Deployment Manifest

Image Pull Secret

  • imagePullSecrets: Used to authenticate with a private Docker registry. In this case, we’re pulling the image from GitLab’s registry.

Resource Requests and Limits

  • requests: The minimum amount of resources the container requires. Kubernetes uses this to schedule pods on nodes with sufficient resources.
  • limits: The maximum amount of resources the container can use. Enforced by the Kubernetes scheduler.

Setting GOMAXPROCS

  • GOMAXPROCS Environment Variable:
    • By default, Go applications use all available CPU cores on the host machine.
    • In a Kubernetes cluster, a node might have multiple CPU cores, but we may limit the CPU available to a pod.
    • Setting GOMAXPROCS to match the CPU limit ensures the Go runtime schedules goroutines appropriately, preventing CPU throttling.
    • In the example, GOMAXPROCS is set to "1" because the CPU limit is 500m (0.5 CPU). This should be adjusted based on the CPU limit set.

CPU Limits and GOMAXPROCS

  • Understanding Kubernetes CPU Limits:
    • cpu: "500m" means the container can use up to half of a CPU core.
    • If the Go application thinks it has access to more CPU cores, it may spawn more threads, leading to context switching and inefficiency.
  • Why Set GOMAXPROCS?:
    • Aligns the Go scheduler with the actual CPU resources available.
    • Reduces unnecessary goroutines and threads.
    • Improves application performance under CPU limits.

Adjusting GOMAXPROCS Dynamically

Alternatively, you can set GOMAXPROCS dynamically based on the CPU limits:

env:
  - name: GOMAXPROCS
    valueFrom:
      resourceFieldRef:
        resource: limits.cpu

However, note that resourceFieldRef may return fractional CPU values (e.g., 500m), which can cause issues because GOMAXPROCS expects an integer. You may need to handle this within your application or set GOMAXPROCS explicitly.

Creating a Secret for the Private Registry

Since the Docker image is hosted in a private registry (e.g., GitLab’s registry), you need to create a Kubernetes secret for authentication:

kubectl create secret docker-registry gitlab-registry-secret \
  --docker-server=registry.gitlab.com \
  --docker-username=<username> \
  --docker-password=<password> \
  --docker-email=<email> \
  -n astring

  • gitlab-registry-secret: The name of the secret.
  • -docker-server: The registry URL.
  • -docker-username, -docker-password, -docker-email: Your registry credentials.
  • n astring: The namespace where the secret will be created.

Applying the Manifests

After creating the manifests and the secret, apply them to your cluster:

kubectl apply -f deployment.yaml
kubectl apply -f service.yaml

Understanding Kubernetes Resource Limits and Go Runtime

How Kubernetes CPU Limits Work

  • CPU Requests:
    • The amount of CPU guaranteed to the container.
    • Kubernetes uses this value to schedule pods on nodes that have sufficient capacity.
  • CPU Limits:
    • The maximum amount of CPU the container is allowed to use.
    • Enforced by the Linux Kernel’s CFS (Completely Fair Scheduler) quota system.
    • If a container exceeds its CPU limit, it will be throttled.

The Problem with Default GOMAXPROCS

  • Default Behavior:
    • By default, Go sets GOMAXPROCS to the number of CPU cores available on the host machine.
    • If the node has 8 cores, GOMAXPROCS will be 8.
  • Issue in Kubernetes:
    • If your container is limited to less than the full CPU capacity of the node, Go will still think it has access to all cores.
    • This can lead to the Go runtime scheduling more threads and goroutines than allowed, causing excessive context switching and CPU throttling.

Setting GOMAXPROCS Correctly

  • Explicitly Set GOMAXPROCS:
    • Set GOMAXPROCS to match the CPU limit of the container.
    • For example, if cpu: "500m", set GOMAXPROCS to 1 (since 500m is half of a CPU core).
  • Benefits:
    • Ensures the Go scheduler operates within the CPU constraints.
    • Reduces overhead from unnecessary goroutines and threads.
    • Improves performance and resource utilization.

Conclusion

Deploying a Go application to Kubernetes involves several steps:

  • Building an Optimized Docker Image:
    • Use multi-stage builds to reduce image size.
    • Cache dependencies to speed up builds.
    • Minimize the runtime environment for security and efficiency.
  • Configuring Kubernetes Manifests:
    • Define Deployment and Service resources.
    • Handle private image registries with image pull secrets.
    • Set resource requests and limits appropriately.
  • Understanding and Configuring GOMAXPROCS:
    • Align the Go runtime with Kubernetes CPU limits.
    • Improve application performance under resource constraints.

By paying attention to these details, you can ensure your Go applications run efficiently and reliably in a Kubernetes environment.