This project sets up a containerized MLflow tracking server with Docker, using Postgres for metadata and S3 for artifacts.
## Motivation
A containerized MLflow stack provides a consistent experiment tracking and model registry service that can be reused across future projects. Postgres and S3 provide persistent storage, and Docker Compose makes the environment easy to spin up or extend.
## Implementation
Key pieces:
- MLflow server: logs metrics, parameters, and artifacts.
- Postgres: backend database for MLflow metadata.
- S3: object storage for artifacts.
All components are defined in `compose.yml`, and the environment starts with a single command:

```bash
docker compose up -d
```
The Compose file spins up the MLflow server and Postgres as separate containers. Postgres data is mapped to a named volume, so experiment metadata survives container stops and restarts.
```yaml
postgres:
  image: postgres:15
  environment:
    POSTGRES_USER: mlflow
    POSTGRES_PASSWORD: mlflow_pass
    POSTGRES_DB: mlflow_db
  volumes:
    - pgdata:/var/lib/postgresql/data
  networks: [mlflow]
```
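The `pgdata` volume and `mlflow` network referenced by the services also need matching top-level declarations in `compose.yml`. A minimal sketch (names match the service snippets):

```yaml
# Top-level sections of compose.yml: Docker creates the named volume
# and the user-defined network on first `docker compose up`.
volumes:
  pgdata:
networks:
  mlflow:
```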
The MLflow tracking server runs on a user-defined Docker network, allowing future containers to reach it by service name (`mlflow:5000`) rather than by host or container IP.
```yaml
mlflow:
  build: .
  ports:
    - "5000:5000"
  env_file:
    - .env
  depends_on:
    - postgres
  networks: [mlflow]
  command: >
    mlflow server
    --host 0.0.0.0
    --port 5000
    --default-artifact-root "${S3_ARTIFACT_ROOT}"
    --backend-store-uri postgresql://mlflow:mlflow_pass@postgres:5432/mlflow_db
```
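The `env_file` entry and the `${S3_ARTIFACT_ROOT}` interpolation assume a `.env` file next to `compose.yml`. A sketch with placeholder values (the bucket name is illustrative; boto3 inside the container picks up the standard AWS variables from the environment):

```
# .env — placeholder values, not real credentials
AWS_ACCESS_KEY_ID=...
AWS_SECRET_ACCESS_KEY=...
AWS_DEFAULT_REGION=us-east-1
S3_ARTIFACT_ROOT=s3://my-mlflow-artifacts/
```

Keeping credentials in `.env` (and out of version control) lets the same Compose file run against different buckets and accounts.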
The base `ghcr.io/mlflow/mlflow` image for the `mlflow` service does not include the `psycopg2` and `boto3` packages required to talk to Postgres and S3. To enable connectivity, the image is extended with a small Dockerfile (including optional cleanup to reduce the image size):
```dockerfile
FROM ghcr.io/mlflow/mlflow
RUN apt-get update && \
    apt-get install -y --no-install-recommends pkg-config && \
    apt-get clean && rm -rf /var/lib/apt/lists/* && \
    pip install --no-cache-dir --upgrade pip && \
    pip install --no-cache-dir psycopg2-binary boto3
CMD ["bash"]
```
A simple Python script verifies that logging works:
```python
import mlflow

mlflow.set_tracking_uri("http://localhost:5000")

with mlflow.start_run():
    mlflow.log_param("learning_rate", 0.01)
    mlflow.log_metric("accuracy", 0.92)
```
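The script above talks to the server through the published host port. From inside another container on the same Compose network, the tracking URI would instead use the service name. A hypothetical training service (the `train` name and build path are illustrative) could be wired up like this:

```yaml
train:
  build: ./train            # hypothetical image with the mlflow client installed
  environment:
    # The service name resolves via Docker DNS on the shared network,
    # so no host port or container IP is needed.
    MLFLOW_TRACKING_URI: http://mlflow:5000
  depends_on:
    - mlflow
  networks: [mlflow]
```

Because the client reads `MLFLOW_TRACKING_URI` from the environment, the training code itself needs no `set_tracking_uri` call.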