Add docker flow by yangm2 · Pull Request #262 · codeforpdx/tenantfirstaid

yangm2 · 2026-02-11T01:31:47Z

What type of PR is this? (check all applicable)

Description

This PR adds docker and apple/container containers to the development flow. This also opens up the way to publish docker containers on ghcr.io and deploying those containers to staging and production.

Note: Building the backend-dev image took >15 minutes(!) on my Macbook Pro w/ M2 CPU. And the resulting image (TYPE=dev) is ~350MB. We may want to look into streamlining the dev dependencies.

Disclosure: The code was initially generated with Claude coding agent 🤖.

Related Tickets & Documents

Related Issue Audit the project for Containerization #173, Misc small improvements to backend evaluation flow #272 (minimizes prd deps)
Closes improve local testing and deployments with docker containers #253

QA Instructions, Screenshots, Recordings

With the frontend container, I was able to run and pass the regression tests. Without the container, the frontend tests fail on my Macbook Pro (presumably due to a misconfigured npm development environment ... i.e. old version of npm?).

with apple/containers
- frontend
  - run lint
  - run tests
- backend
  - run make check
  - ~~Known Issue: mounting the google default application credential json results in a directory instead of the json file. This needs to be resolved for local integration testing.~~ FIXED
- local integration testing
with docker/compose
- frontend
  - run lint
  - run tests
- backend
  - run make check
- local integration testing

Added/updated tests?

Yes
No, and this is why: development environment changes
I need help with writing tests

Documentation

If this PR changes the system architecture, Architecture.md has been updated

[optional] Are there any post deployment tasks we need to perform?

backend/Dockerfile

github-actions · 2026-02-15T03:39:28Z

backend/Dockerfile

+CMD ["uv", "run", "gunicorn", \
+     "--timeout", "300", \
+     "--capture-output", \
+     "--access-logfile", "-", \
+     "--error-logfile", "-", \
+     "--log-level", "info", \
+     "-w", "4", \
+     "-b", "0.0.0.0:5000", \
+     "tenantfirstaid.app:app"]


Production Readiness: Missing health check endpoint

The Gunicorn configuration looks solid, but consider adding health check configuration and signal handling:

# Add healthcheck HEALTHCHECK --interval=30s --timeout=5s --start-period=10s --retries=3 \ CMD curl -f http://localhost:5000/health || exit 1 # Improve graceful shutdown CMD ["uv", "run", "gunicorn", \ "--timeout", "300", \ "--graceful-timeout", "30", \ "--keep-alive", "5", \ "--capture-output", \ "--access-logfile", "-", \ "--error-logfile", "-", \ "--log-level", "info", \ "-w", "4", \ "-b", "0.0.0.0:5000", \ "tenantfirstaid.app:app"]

This requires adding a /health endpoint to the Flask app.

backend/Dockerfile

github-actions · 2026-02-15T03:39:36Z

frontend/Dockerfile

+        try_files $uri /index.html;
+    }
+
+    # Security headers


Security: Add CSP and additional hardening headers

Good security headers present, but consider adding Content Security Policy and additional hardening:

# Content Security Policy add_header Content-Security-Policy "default-src 'self'; script-src 'self'; style-src 'self' 'unsafe-inline'; img-src 'self' data:; font-src 'self'; connect-src 'self' http://localhost:5000;" always; # Additional security headers add_header Referrer-Policy "strict-origin-when-cross-origin" always; add_header Permissions-Policy "geolocation=(), microphone=(), camera=()" always;

Note: You may need to adjust CSP based on actual frontend requirements (external fonts, CDN assets, etc.).

github-actions · 2026-02-15T03:39:39Z

frontend/Dockerfile

+}
+EOF
+
+EXPOSE 80


Production Readiness: Add healthcheck and proper logging

Consider adding nginx healthcheck and improving logging configuration:

HEALTHCHECK --interval=30s --timeout=3s \ CMD wget --quiet --tries=1 --spider http://localhost/health || exit 1 # Configure nginx for better logging RUN cat > /etc/nginx/nginx.conf <<'EOF' user nginx; worker_processes auto; error_log /var/log/nginx/error.log warn; pid /var/run/nginx.pid; events { worker_connections 1024; } http { include /etc/nginx/mime.types; default_type application/octet-stream; log_format main '$remote_addr - $remote_user [$time_local] "$request" ' '$status $body_bytes_sent "$http_referer" ' '"$http_user_agent" "$http_x_forwarded_for"'; access_log /var/log/nginx/access.log main; sendfile on; keepalive_timeout 65; gzip on; include /etc/nginx/conf.d/*.conf; } EOF

docker-compose.yml

github-actions · 2026-02-15T03:39:49Z

docker-compose.yml

+      - "5000:5000"
+    volumes:
+      # Mount source for hot reload
+      - ./backend/tenantfirstaid:/app/tenantfirstaid:ro


Development Experience: Read-only volumes limit functionality

The :ro (read-only) flag on source volumes prevents some development workflows:

Can't run formatters/linters that modify files in place

Can't generate coverage reports

Can't create new test files from within container

Recommendation: Remove :ro for development volumes or make it configurable:

volumes: - ./backend/tenantfirstaid:/app/tenantfirstaid # writable for dev tools - ./backend/tests:/app/tests - ./backend/scripts:/app/scripts

For security-sensitive mounts (credentials), keep :ro.

github-actions · 2026-02-15T03:39:53Z

docker-compose.yml

+      - VERTEX_AI_DATASTORE=${VERTEX_AI_DATASTORE}
+      - MODEL_NAME=${MODEL_NAME:-gemini-2.5-pro}
+      - LOG_LEVEL=${LOG_LEVEL:-DEBUG}
+      - LANGSMITH_API_KEY=${LANGSMITH_API_KEY}


Configuration: Verify LangSmith defaults

LangSmith tracing is enabled by default in the dev environment. This could:

Generate significant API usage/costs if developers aren't aware

Send data to external service during local dev

Cause issues if LANGSMITH_API_KEY is not set

Recommendation:

- LANGSMITH_TRACING=${LANGSMITH_TRACING:-false} # Opt-in instead of opt-out - LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-false}

Update .env.docker.example to document when developers should enable this.

github-actions · 2026-02-15T03:39:57Z

docker-compose.prod.yml

+    ports:
+      - "127.0.0.1:5000:5000"


Production Configuration: Bind to localhost only

Good security practice binding to 127.0.0.1 (prevents direct external access), but this assumes:

A reverse proxy (nginx, traefik) will handle external traffic

The proxy runs on the same host or has network access

Recommendation: Document this requirement in Architecture.md and consider adding the reverse proxy to the compose file:

services: nginx-proxy: image: nginx:alpine ports: - "80:80" - "443:443" volumes: - ./nginx.conf:/etc/nginx/nginx.conf:ro depends_on: - frontend - backend

github-actions · 2026-02-15T03:40:01Z

docker-compose.prod.yml

+    environment:
+      - ENV=prod
+      - FLASK_ENV=production
+      - FLASK_SECRET_KEY=${FLASK_SECRET_KEY}


Security: FLASK_SECRET_KEY in environment

Passing FLASK_SECRET_KEY via environment variable is less secure than using Docker secrets for production.

Recommendation:

secrets: flask_secret: external: true services: backend: secrets: - flask_secret environment: - FLASK_SECRET_KEY_FILE=/run/secrets/flask_secret

Then update Flask app to read from file:

secret_key_file = os.getenv('FLASK_SECRET_KEY_FILE') if secret_key_file: with open(secret_key_file) as f: app.config['SECRET_KEY'] = f.read().strip()

github-actions · 2026-02-15T03:40:04Z

.env.docker.example

+LOG_LEVEL=DEBUG
+
+# Flask configuration
+FLASK_SECRET_KEY=dev-secret-key-unsafe


Security: Unsafe default for production

The example includes FLASK_SECRET_KEY=dev-secret-key-unsafe which could accidentally be used in production.

Recommendation:

# Flask configuration # WARNING: Generate a secure random key for production! # python -c 'import secrets; print(secrets.token_hex(32))' FLASK_SECRET_KEY=dev-secret-key-CHANGE-THIS-IN-PRODUCTION

Consider adding a startup check in the Flask app that errors if the secret key matches known insecure defaults.

Include reasoning in LangSmith experiment data upload Move 'facts' from input to reference_output in LangSmith dataset creation so that the LLM-as-a-Judge has that for comparison

Implements issue codeforpdx#253 with complete Docker support: - Multi-stage Dockerfiles for backend and frontend (dev/deploy types) - docker-compose.yml for local development with hot reload - docker-compose.prod.yml for production deployment - Documentation updates in README, Architecture, and CLAUDE files Key features: - Dev images include all tooling, tests, and hot reload - Deploy images are minimal with production-only dependencies - Support for Mac, Linux, and Windows development environments Note: GitHub Actions workflow file (.github/workflows/docker-build.yml) needs to be added separately by a maintainer with workflow permissions. Co-authored-by: yangm2 <yangm2@users.noreply.github.com>

only install `make` in dev use uv's Docker best practices i.e. caching

backend/scripts/create_langsmith_dataset.py

backend/Dockerfile

github-actions · 2026-02-23T02:43:40Z

docker-compose.yml

+      - MODEL_NAME=${MODEL_NAME:-gemini-2.5-pro}
+      - LOG_LEVEL=${LOG_LEVEL:-DEBUG}
+      - LANGSMITH_API_KEY=${LANGSMITH_API_KEY}
+      - LANGSMITH_TRACING=${LANGSMITH_TRACING:-true}


LangSmith tracing defaults to true, meaning every developer who runs docker compose up without a .env file will send traces (and incur API costs) unless they explicitly opt out. Consider defaulting to false:

Suggested change

- LANGSMITH_TRACING=${LANGSMITH_TRACING:-true}

- LANGSMITH_TRACING=${LANGSMITH_TRACING:-false}

github-actions · 2026-02-23T02:43:43Z

docker-compose.yml

+      - LOG_LEVEL=${LOG_LEVEL:-DEBUG}
+      - LANGSMITH_API_KEY=${LANGSMITH_API_KEY}
+      - LANGSMITH_TRACING=${LANGSMITH_TRACING:-true}
+      - LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-true}


Same opt-out concern as LANGSMITH_TRACING above.

Suggested change

- LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-true}

- LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-false}

github-actions · 2026-02-23T03:37:30Z

docker-compose.yml

+      - ./backend/tests:/app/tests:ro
+      - ./backend/scripts:/app/scripts:ro
+      # Mount Google credentials from host
+      - ${GOOGLE_APPLICATION_CREDENTIALS}:/app/secrets/google-creds.json:ro


The variable here is ${GOOGLE_APPLICATION_CREDENTIALS}, but .env.docker.example defines GOOGLE_APPLICATION_CREDENTIALS_HOST. These names don't match, so a developer who copies the example file to .env will have an unset variable — Docker will then create a directory at the target path instead of mounting the file (the root cause of the "directory instead of file" issue mentioned in the PR description).

Suggested change

- ${GOOGLE_APPLICATION_CREDENTIALS}:/app/secrets/google-creds.json:ro

- ${GOOGLE_APPLICATION_CREDENTIALS_HOST}:/app/secrets/google-creds.json:ro

github-actions · 2026-02-23T03:37:37Z

docker-compose.yml

+      - "3000:5000"
+    volumes:
+      # Mount source for hot reload
+      - ./backend/tenantfirstaid:/app/tenantfirstaid:ro


Read-only mounts here prevent make fmt (ruff format) and make lint --fix from modifying files in-place. But CLAUDE.md documents exactly these commands as the Docker workflow:

docker compose exec backend make fmt docker compose exec backend make lint

Running those will silently produce no changes (or fail with a permission error), which is confusing. Either drop :ro on the source mounts, or note in CLAUDE.md that formatters must be run on the host.

Suggested change

- ./backend/tenantfirstaid:/app/tenantfirstaid:ro

- ./backend/tenantfirstaid:/app/tenantfirstaid

- ./backend/tests:/app/tests

- ./backend/scripts:/app/scripts

github-actions · 2026-02-23T03:37:42Z

docker-compose.yml

+      - MODEL_NAME=${MODEL_NAME:-gemini-2.5-pro}
+      - LOG_LEVEL=${LOG_LEVEL:-DEBUG}
+      - LANGSMITH_API_KEY=${LANGSMITH_API_KEY}
+      - LANGSMITH_TRACING=${LANGSMITH_TRACING:-true}


LangSmith tracing defaults to opt-in (true), meaning any developer who runs docker compose up without a .env will send traces and incur API costs. Consistent with the second reviewer's note — prefer opt-out:

Suggested change

- LANGSMITH_TRACING=${LANGSMITH_TRACING:-true}

- LANGSMITH_TRACING=${LANGSMITH_TRACING:-false}

github-actions · 2026-02-23T03:37:44Z

docker-compose.yml

+      - LOG_LEVEL=${LOG_LEVEL:-DEBUG}
+      - LANGSMITH_API_KEY=${LANGSMITH_API_KEY}
+      - LANGSMITH_TRACING=${LANGSMITH_TRACING:-true}
+      - LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-true}


Suggested change

- LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-true}

- LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-false}

github-actions · 2026-02-23T03:37:50Z

backend/Dockerfile

+
+# Sync the project
+RUN --mount=type=cache,target=/root/.cache/uv \
+    uv sync --locked


The first uv sync on line 30 uses --frozen, but this second sync uses --locked. Both prevent lockfile writes, but --frozen is the stricter form (skips resolution entirely), matches the deploy stage, and avoids any possibility of touching the lockfile. Use --frozen for consistency:

Suggested change

uv sync --locked

uv sync --frozen

github-actions · 2026-02-23T03:37:55Z

docker-compose.prod.yml

@@ -0,0 +1,46 @@
+services:
+  backend:
+    image: ghcr.io/codeforpdx/tenantfirstaid/backend:latest


Using :latest in production means two docker compose pull calls a week apart may deploy different code. Pin to a specific tag (e.g., a git SHA or semantic version) so deployments are reproducible and rollbacks are straightforward.

github-actions · 2026-02-23T03:38:00Z

.env.docker.example

+
+# Path to Google credentials on your HOST machine
+# Default location after 'gcloud auth application-default login'
+GOOGLE_APPLICATION_CREDENTIALS_HOST=/home/<USERNAME>/.config/gcloud/application_default_credentials.json


The variable name here is GOOGLE_APPLICATION_CREDENTIALS_HOST, but docker-compose.yml reads ${GOOGLE_APPLICATION_CREDENTIALS} on line 18. One of them needs to change — see the inline comment on the compose file for the suggested fix there.

github-actions · 2026-02-23T03:38:05Z

.env.docker.example

+LOG_LEVEL=DEBUG
+
+# Flask configuration
+FLASK_SECRET_KEY=dev-secret-key-unsafe


The "unsafe" suffix is easy to miss. Add an explicit generation command and make the warning harder to skip:

Suggested change

FLASK_SECRET_KEY=dev-secret-key-unsafe

# WARNING: Replace with a strong random value before any non-local use.

# Generate one with: python -c 'import secrets; print(secrets.token_hex(32))'

FLASK_SECRET_KEY=dev-secret-key-CHANGE-THIS

github-actions · 2026-02-23T03:38:09Z

.env.docker.example

+# LangSmith (optional)
+LANGSMITH_API_KEY=<YOUR_LANGSMITH_KEY>
+LANGSMITH_TRACING=true
+LANGCHAIN_TRACING_V2=true


If LANGSMITH_TRACING defaults to false in the compose file (see suggestion there), the values here should match to avoid confusing developers who copy the example as-is.

Suggested change

LANGCHAIN_TRACING_V2=true

LANGSMITH_TRACING=false

LANGCHAIN_TRACING_V2=false

github-actions · 2026-02-23T03:43:52Z

Claude finished @yangm2's task in 2m 30s —— View job

PR #262 Review: Add Docker Flow

Good overall structure — the multi-stage Dockerfile pattern is solid, and the recent commits addressed several of the earlier review concerns. Here's what's confirmed fixed vs. what still needs attention.

Confirmed Fixed (recent commits)

--frozen flag: all three uv sync calls now use --frozen consistently
curl | sh antipattern: replaced with the official ghcr.io/astral-sh/uv image
Google credentials variable name: GOOGLE_APPLICATION_CREDENTIALS now consistent between .env.docker.example and docker-compose.yml

Outstanding Issues

1. LangSmith tracing defaults to true — docker-compose.yml lines 30–31

Any developer who runs docker compose up without a .env will send traces and incur API costs with no warning. This should opt-out by default:

- LANGSMITH_TRACING=${LANGSMITH_TRACING:-false}
- LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-false}

Fix this →

2. Read-only volume mounts block make fmt — docker-compose.yml lines 14–16

CLAUDE.md documents exactly these commands for Docker dev workflow:

docker compose exec backend make fmt
docker compose exec backend make lint

With :ro on the source mounts, both will silently produce no changes (ruff can't write) or fail with a permission error. The :ro flag makes sense for credential mounts but not for source code in a dev environment.

- ./backend/tenantfirstaid:/app/tenantfirstaid
- ./backend/tests:/app/tests
- ./backend/scripts:/app/scripts

Fix this →

3. Production uses :latest — docker-compose.prod.yml lines 3, 30

image: ghcr.io/codeforpdx/tenantfirstaid/backend:latest
image: ghcr.io/codeforpdx/tenantfirstaid/frontend:latest

Two docker compose pull calls a week apart can deploy different code silently. A git SHA or semantic tag makes rollbacks straightforward and deployments auditable. Architecture.md documents that main-<sha> tags are built — those should be used here instead.

4. .env.docker.example LangSmith defaults should match compose file

If the compose file defaults to false (per item 1 above), the example file should match to avoid confusing developers who copy it as-is:

LANGSMITH_TRACING=false
LANGCHAIN_TRACING_V2=false

5. FLASK_SECRET_KEY warning is easy to miss — .env.docker.example line 16

dev-secret-key-unsafe is subtle. A clearer signal with a generation command would help:

# WARNING: Replace with a strong random value before any non-local use.
# Generate one with: python -c 'import secrets; print(secrets.token_hex(32))'
FLASK_SECRET_KEY=dev-secret-key-CHANGE-THIS

Minor / Design Notes

Gunicorn worker count is hardcoded — backend/Dockerfile line 81 uses -w 4. The conventional formula is 2 * CPU_count + 1. With 4 workers on a 2-CPU server this is fine, but a note or WEB_CONCURRENCY env var makes it easier to tune without rebuilding the image.

nginx:alpine is a floating tag — frontend/Dockerfile line 36. On every build this could pull a different patch of nginx. Pinning to a specific version (e.g. nginx:1.27-alpine) or a digest makes builds reproducible. Less critical in dev, but worth noting.

No healthchecks in either Dockerfile — Containers will appear "healthy" to Docker even if the app is unresponsive. A simple HEALTHCHECK instruction (curl/wget for backend, wget for frontend) improves visibility in production and in compose depends_on scenarios.

Frontend nginx only serves on port 80 — In production, TLS is presumably terminated upstream (Nginx reverse proxy is documented in Architecture.md), so this is fine. Worth a comment in docker-compose.prod.yml clarifying the expected network topology.

Documentation

The Architecture.md and CLAUDE.md additions are well-written and accurate. The containerization section is a good addition. One small note: CLAUDE.md documents docker compose exec backend make fmt but the current :ro mounts make this non-functional — that should be resolved along with issue #2 above.

Summary: The core infrastructure is solid. The blocking issues before merge are the LangSmith opt-out default (#1), and the read-only source mounts that contradict the documented Docker workflow (#2). The latest production tag (#3) is worth fixing before this is used to deploy.

github-actions · 2026-02-23T03:46:20Z

docker-compose.yml

+      - MODEL_NAME=${MODEL_NAME:-gemini-2.5-pro}
+      - LOG_LEVEL=${LOG_LEVEL:-DEBUG}
+      - LANGSMITH_API_KEY=${LANGSMITH_API_KEY}
+      - LANGSMITH_TRACING=${LANGSMITH_TRACING:-true}


LangSmith tracing defaults to true, so every developer who runs docker compose up without a .env will send traces and incur API costs. Prefer opt-in:

Suggested change

- LANGSMITH_TRACING=${LANGSMITH_TRACING:-true}

- LANGSMITH_TRACING=${LANGSMITH_TRACING:-false}

github-actions · 2026-02-23T03:46:23Z

docker-compose.yml

+      - LOG_LEVEL=${LOG_LEVEL:-DEBUG}
+      - LANGSMITH_API_KEY=${LANGSMITH_API_KEY}
+      - LANGSMITH_TRACING=${LANGSMITH_TRACING:-true}
+      - LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-true}


Same as above — opt-out:

Suggested change

- LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-true}

- LANGCHAIN_TRACING_V2=${LANGCHAIN_TRACING_V2:-false}

github-actions · 2026-02-23T03:46:28Z

docker-compose.yml

+      - ./backend/tenantfirstaid:/app/tenantfirstaid:ro
+      - ./backend/tests:/app/tests:ro
+      - ./backend/scripts:/app/scripts:ro


These :ro mounts prevent make fmt (ruff format) and make lint --fix from writing back to the source files. But CLAUDE.md documents docker compose exec backend make fmt as the Docker dev workflow — that command will silently do nothing with read-only mounts.

Drop :ro on the source directories; keep it only on the credentials mount below.

Suggested change

- ./backend/tenantfirstaid:/app/tenantfirstaid:ro

- ./backend/tests:/app/tests:ro

- ./backend/scripts:/app/scripts:ro

- ./backend/tenantfirstaid:/app/tenantfirstaid

- ./backend/tests:/app/tests

- ./backend/scripts:/app/scripts

yangm2 self-assigned this Feb 11, 2026

yangm2 added enhancement New feature or request Not ready infrastructure Pull requests related to infrastructure and underlying workflows labels Feb 11, 2026

This comment was marked as outdated.

Sign in to view

codeforpdx deleted a comment from github-actions bot Feb 15, 2026