Skip to content

Dockerizing#1

Draft
stepdi wants to merge 94 commits into
mainfrom
docker
Draft

Dockerizing#1
stepdi wants to merge 94 commits into
mainfrom
docker

Conversation

@stepdi

@stepdi stepdi commented May 14, 2025

Copy link
Copy Markdown
Collaborator

No description provided.

stepdi and others added 30 commits May 13, 2025 22:22
…cking

feat: add cost tracking (clean PR)
…e-mode

Add offline mode support (cherry-pick)
Robert Leonard and others added 23 commits June 1, 2025 01:11
…-stepname-model-selection

Fix: Ensure summarization uses correct model by aligning step name
…-markitdown

Improve Ingestion for Web Documents
…core-filtering

Fix citation score filtering
…n permissions (huggingface#101)

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
…g the full document text into each row (useful when your docs are large)
Changed model to gpt-4o (from gpt-4.1).
Comment thread README.docker.md

The container requires the following environment variables:

- `INPUT_S3_BUCKET`: S3 bucket name for input data

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think we are missing some of the variables here and below in the example docker run command

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants