Feature/add phi4 convo by EricApgar · Pull Request #14 · EricApgar/large-language-model

EricApgar · 2026-03-08T21:39:46Z

Mainly brings the Phi4 model up to date so that it is used in a similar way to the GPT model.

Closing out some issues that were technically closed by older PRs.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

EricApgar · 2026-03-15T02:53:11Z

Test suite added

Added a pytest test suite and GitHub Actions workflow as part of this PR.

New files:

tests/test_gpt_oss_20b.py — loads the real GPT model and calls ask(), asserting a non-empty string response
tests/test_phi4_multimodal_instruct.py — loads the real Phi4 model and calls ask() with a synthetic image, asserting a non-empty string response
.github/workflows/tests.yml — triggers on PRs and pushes to main
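The PR text does not show how the synthetic image for the Phi4 test is built. As a rough sketch only, assuming Pillow is available, a small in-memory image could be generated like this. The helper name `make_synthetic_image` and the shape drawn are illustrative, not taken from the repository:

```python
from PIL import Image, ImageDraw


def make_synthetic_image(size: int = 64) -> Image.Image:
    """Build a small RGB image: white background with a red square in the middle.

    Hypothetical helper; the real test may construct its image differently.
    """
    image = Image.new("RGB", (size, size), color=(255, 255, 255))
    draw = ImageDraw.Draw(image)
    quarter = size // 4
    # Pillow treats both corners of the rectangle as inclusive.
    draw.rectangle(
        [quarter, quarter, size - quarter, size - quarter],
        fill=(255, 0, 0),
    )
    return image


image = make_synthetic_image()
```

Generating the image in memory keeps the test free of binary fixture files, and the content is simple enough that any multimodal model should accept it.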

Key design decisions:

Tests call the real models rather than mocking them: LLM output is not consistent enough to assert on its content, so the tests only check that the model runs without errors
Model cache path is passed via LLM_MODEL_CACHE as an inline environment variable (e.g. LLM_MODEL_CACHE=/path/to/cache pytest) so it doesn't persist in the shell
Each model fixture uses scope='module' to load the model once per file, then tears down with del + torch.cuda.empty_cache() so both models can be tested sequentially without exceeding GPU memory
Tests skip automatically if LLM_MODEL_CACHE is not set, so CI doesn't hard-fail without model weights
The workflow is configured for a self-hosted runner (a machine with a GPU and local model weights). See Settings → Actions → Runners to register one, and add LLM_MODEL_CACHE as a repository secret.
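The fixture pattern described above might look roughly like the following. This is a sketch under stated assumptions, not the code in this PR: `FakeModel` is a hypothetical stand-in for the real model wrapper, the helper names are made up for illustration, and `torch` is imported optionally so the sketch also runs on a machine without a GPU stack.

```python
import gc
import os
from typing import Optional

import pytest

try:  # torch is optional here so the sketch runs without a GPU stack
    import torch
except ImportError:
    torch = None


class FakeModel:
    """Hypothetical stand-in for the repo's model wrapper (not the real class)."""

    def __init__(self, cache_dir: str) -> None:
        self.cache_dir = cache_dir

    def ask(self, prompt: str) -> str:
        return f"echo: {prompt}"


def cache_dir_from_env() -> Optional[str]:
    """Return LLM_MODEL_CACHE, or None when it is unset or empty."""
    return os.environ.get("LLM_MODEL_CACHE") or None


def release_gpu_memory() -> None:
    """Collect garbage, then release cached CUDA memory if torch and a GPU exist."""
    gc.collect()
    if torch is not None and torch.cuda.is_available():
        torch.cuda.empty_cache()


@pytest.fixture(scope="module")
def model():
    """Load the model once per test file; skip when no cache path is given."""
    cache_dir = cache_dir_from_env()
    if cache_dir is None:
        pytest.skip("LLM_MODEL_CACHE is not set")
    loaded = FakeModel(cache_dir)  # the real suite would load model weights here
    yield loaded
    # Teardown: drop the reference and free GPU memory before the next file runs.
    del loaded
    release_gpu_memory()


def test_ask_returns_non_empty_string(model):
    response = model.ask("Say hello.")
    assert isinstance(response, str)
    assert response.strip()
```

With `scope="module"`, pytest runs the teardown after the last test in the file, so the next test file starts with the previous model already released.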

To run locally:

uv sync --extra all
LLM_MODEL_CACHE=/home/yourname/Repos/model_cache pytest

EricApgar added 6 commits March 7, 2026 14:10

Updated gpt start index function to return type hint.

c951b1f

Fixed Phi4 model inputs to avoid bad token wrapping.

92b061b

Updated examples.

448c4b1

Removed bad input arg of temperature.

89bb7ea

Commented out several unused input params for ask() method.

3d9c5f0

Updated version in project file.

f8b6f84

EricApgar self-assigned this Mar 8, 2026

EricApgar added the enhancement label (New feature or request) Mar 8, 2026

EricApgar and others added 6 commits March 8, 2026 17:23

Added some fixes to GPT model for extracting clean output from mangled response.

e1a210d

Added a pipeline specific version of GPT-oss to compare against generate and see if the mangled output still happened.

75fedab

Misc testing changes.

5cba18b

Updated pipeline version of gpt model.

cc0a4b3

Renamed gpt files.

9d7282b

Added pytest test suite and GitHub Actions workflow.

3487b70

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Updated readme.

b81a504

EricApgar merged commit 245c09b into main Mar 15, 2026
1 check failed

EricApgar deleted the feature/add-phi4-convo branch March 15, 2026 02:56

EricApgar merged 13 commits into main from feature/add-phi4-convo
