Mission-critical audit fixes: model cache safety, task race, and stale tests by christopherkarani · Pull Request #40 · christopherkarani/Conduit

christopherkarani · 2026-03-18T19:52:06Z

Summary

This PR delivers the mission-critical audit fixes identified in the 2026-03-18 automation run.

Correctness and Safety Fixes

ModelCache partial-failure safety (P0)

Fixed ModelCache.clearAll() so it no longer wipes in-memory metadata when disk deletion partially fails.
New behavior: successfully deleted entries are removed, failed entries are retained and persisted, then an error is surfaced.
Prevents metadata/data divergence and orphaned-cache corruption.

ModelManager download task race (P1)

Fixed ModelManager.downloadTask(for:) race where callers could observe .taskNotStarted before backing work was wired.
The backing async task is now attached before returning the DownloadTask.

OpenAI streaming hardening (P1)

Hardened tool-call delta parsing to skip malformed deltas with empty id/name.
Prevents upstream malformed payloads from propagating into PartialToolCall precondition crashes.

ChatSession state snapshot hardening

stream(_:) now snapshots config under lock to keep captured state consistent.

API/Test Drift + Build Hygiene

Aligned DiffusionModelDownloaderTests compile guard with production feature flags.
Updated stale provider/model count tests to assert stable invariants after provider-catalog growth.
Updated ProviderType case coverage test to include new providers.
Removed deprecated String(cString:) usage in DeviceCapabilities.
Made GenerationSchema encoding error payload Sendable-safe.
Updated stale registry catalog doc comment.

Regression Tests Added

ModelCacheTests.testClearAllRetainsEntriesWhenDeletionFails
ModelManagerRegressionTests.testDownloadTaskImmediatelyWiresUnderlyingTask

Verification

swift test --filter ModelCacheTests --filter ModelManagerRegressionTests
swift test --filter ModelIdentifierTests
swift test --filter ProtocolCompilationTests
swift test
swift build

All passing in this worktree.

- align DiffusionModelDownloaderTests compile guard with production feature flags - replace deprecated String(cString:) CPU brand parsing path - make schema encoding error payload Sendable-safe - repair brittle model registry/provider count tests for expanded provider catalog - fix ModelCache.clearAll() to preserve metadata for entries whose deletion fails - wire ModelManager.downloadTask(for:) backing task before returning to prevent taskNotStarted races - harden OpenAI streaming parser by skipping malformed tool call deltas with empty id/name - capture ChatSession stream config under lock for consistent snapshot semantics - add regression tests for clearAll partial-failure retention and immediate task wiring

chatgpt-codex-connector · 2026-03-18T19:52:13Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

christopherkarani · 2026-04-08T01:19:41Z

Closing — these fixes were superseded by a164c7c ("Stabilize MLX, Foundation Models, and warning cleanup") which addressed the MLX gating, stale registry tests, and Sendable warnings comprehensively.

christopherkarani closed this Apr 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mission-critical audit fixes: model cache safety, task race, and stale tests#40

Mission-critical audit fixes: model cache safety, task race, and stale tests#40
christopherkarani wants to merge 1 commit intomainfrom
automation/check-frameworks-issues-20260318

christopherkarani commented Mar 18, 2026

Uh oh!

chatgpt-codex-connector bot commented Mar 18, 2026

Uh oh!

christopherkarani commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

christopherkarani commented Mar 18, 2026

Summary

Correctness and Safety Fixes

API/Test Drift + Build Hygiene

Regression Tests Added

Verification

Uh oh!

chatgpt-codex-connector bot commented Mar 18, 2026

Uh oh!

christopherkarani commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant