feat: partial turn preservation and cooperative stream cancellation by cpsievert · Pull Request #279 · posit-dev/chatlas

cpsievert · 2026-04-02T17:47:50Z

Summary

PR 2 in the streaming improvements series (after #276). Python port of ellmer's tidyverse/ellmer#951. Adds:

Partial turn preservation: When a stream is interrupted (closed early, cancelled), the accumulated content so far is saved as a partial AssistantTurn with partial_reason set, so conversation state isn't lost
StreamController: A cooperative cancellation mechanism for stream() and stream_async() — callers can request the stream stop cleanly via controller.cancel(), which triggers the partial turn preservation path
Display improvements: Partial turns show [interrupted] in the Chat repr; partial turns are excluded from token accounting and cost calculations

Changes

chatlas/_turn.py: Added partial_reason field to AssistantTurn and merge_content_text helper
chatlas/_stream_controller.py: New StreamController class for cooperative cancellation
chatlas/_chat.py: stream()/stream_async() accept optional controller parameter; _submit_turns/_submit_turns_async wrap streaming in try/finally to preserve partial turns on interruption
chatlas/__init__.py: Export StreamController

Test plan

make check-types passes (0 errors)
VCR tests for partial turn preservation (sync + async)
VCR tests for StreamController cancellation (sync + async)
Snapshot test for [interrupted] display in Chat repr
Existing tests still pass (495 passed, 11 skipped)

Add partial_reason field and is_partial property to AssistantTurn for marking incomplete turns on stream interruption. Add merge_content_text() helper to combine adjacent ContentText/ContentThinking fragments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Restructure _submit_turns and _submit_turns_async to eagerly append a partial AssistantTurn to self._turns before streaming begins. On each chunk, content is appended to the partial turn in-place. On normal completion, the partial turn is replaced with the full turn. On interruption (GeneratorExit, KeyboardInterrupt, CancelledError), the finally block merges adjacent content fragments via merge_content_text(). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Partial turns (from interrupted streams) have no token or cost data. Filter them out in get_cost() and get_tokens() to avoid errors. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Partial assistant turns now show their partial_reason (e.g. [interrupted]) instead of token counts. Token/cost totals in the Chat header exclude partial turns. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Cast Content to ContentUnion for list append compatibility and merge_content_text results. Use isinstance check in finally block instead of accessing is_partial on Turn base type. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

StreamController provides a simple cancel/reset/cancelled/reason API for cooperatively cancelling streaming responses. Exported from chatlas. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Thread StreamController through stream → _chat_impl → _submit_turns (and async equivalents). When controller.cancelled is True, the streaming loop breaks and the partial turn's reason is set from the controller. Also skips tool invocation when cancelled. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Both chat() and chat_async() now create an internal StreamController and thread it through _chat_impl. This ensures the try/finally partial turn machinery is always active, even for non-streaming chat calls. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Capture all content types (not just text) in partial turns so ContentToolRequest etc. aren't silently dropped on interruption - Default-create StreamController when none provided, eliminating all `if controller is not None` guards - Add comments explaining for/else + GeneratorExit interaction - Add thread-safety comment on StreamController.cancel() ordering - Return list[ContentUnion] from merge_content_text to avoid casts Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Introduces TurnAccumulator in chatlas/_turn_accumulator.py mirroring ellmer's R6 class, along with merge_content_text helper and full test coverage in tests/test_turn_accumulator.py. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

- Replace assert with RuntimeError for precondition checks - Narrow update_turn param to ContentUnion (removes cast) - Use model_construct for ContentThinking merge (consistency) - Remove unused ContentToolRequest import from tests Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…ator Delegates partial-turn lifecycle management to TurnAccumulator, replacing the inline for/else + partial turn index tracking with clean begin/update/ complete/finalize calls. Also closes the HTTP response in finally, drops the local merge_content_text (now in _turn_accumulator.py), and updates the test import accordingly. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…tion Four copies of the validate-type/compute-tokens/compute-cost/log pattern (sync/async × streaming/non-streaming) consolidated into one function. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…turn filtering - Add _ensure_ready() to StreamController that warns and auto-resets if already cancelled (aligns with ellmer's as_controller() behavior) - Add _as_controller() helper, replacing redundant StreamController() creation at 6 call sites with one consistent pattern - Widen TurnAccumulator.update_turn to accept Content, removing 2 cast sites and the ContentUnion import from _chat.py - Fix get_tokens() to filter partial turns at any position in history, not just trailing (aligns with ellmer's discard approach) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 15 out of 15 changed files in this pull request and generated 1 comment.

Comments suppressed due to low confidence (2)

chatlas/_chat.py:1271

stream() wraps _chat_impl() but doesn’t explicitly close the underlying generator if the caller closes the wrapper early. Because the partial-turn preservation relies on generator finalization (finally in _submit_turns), it’s safer to ensure generator.close() is called in a finally block inside wrapper() so the partial turn (and provider response) are finalized deterministically (especially on non-refcounted Python implementations).

            controller=controller,
        )

        def wrapper() -> Generator[
            str | ContentThinking | ContentToolRequest | ContentToolResult, None, None

chatlas/_chat.py:1386

Similar to the sync path: stream_async()’s wrapper doesn’t explicitly ensure the underlying async generator is closed when the wrapper is closed early. Adding a try/finally that awaits the inner generator’s aclose() (when available) would make partial-turn preservation and transport cleanup deterministic.

        controller = _as_controller(controller)

        async def wrapper() -> AsyncGenerator[
            str | ContentThinking | ContentToolRequest | ContentToolResult, None
        ]:
            with display:
                async for chunk in self._chat_impl_async(

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Resolve conflicts integrating ContentThinkingDelta streaming changes from main with stream cancellation/partial turn preservation. Also address PR feedback: - Move cancellation check to top of loop iteration for responsiveness - Move warnings import to top-level in _stream_controller.py - Preserve extra metadata when merging ContentThinking fragments - Update VCR cassettes for new default model (gpt-5.4)

Extract thinking-delta phase tracking and content emit/yield logic from the duplicated sync/async streaming loops into TurnAccumulator.process_content() and flush_thinking(). Also add cancellation section to the streaming docs.

cpsievert and others added 11 commits April 2, 2026 12:45

feat: exclude partial turns from token accounting and cost

d9ceae9

Partial turns (from interrupted streams) have no token or cost data. Filter them out in get_cost() and get_tokens() to avoid errors. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: display partial turns with [interrupted] in Chat repr

110c082

Partial assistant turns now show their partial_reason (e.g. [interrupted]) instead of token counts. Token/cost totals in the Chat header exclude partial turns. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: resolve pyright errors in try/finally blocks

b93e7c0

Cast Content to ContentUnion for list append compatibility and merge_content_text results. Use isinstance check in finally block instead of accessing is_partial on Turn base type. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: add StreamController class for cooperative stream cancellation

cd51551

StreamController provides a simple cancel/reset/cancelled/reason API for cooperatively cancelling streaming responses. Exported from chatlas. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

style: auto-format with ruff

6290ce4

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

docs: document controller parameter on stream() and stream_async()

b37be74

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

cpsievert mentioned this pull request Apr 2, 2026

refactor: add stream_content() abstract method to Provider #276

Merged

3 tasks

docs: add changelog entries for stream cancellation and partial turns

736c6bc

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

cpsievert requested a review from Copilot April 2, 2026 19:01

Copilot started reviewing on behalf of cpsievert April 2, 2026 19:02 View session

This comment was marked as outdated.

Sign in to view

cpsievert and others added 7 commits April 2, 2026 14:24

style: auto-format with ruff

383eae1

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

refactor: extract resolve_assistant_turn to deduplicate turn finaliza…

a2e145a

…tion Four copies of the validate-type/compute-tokens/compute-cost/log pattern (sync/async × streaming/non-streaming) consolidated into one function. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge branch 'main' into feat/stream-cancellation

1fb7b2c

cpsievert requested a review from Copilot April 2, 2026 20:45

Copilot AI reviewed Apr 2, 2026

View reviewed changes

Comment thread chatlas/_turn_accumulator.py

cpsievert commented Apr 2, 2026

View reviewed changes

Comment thread chatlas/_chat.py

cpsievert commented Apr 2, 2026

View reviewed changes

Comment thread chatlas/_stream_controller.py Outdated

cpsievert added 3 commits May 7, 2026 19:39

docs: explain why streaming responses need explicit close()

5cb333d

cpsievert added 2 commits May 7, 2026 20:24

chore: move stream cancellation changelog entries to unreleased

1602000

fix: use private _update_turn in TurnAccumulator tests

5b66d8d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: partial turn preservation and cooperative stream cancellation#279

feat: partial turn preservation and cooperative stream cancellation#279
cpsievert wants to merge 24 commits intomainfrom
feat/stream-cancellation

cpsievert commented Apr 2, 2026 •

edited

Loading

Uh oh!

This comment was marked as outdated.

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cpsievert commented Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Test plan

Uh oh!

This comment was marked as outdated.

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cpsievert commented Apr 2, 2026 •

edited

Loading