
feat: support reasoning_content in OpenAI-compatible (completions) providers#295

Merged
cpsievert merged 5 commits into main from feature/completions-reasoning-content
May 7, 2026
Conversation

@cpsievert
Collaborator

Summary

OpenAI-compatible providers using the Completions API (ChatOpenAICompletions, ChatDeepSeek, ChatOpenRouter, etc.) now extract reasoning_content from model responses and produce ContentThinking objects — matching the behavior already present in the Responses API provider (ChatOpenAI).

A new preserve_thinking parameter controls whether reasoning content is sent back to the API in multi-turn conversations. This is necessary because providers disagree on whether reasoning traces belong in conversation history:

| Provider | Requirement |
| --- | --- |
| DeepSeek V4 (with tool calls) | Must include — omitting causes 400 |
| DeepSeek V4 (without tool calls) | Ignored if included, safe either way |
| DeepSeek legacy deepseek-reasoner | Must exclude — including causes 400 |
| OpenRouter | Should include — recommended for quality |
| Others (Groq, Cloudflare, etc.) | Don't return reasoning_content (N/A) |
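The extraction step can be sketched as follows. This is an illustrative stand-in, not the actual chatlas implementation: ContentThinking here is a minimal local dataclass, and the dict shapes mirror OpenAI-style Completions message and streaming-delta payloads.

```python
from dataclasses import dataclass


@dataclass
class ContentThinking:
    """Minimal stand-in for chatlas's ContentThinking content type."""
    thinking: str


def extract_reasoning(message: dict) -> "ContentThinking | None":
    # OpenAI-compatible providers put reasoning in a non-standard
    # `reasoning_content` field on the message (or streaming delta).
    reasoning = message.get("reasoning_content")
    if reasoning:
        return ContentThinking(thinking=reasoning)
    return None


# Works for a non-streaming choice message...
msg = {"role": "assistant", "content": "42", "reasoning_content": "Let me think..."}
# ...and for a streaming delta chunk:
delta = {"reasoning_content": "step 1"}
```

Providers that never emit reasoning_content (Groq, Cloudflare, etc.) simply yield None here, so no ContentThinking object is produced.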

Changes

  • OpenAICompletionsProvider: Extract reasoning_content from both streaming deltas and non-streaming responses. Handle ContentThinking in turn serialization — drop by default, preserve when preserve_thinking=True.
  • ChatOpenAICompletions: Expose preserve_thinking parameter for users of custom OpenAI-compatible endpoints.
  • ChatOpenRouter: Set preserve_thinking=True (OpenRouter recommends including reasoning traces).
  • ChatDeepSeek: Set preserve_thinking=True (required for V4 thinking models with tool calls; harmlessly ignored for non-thinking responses). Also update the default model from the deprecated deepseek-chat to deepseek-v4-flash.
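The drop-by-default/preserve-on-request behavior described above can be sketched like this. It is a simplified stand-in for the provider's turn serialization: the preserve_thinking parameter and reasoning_content field follow the PR description, while the content classes and helper are illustrative.

```python
from dataclasses import dataclass


@dataclass
class ContentThinking:
    thinking: str


@dataclass
class ContentText:
    text: str


def turn_to_message(contents: list, preserve_thinking: bool = False) -> dict:
    """Serialize one assistant turn into an OpenAI-style message dict."""
    msg = {"role": "assistant", "content": ""}
    for part in contents:
        if isinstance(part, ContentThinking):
            # Dropped by default; sent back only for providers that
            # want it (e.g. DeepSeek V4 with tool calls, OpenRouter).
            if preserve_thinking:
                msg["reasoning_content"] = part.thinking
        elif isinstance(part, ContentText):
            msg["content"] += part.text
    return msg
```

With this shape, each provider wrapper only has to pick the right default for preserve_thinking; the serialization code itself stays identical across providers.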

Motivation

This is the Python equivalent of tidyverse/ellmer#972. The ellmer PR defaults to preserve_thinking=False for DeepSeek based on the old deepseek-reasoner docs, but DeepSeek's current V4 models (which replace deepseek-reasoner and deepseek-chat as of 2026-07-24) actually require reasoning_content back when tool calls are present. We default to True for DeepSeek since it's a no-op for non-thinking responses and required for the tool-call case.

Relationship to #288

This PR overlaps significantly with #288, which also adds reasoning_content support. The key difference is that #288 preserves thinking unconditionally, while this PR adds the preserve_thinking toggle so each provider wrapper can choose the correct behavior. This PR also updates the DeepSeek default model and re-records the VCR cassettes.

One thing #288 includes that this PR does not: reordering tool result messages to precede user content in _turns_as_inputs. That may be worth investigating separately if DeepSeek requires that ordering.

Test plan

  • New unit tests for streaming extraction, non-streaming extraction, thinking drop (default), and thinking preserve
  • All existing provider tests pass (79 tests across 12 providers)
  • DeepSeek VCR cassettes re-recorded against live API with new default model
  • pyright passes with 0 errors across all modified files
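The streaming-extraction tests can be sketched like this (pytest style). The accumulator is a simplified stand-in for how reasoning deltas might be concatenated, not the actual chatlas code.

```python
from dataclasses import dataclass


@dataclass
class ContentThinking:
    thinking: str


def accumulate_thinking(deltas: list) -> "ContentThinking | None":
    """Concatenate reasoning_content across streaming delta chunks."""
    parts = [d["reasoning_content"] for d in deltas if d.get("reasoning_content")]
    return ContentThinking(thinking="".join(parts)) if parts else None


def test_streaming_extraction():
    deltas = [
        {"reasoning_content": "First, "},
        {"reasoning_content": "reason."},
        {"content": "Answer"},  # ordinary content chunks are ignored here
    ]
    assert accumulate_thinking(deltas) == ContentThinking(thinking="First, reason.")


def test_no_reasoning_content():
    assert accumulate_thinking([{"content": "hi"}]) is None
```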

cpsievert added 2 commits May 6, 2026 17:44
Extract `reasoning_content` from streaming deltas and non-streaming
responses in OpenAICompletionsProvider, producing ContentThinking
objects. Add `preserve_thinking` parameter (default False) to control
whether reasoning content is sent back in multi-turn conversations.

Set preserve_thinking=True for OpenRouter (which recommends including
reasoning traces). DeepSeek's default (False) prevents 400 errors
when reasoning_content is included in input messages.

Equivalent of tidyverse/ellmer#972.
…serve_thinking

deepseek-chat is deprecated (2026-07-24) and maps to v4-flash anyway.
V4 thinking models require reasoning_content back for tool-call turns,
so preserve_thinking=True is the correct default.


@cpsievert
Collaborator Author

@copilot resolve the merge conflicts in this pull request

Co-authored-by: cpsievert <1365941+cpsievert@users.noreply.github.com>
Contributor

Copilot AI commented May 7, 2026

@copilot resolve the merge conflicts in this pull request

Resolved by merging the latest origin/main into this branch and fixing the CHANGELOG.md conflict in commit f050c7a. The branch now has no merge conflicts.

Warning

Firewall rules blocked me from connecting to one or more addresses

I tried to connect to the following addresses, but was blocked by firewall rules:

  • api.cloudflare.com
    • Triggering command: /home/REDACTED/work/chatlas/chatlas/.venv/bin/pytest pytest (dns block)
  • api.portkey.ai
    • Triggering command: /home/REDACTED/work/chatlas/chatlas/.venv/bin/pytest pytest (dns block)
  • dummy-databricks-host
    • Triggering command: /home/REDACTED/work/chatlas/chatlas/.venv/bin/pytest pytest (dns block)
  • openaipublic.blob.core.windows.net
    • Triggering command: /home/REDACTED/work/chatlas/chatlas/.venv/bin/pytest pytest (dns block)

If you need me to access, download, or install something from one of these locations, additional firewall configuration will be needed.

