Coagent

Coagent implements the advisor strategy pattern: a cheap executor model handles tasks turn-by-turn, while a powerful advisor model is consulted only when the executor signals it needs help. The result is frontier-level performance at a fraction of the cost.

The typical setup pairs a local model as the executor with a state-of-the-art model (claude-opus-4-6, gpt-4o) as the advisor. The advisor is called sparingly, not on every turn.

Reference: https://claude.com/blog/the-advisor-strategy

Quick Start

Install

pip install coagent
# or with uv:
uv add coagent

Run from CLI

# With Ollama (local)
coagent run --executor ollama/llama3 --advisor ollama/llama3 "Explain REST vs GraphQL tradeoffs"

# With an OpenAI-compatible endpoint (e.g. LM Studio)
coagent run \
  --executor openai/local-model --executor-api-base http://localhost:1234/v1 \
  --advisor openai/gpt-4o \
  "Write a CSV parser in Python"

# With a config file (auto-discovered: config.yaml or config.yml in current directory)
coagent run "Write a CSV parser in Python"

# View a trace
coagent trace traces/run.jsonl

Python API

from coagent import run_task, load_config
from coagent.schemas import CoagentConfig, ModelConfig

config = CoagentConfig(
    executor=ModelConfig(model="ollama/llama3", api_base="http://localhost:11434"),
    advisor=ModelConfig(model="openai/gpt-4o", api_key="..."),
)

result = run_task("Explain REST vs GraphQL tradeoffs", config=config)
print(result.final_answer)
print(result.usage_summary)

Configuration

Coagent automatically loads config.yaml or config.yml from the current directory if either exists. No flag required — just place the file and run.

Copy config.example.yaml and edit for your setup:

executor:
  model: "ollama/llama3"
  api_base: "http://localhost:11434"

advisor:
  model: "openai/gpt-4o"
  api_key: "${OPENAI_API_KEY}"

policy:
  max_advisor_calls: 5
  failure_threshold: 2
  confidence_threshold: 0.4
  stagnation_turns: 4
  cooldown_turns: 2

max_turns: 20
logging:
  level: "INFO"
  trace_file: "traces/run.jsonl"

Architecture

User → CLI / Python API → ExecutorLoop
                              │
                    generate() via LiteLLM → Executor Model
                              │
                    DecisionPolicy.should_consult()
                              │
                    (if triggered) → Advisor Model
                              │
                    Parse AdvisorResponse (Pydantic)
                              │
                    Inject guidance → back to Executor

The executor is always in control. The advisor is a consulted resource — it never produces user-facing output.

Advisor Triggers

The advisor is consulted when any of these fire:

Trigger	Condition
Explicit request	Executor outputs `[NEED_ADVICE]`
Consecutive failures	N turns with failure signals
Low confidence	Executor reports `[CONFIDENCE:0.3]` below threshold
Stagnation	Last N responses have high text overlap

Gates prevent over-consulting: budget cap (max_advisor_calls) and cooldown (cooldown_turns).

Supported Models

Any model supported by LiteLLM:

Local: ollama/llama3, ollama/mistral
OpenAI: openai/gpt-4o, openai/gpt-4o-mini
Anthropic: anthropic/claude-sonnet-4-6
OpenAI-compatible: set api_base in config, or pass --executor-api-base / --advisor-api-base via CLI

Development

uv sync
uv run pytest tests/ -v
uv run ruff check src/ tests/

Limitations (MVP)

Text-only (no tool calling)
Synchronous only (no streaming or async)
Single advisor model

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
examples		examples
src/coagent		src/coagent
tests		tests
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
Taskfile.yml		Taskfile.yml
config.example.yaml		config.example.yaml
pyproject.toml		pyproject.toml
pyrightconfig.json		pyrightconfig.json
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Coagent

Quick Start

Install

Run from CLI

Python API

Configuration

Architecture

Advisor Triggers

Supported Models

Development

Limitations (MVP)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Coagent

Quick Start

Install

Run from CLI

Python API

Configuration

Architecture

Advisor Triggers

Supported Models

Development

Limitations (MVP)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages