Merge pull request #1 from OpenGraph-AI/feat-100234

souvikroy · web-flow · commit 5bdea67ac4b7 · 2026-04-24T12:27:44.000+05:30
Feat 100234
diff --git a/.github/FUNDING.yml b/.github/FUNDING.yml
@@ -0,0 +1,19 @@
+# https://docs.github.com/en/repositories/managing-your-repositorys-settings-and-features/customizing-your-repository/displaying-a-sponsor-button-in-your-repository
+#
+# Populate one or both fields below, then delete the rest.
+# Note: `github:` requires the org to be approved for GitHub Sponsors
+#       (apply at https://github.com/sponsors). Until approved, use `custom:`.
+#
+# An unpopulated FUNDING.yml renders no Sponsor button.
+
+github:    # e.g. OpenGraph-AI
+custom:    # e.g. ["https://opengraph.tech/sponsor"]
+
+# Unused providers — delete or populate:
+# patreon:
+# open_collective:
+# ko_fi:
+# tidelift:
+# liberapay:
+# issuehunt:
+# otechie:
diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md
@@ -0,0 +1,34 @@
+# Contributing to OpenAgent
+
+Short version: open an issue, open a PR, challenge a design decision. All three are welcome.
+
+## What's worth contributing
+
+- **A new agent stage** (or a variant of an existing one). The pipeline is intentionally modular — if you think there should be a `Critic` or `Verifier` stage between Planner and Executor, prototype it and send the PR.
+- **A new provider adapter.** OpenAI-compatible endpoints work out of the box; other SDKs (Anthropic native, Bedrock, etc.) are good fits.
+- **A new recipe.** The `Recipes` section of the README is a living document. If you found a pattern that keeps recurring, document it.
+- **Better tests on the stage contracts.** Every Pydantic schema in `backend/models/schemas.py` is a test surface.
+- **Docs fixes.** Typos, broken links, confusing phrasing — just open the PR.
+
+## How to PR
+
+1. Fork, branch from `main`.
+2. Keep the change small. One stage, one feature, one fix.
+3. If you change a stage contract, update the schema and the stages that produce and consume it in the same PR.
+4. Run the pipeline end-to-end with `python run.py` before sending — a smoke test counts.
+5. Open the PR with a one-paragraph "why" and a before/after if the behavior changed.
+
+## Disagreement is welcome
+
+If you think a design decision in the cookbook is wrong, open an issue titled `design: <thing you disagree with>`. We'd rather be challenged than be wrong in public.
+
+## Code style
+
+- Python: `ruff` defaults, type hints on public functions, Pydantic models for any cross-stage data.
+- Commit messages: imperative mood (*"add context gatherer"*, not *"added context gatherer"*).
+
+## Getting stuck
+
+Open an issue labeled `question`. No issue is too small.
+
+Thanks for making this better.
diff --git a/LICENSE b/LICENSE
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2026 OpenGraph.tech
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
diff --git a/README.md b/README.md
@@ -1,68 +1,121 @@
 <p align="center">
-  <img src="public/logo.svg" alt="OpenGraph.tech" width="96" />
+  <img src="public/logo.svg" alt="OpenAgent by OpenGraph.tech" width="72" />
 </p>
 
-<h1 align="center">OpenGraph.tech</h1>
+<h1 align="center">OpenAgent</h1>
+
+<p align="center"><sub>by <a href="https://opengraph.tech">OpenGraph.tech</a></sub></p>
 
 <p align="center">
   <strong>The open reference pipeline for AI agents that think before they act.</strong>
 </p>
 
 <p align="center">
-  A production-grade blueprint — and a cookbook — for splitting an agent's mind into stages you can observe, test, and replace independently.
+  Intent → Ambiguity → Clarifier → Planner → Executor. Five typed stages. One streaming pipeline. ~4k lines you can read in an afternoon.
 </p>
 
 <p align="center">
+  <a href="https://github.com/OpenGraph-AI/OpenAgent/stargazers"><img src="https://img.shields.io/github/stars/OpenGraph-AI/OpenAgent?style=social" alt="GitHub stars" /></a>
+  <a href="https://github.com/OpenGraph-AI/OpenAgent/commits/main"><img src="https://img.shields.io/github/last-commit/OpenGraph-AI/OpenAgent?color=324D17&style=flat-square" alt="Last commit" /></a>
+  <a href="./LICENSE"><img src="https://img.shields.io/github/license/OpenGraph-AI/OpenAgent?color=324D17&style=flat-square" alt="MIT License" /></a>
   <img src="https://img.shields.io/badge/python-3.10%2B-324D17?style=flat-square" alt="Python 3.10+" />
   <img src="https://img.shields.io/badge/fastapi-async-324D17?style=flat-square" alt="FastAPI" />
   <img src="https://img.shields.io/badge/pydantic-typed%20contracts-324D17?style=flat-square" alt="Pydantic" />
   <img src="https://img.shields.io/badge/LLM-any%20OpenAI--compatible-324D17?style=flat-square" alt="Model agnostic" />
-  <img src="https://img.shields.io/badge/PRs-welcome-324D17?style=flat-square" alt="PRs welcome" />
+  <a href="https://www.linkedin.com/company/opengraph-tech/"><img src="https://img.shields.io/badge/LinkedIn-OpenGraph.tech-0A66C2?style=flat-square&logo=linkedin&logoColor=white" alt="Follow OpenGraph.tech on LinkedIn" /></a>
+</p>
+
+<p align="center">
+  <a href="https://github.com/OpenGraph-AI/OpenAgent"><img src="https://img.shields.io/badge/%E2%AD%90%20Star%20on%20GitHub-324D17?style=for-the-badge&logo=github&logoColor=white" alt="Star on GitHub" /></a>
+  &nbsp;
+  <a href="#demo"><img src="https://img.shields.io/badge/%E2%96%B6%20Watch%20the%2090%E2%80%91second%20demo-324D17?style=for-the-badge" alt="Watch the 90-second demo" /></a>
+  &nbsp;
+  <a href="#quickstart"><img src="https://img.shields.io/badge/%E2%9A%A1%20Quickstart-324D17?style=for-the-badge" alt="Quickstart" /></a>
+</p>
+
+<a id="demo"></a>
+
+<!--
+  GitHub-native video embed.
+  HOW TO POPULATE:
+  1. Open a new issue or draft PR in this repo.
+  2. Drag-and-drop your MP4 (<= 10 MB on free plans, <= 100 MB on paid plans) into the comment box. GitHub auto-uploads
+     and pastes a URL like:   https://github.com/user-attachments/assets/<uuid>
+     (older uploads use https://user-images.githubusercontent.com/...)
+  3. Replace YOUR_VIDEO_URL_HERE below with that URL.
+-->
+<p align="center">
+  <video src="YOUR_VIDEO_URL_HERE"
+         width="720" autoplay loop muted playsinline controls>
+  </video>
 </p>
 
 <p align="center">
-  <a href="#quickstart"><strong>Quickstart</strong></a> ·
-  <a href="#the-five-stages"><strong>The Cookbook</strong></a> ·
-  <a href="#how-it-compares"><strong>How it compares</strong></a> ·
-  <a href="https://opengraph.tech"><strong>opengraph.tech ↗</strong></a>
+  <sub>Not playing? <a href="https://youtu.be/WMqK6OoWpa8">Watch on YouTube ↗</a></sub>
 </p>
 
 ---
 
-## Why OpenGraph
+## The product in one image
 
-Most "AI agent" code you'll read online is one fat prompt wrapped in a loop. It demos well and falls apart the moment a real user types something vague. We built the opposite: a pipeline where every stage of an agent's reasoning is a **first-class, typed, independently testable unit**.
+<p align="center">
+  <img src="public/stages.svg" alt="OpenAgent — the five stages with typed input and output contracts" width="100%" />
+</p>
 
-- 🧠 **Five specialist stages, not one monolith.** Intent → Ambiguity → Clarifier → Planner → Executor. Each with one job, each observable and swappable.
-- 🕸️ **Auto-resolving clarifier.** The agent searches the web for what the web can answer, and only asks the user for what only the user knows. User attention is the most expensive resource — we spend it last.
-- 🔌 **Model-agnostic by default.** Any OpenAI-compatible endpoint works — OpenAI, Azure, Groq, Together, vLLM, Ollama. No SDK lock-in.
-- 🧯 **Graceful degradation.** Missing an API key for Exa, Upstash, or PageIndex? The pipeline downgrades, never crashes. Run locally with `LLM_API_KEY` alone.
-- 🪢 **Typed contracts between every stage.** Pydantic at every boundary. A bad output gets caught at parse time, not six steps later.
-- 🔴 **Streams natively, pauses like a coroutine.** WebSocket-first. Clarification pauses the pipeline mid-flight and resumes cleanly.
+<p align="center"><sub>Each stage has a <strong>typed input</strong> and a <strong>typed output</strong>. The Pydantic schema between any two stages is your test surface — and your debug trail.</sub></p>
 
 ---
 
-## Who this is for
+## What makes this different
+
+> ### *"Most agents fail in one of five places.*
+> ### *OpenAgent is built for all five."*
+
+<table>
+<tr>
+<td width="33%" valign="top">
 
-- **Engineers building their first real agent** who are tired of cargo-culting `while True: llm()` loops and want a mental model that actually scales.
-- **Platform teams** evaluating LangGraph / CrewAI / AutoGen who want something smaller, typed end-to-end, and free of framework ideology.
-- **Technical founders and researchers** shipping LLM products who need to *debug* and *trace* why an agent produced what it did — not just vibe-check it.
+#### 🧠 &nbsp; Five specialists, not one monolith
 
-If you've ever said "the agent is doing something weird and I have no idea what stage broke," this is for you.
+`Intent ▸ Ambiguity ▸ Clarifier ▸ Planner ▸ Executor` — each testable in isolation. When something breaks, you know **exactly where**.
+
+</td>
+<td width="33%" valign="top">
+
+#### 🕸️ &nbsp; Asks humans last
+
+The Clarifier searches the web first, auto-fills what it can, and asks you only for what it could not find.
+
+**One question. Not seven.**
+
+</td>
+<td width="33%" valign="top">
+
+#### 🧯 &nbsp; Degrades, never dies
+
+No Exa? &nbsp;Skips web.
+No Redis? &nbsp;In-memory.
+No RAG? &nbsp;No problem.
+
+**A missing key triggers a fallback, not a crash.**
+
+</td>
+</tr>
+</table>
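
The "degrades, never dies" behaviour above can be sketched as a capability check at startup. This is a minimal illustration, not the repository's actual implementation: `LLM_API_KEY` comes from the Quickstart, while `EXA_API_KEY`, `REDIS_URL`, `PAGEINDEX_API_KEY`, `Capabilities`, and `detect_capabilities` are hypothetical names chosen for the example.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Capabilities:
    """Which optional features are available for this run (hypothetical shape)."""
    web_search: bool
    session_store: str  # "redis" or "memory"
    rag: bool


def detect_capabilities(env: Optional[Mapping[str, str]] = None) -> Capabilities:
    """Map present or missing keys to fallbacks instead of raising.

    Only LLM_API_KEY is treated as mandatory; the other variable
    names are assumptions made for this sketch.
    """
    env = os.environ if env is None else env
    if not env.get("LLM_API_KEY"):
        raise RuntimeError("LLM_API_KEY is required")
    return Capabilities(
        web_search=bool(env.get("EXA_API_KEY")),       # no key: skip web search
        session_store="redis" if env.get("REDIS_URL") else "memory",
        rag=bool(env.get("PAGEINDEX_API_KEY")),        # no key: run without RAG
    )
```

Each stage can then branch on the resulting flags, so a missing optional key changes the path taken rather than stopping the run.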
 
 ---
 
 ## Quickstart
 
 ```bash
-git clone https://github.com/opengraph-tech/agent-auto.git
-cd agent-auto
+git clone https://github.com/OpenGraph-AI/OpenAgent.git
+cd OpenAgent
 pip install -r requirements.txt
 cp .env.example .env                # set LLM_API_KEY at minimum
 python run.py
 ```
 
-Open `http://localhost:8000/static/index.html` and type a fuzzy request. Watch each phase stream into the UI in real time: intent extraction, ambiguity flags, clarifying questions, the plan, and finally the executor producing the answer step-by-step.
+Open `http://localhost:8000/static/index.html` (replace `localhost` with your host if the server runs elsewhere) and type a fuzzy request. Watch each phase stream into the UI in real time: intent extraction, ambiguity flags, clarifying questions, the plan, and finally the executor producing the answer step-by-step.
 
 **Minimum config** — one variable:
 
@@ -87,7 +140,7 @@ docker compose up
 ## The mental model
 
 <p align="center">
-  <img src="public/mental-model.svg" alt="OpenGraph.tech five-stage agent pipeline: raw text → Intent → Ambiguity → Clarifier → Planner → Executor → final output" width="520" />
+  <img src="public/mental-model.svg" alt="OpenAgent five-stage agent pipeline: raw text → Intent → Ambiguity → Clarifier → Planner → Executor → final output" width="520" />
 </p>
 
 Each stage is a specialist. The output of one is the typed input of the next. If any stage misbehaves, you can swap it, mock it, or inspect it without touching the others.
@@ -102,6 +155,20 @@ Read this like a cookbook. Every stage answers a question you'll eventually have
 
 Five questions. Five agents. That's the whole book.
 
+<div align="center">
+
+| # | Stage | In | Out | Mission |
+|:-:|:------|:---|:----|:--------|
+| **01** | 🧠 &nbsp;**Intent** | `str` | `IntentSchema` | Turn fuzz into a typed goal |
+| **02** | ❓ &nbsp;**Ambiguity** | `IntentSchema` | `AmbiguityReport` | Flag the known unknowns |
+| **03** | 🕸️ &nbsp;**Clarifier** | `AmbiguityReport` | `ClarifiedIntent` | Auto-resolve, ask only the rest |
+| **04** | 🗺️ &nbsp;**Planner** | `ClarifiedIntent` | `ExecutionPlan` | Build a DAG of verifiable steps |
+| **05** | ⚡ &nbsp;**Executor** | `ExecutionPlan` | `ExecutionResult` | Run, stream, trace to goal |
+
+</div>
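
The table above can be read as a chain of typed functions. The sketch below is an assumption-laden illustration of that shape: the five class names come from the table, but every field name and all stage logic are hypothetical, and stdlib dataclasses stand in for the project's Pydantic models so the example runs without dependencies.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple


@dataclass(frozen=True)
class IntentSchema:
    goal: str  # hypothetical field


@dataclass(frozen=True)
class AmbiguityReport:
    intent: IntentSchema
    open_questions: Tuple[str, ...] = ()


@dataclass(frozen=True)
class ClarifiedIntent:
    goal: str
    answers: Dict[str, str] = field(default_factory=dict)


@dataclass(frozen=True)
class ExecutionPlan:
    steps: Tuple[str, ...]


@dataclass(frozen=True)
class ExecutionResult:
    outputs: Tuple[str, ...]


def intent_stage(raw: str) -> IntentSchema:
    return IntentSchema(goal=raw.strip())


def ambiguity_stage(intent: IntentSchema) -> AmbiguityReport:
    # Toy heuristic: very short goals are treated as underspecified.
    short = len(intent.goal.split()) < 4
    questions = ("What outcome do you want?",) if short else ()
    return AmbiguityReport(intent=intent, open_questions=questions)


def clarifier_stage(report: AmbiguityReport) -> ClarifiedIntent:
    # A real clarifier would search or ask; here every question stays unresolved.
    answers = {q: "unresolved" for q in report.open_questions}
    return ClarifiedIntent(goal=report.intent.goal, answers=answers)


def planner_stage(clarified: ClarifiedIntent) -> ExecutionPlan:
    return ExecutionPlan(steps=(f"analyze: {clarified.goal}", "draft answer"))


def executor_stage(plan: ExecutionPlan) -> ExecutionResult:
    return ExecutionResult(outputs=tuple(f"done: {s}" for s in plan.steps))


def run_pipeline(raw: str) -> ExecutionResult:
    # The output type of each stage is the input type of the next.
    return executor_stage(
        planner_stage(clarifier_stage(ambiguity_stage(intent_stage(raw))))
    )
```

Because each boundary is a named type, any single stage can be replaced by a stub that returns the same type, which is what makes the stages testable in isolation.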
+
+&nbsp;
+
 ### 1. Intent — "What is the user actually asking?"
 
 **The problem.** Humans don't type goals. They type fragments, moods, half-sentences. *"can you make this better"* is not a specification — it's a vibe. Executing on a vibe gives you a confident-sounding wrong answer.
@@ -302,22 +369,32 @@ Small patterns that keep recurring once you start building real agents.
 
 ## How it compares
 
-|                               | OpenGraph       | LangGraph          | CrewAI           | AutoGen           |
+|                               | OpenAgent       | LangGraph          | CrewAI           | AutoGen           |
 |-------------------------------|-----------------|--------------------|------------------|-------------------|
 | Mental model                  | Typed pipeline  | Graph of nodes     | Role-playing crew | Multi-agent chat  |
 | Typed contracts between stages | ✅ Pydantic     | ⚠️ Optional        | ⚠️ Loose         | ⚠️ Loose          |
 | Auto-resolving clarifier      | ✅ Built-in     | ❌                 | ❌               | ❌                |
 | Model lock-in                 | None            | None               | None             | None              |
-| Framework weight              | ~1.5k LOC       | Heavy              | Heavy            | Heavy             |
+| Framework weight              | ~4k LOC, readable| Heavy             | Heavy            | Heavy             |
 | "Pause for user" as first-class | ✅             | ⚠️ Via interrupts  | ❌               | ⚠️ Via prompts    |
-| Reads like a cookbook         | ✅ By design    | ❌ It's a library  | ❌               | ❌                |
+| Reads like a cookbook         | ✅ By design    | ⚠️ Reference docs, not narrative | ❌ | ❌                |
 
-**When to pick OpenGraph.** You want to understand every moving part, control each prompt, and own your agent's reasoning end-to-end — not inherit someone else's abstraction.
+**When to pick OpenAgent.** You want to understand every moving part, control each prompt, and own your agent's reasoning end-to-end — not inherit someone else's abstraction.
 
 **When to pick a framework instead.** You want to ship fast without thinking about architecture, and the framework's defaults happen to match your domain.
 
 ---
 
+## Who this is for
+
+| | |
+|---|---|
+| 🛠️ **First-agent builders** | Five `while True: llm()` prototypes in a drawer. None ship. **Start here.** |
+| 🏗️ **Framework evaluators** | You've read the LangGraph docs twice and still don't trust the abstractions. This is ~4k lines. Read it in an afternoon. |
+| 🧪 **Production debuggers** | *"It's doing something weird in prod."* OpenAgent tells you exactly which stage lied — with the transcript. |
+
+---
+
 ## Where to start reading the code
 
 If you're here to learn, open files in this order:
@@ -330,15 +407,52 @@ If you're here to learn, open files in this order:
 
 ---
 
+## What's next
+
+- [x] Typed contracts between all five stages
+- [ ] Anthropic-native tool use (beyond OpenAI-compatible endpoints)
+- [ ] Step-level retries with plan-edit capability
+- [ ] First-class observability (OpenTelemetry spans per stage)
+- [ ] Browser extension: capture user intent from any form
+
+*Roadmap subject to change. Open an issue if one of these matters to you and we'll bump it.*
+
+---
+
 ## Contributing
 
-Issues, patches, and hard questions are all welcome. If you disagree with a design choice in the cookbook, open an issue — we'd rather get challenged than be wrong in public. Every stage is intentionally small enough that a PR can meaningfully change one thing at a time.
+Issues, patches, and hard questions are all welcome. See [`CONTRIBUTING.md`](CONTRIBUTING.md) for the short version — fork, keep the change small, smoke-test with `python run.py`, and open the PR with a one-paragraph *why*. Every stage is intentionally small enough that a PR can meaningfully change one thing at a time.
 
 ---
 
 ## About OpenGraph.tech
 
 OpenGraph.tech builds the infrastructure for agents that reason openly, not opaquely. This repo is our reference pipeline — the thing we run, the thing we ship against, and the thing we learn from. If you're building agents in production and want to compare notes, we'd like to hear from you.
 
+Follow us on <a href="https://www.linkedin.com/company/opengraph-tech/">LinkedIn</a> — that's where we post build notes and what shipped this week.
+
+---
+
+<p align="center">
+  <a href="https://github.com/OpenGraph-AI/OpenAgent/graphs/contributors">
+    <img src="https://contrib.rocks/image?repo=OpenGraph-AI/OpenAgent" alt="Contributors to OpenAgent" />
+  </a>
+</p>
+
+## Leave a ⭐ if this saved you a week
+
+This repo is free. The cookbook is free. The walkthrough is [on YouTube](https://youtu.be/WMqK6OoWpa8), free. The only thing we ask back is **a star** — it's the one signal that tells us to write more of these, louder.
+
+<p align="center">
+  <a href="https://github.com/OpenGraph-AI/OpenAgent">
+    <img src="https://img.shields.io/badge/%E2%AD%90%20Star%20on%20GitHub-OpenGraph--AI%2FOpenAgent-324D17?style=for-the-badge&logo=github&logoColor=white" alt="Star OpenAgent on GitHub" />
+  </a>
+  &nbsp;
+  <a href="https://youtu.be/WMqK6OoWpa8">
+    <img src="https://img.shields.io/badge/%E2%96%B6%20Watch%20the%20demo-YouTube-324D17?style=for-the-badge&logo=youtube&logoColor=white" alt="Watch the demo on YouTube" />
+  </a>
+</p>
+
+<p align="center"><sub>One click if you're signed in to GitHub. It genuinely helps.</sub></p>
 
 <p align="center"><sub>Made with intent, by <a href="https://opengraph.tech">OpenGraph.tech</a>.</sub></p>
diff --git a/public/stages.svg b/public/stages.svg