GitHub - sentinelrca/sentinel: Root cause analysis for AI agents. Detects agent loops, retry storms, and optimization opportunities in LangSmith, Langfuse, Arize Phoenix, and OpenTelemetry traces.

Root cause analysis for AI agents.

SentinelRCA connects to your existing observability tools (LangSmith, Langfuse, Arize Phoenix, OpenTelemetry) and tells you why your AI agents fail, what's slowing them down, and what to fix — not just what happened.

$ sentinel analyze --source langsmith --api-key lsv2_pt_...

  Detector                  Severity  Trace           Evidence
  ──────────────────────────────────────────────────────────────────────────
  agent_loop                HIGH      trace-abc123    PlannerAgent invoked 4×
  sequential_tools          WARNING   trace-def456    search_web + query_db could save 2.1s
  context_cache_opportunity WARNING   trace-ghi789    Input tokens grew 3200→9800 over 6 calls
  missing_session_memory    WARNING   trace-jkl012    7 turns, tokens +340% — no memory tool detected

Why SentinelRCA

Langfuse and LangSmith show you a tree of spans. They tell you what your agent called and when. They don't tell you:

Why your agent is looping between the same two sub-agents
Which tool calls could run in parallel and save 40% of latency
Why your costs are growing unbounded across a multi-turn session
That your agent has no memory layer and your users are repeating themselves

SentinelRCA reconstructs the call graph from your traces and runs deterministic detectors against it to surface specific, actionable fixes.

Quickstart

Connect your observability source in 5 minutes:

Source	Guide
Langfuse	Connect Langfuse →
LangSmith	Connect LangSmith →
Arize Phoenix	Connect Arize Phoenix →

Or use the CLI (no server needed):

cd tools/cli && uv sync
uv run sentinel analyze --source langsmith --api-key lsv2_pt_YOUR_KEY
uv run sentinel analyze --source langfuse  --public-key pk-lf-... --secret-key sk-lf-...

Detectors — 8 open source, all free

Detector	What it catches	Severity
`agent_loop`	Same agent invoked 3+ times — infinite handoff	HIGH
`retry_storm`	Same span retried 3+ times without backoff	HIGH
`retrieval_without_grounding`	Empty retrieval followed by LLM call — hallucination risk	HIGH
`missing_termination_condition`	Unbounded agent workflow with no iteration guard	HIGH
`token_cost_runaway`	Single trace consuming anomalously high tokens	HIGH
`latency_spike`	One span consuming >50% of total trace time	WARNING
`sequential_tools`	Tool calls that could run in parallel	WARNING
`context_cache_opportunity`	Input tokens growing unbounded across LLM calls	WARNING
`missing_session_memory`	Token growth across turns with no memory tool calls	WARNING

All detectors run on trace structure only — no prompt or response content is ever stored by default.

Architecture

Source (LangSmith / Langfuse / Arize / OTLP)
        ↓  connector.pull()
  list[NormalizedSpan]
        ↓  build_graph()
      FlowGraph (NetworkX DiGraph)
        ↓  extract_signals()
         Signals
        ↓  run_detectors()
      list[Insight]  ←  specific recommendation + evidence

Connectors — thin pull adapters per source, always MIT licensed
Graph builder — reconstructs parent-child tree, cycle detection, clock skew correction
Signal extractor — critical path, sequential tool pairs, token growth, retry inference
Detector engine — deterministic pattern matching, no LLMs involved in detection

Self-hosting

# 1. Start infrastructure
task up   # Postgres + ClickHouse + Redis
cd infra/migrations/postgres && uv run alembic upgrade head

# 2. Start services
cd services/api    && uv run uvicorn sentinel_api.main:app --port 8000
cd services/worker && uv run celery -A sentinel_worker.main worker

# 3. Start UI → http://localhost:3001
cp services/ui/.env.local.example services/ui/.env.local
cd services/ui && npm install && npm run dev

Or run the full stack with Docker Compose:

docker compose up

Requirements: Docker, go-task, Python 3.12+, uv, Node.js 20+

Data privacy

Prompt and response content is never stored by default (store_content=False)
Only structural metadata: span IDs, timestamps, token counts, agent names, latency
Fully self-hostable — traces never leave your network
store_content=True is an explicit opt-in per source

Roadmap

M1 — Langfuse connector, flow graph, 2 detectors, CLI
M2 — LangSmith connector, 7 detectors, web UI, PII-safe by default
M3 — Arize Phoenix connector, 8 detectors, workspace API, v1.0 GA
M4 — Detectors 9–17, Slack/PagerDuty alerting, insight lifecycle
M5 — Cross-trace detectors, workflow discovery, Pro tier
M6 — SSO, on-prem Helm, custom detector builder, enterprise tier

Community

GitHub Discussions — questions, ideas, show & tell
Issues — bug reports and feature requests
Building a connector for a source we don't support? Open a discussion first so we can align on the interface.

Contributing

PRs welcome. See CONTRIBUTING.md for the connector and detector authoring guides.

cd tests && uv sync --no-install-project
uv run --no-project pytest unit/   # 340 tests, no Docker needed

License

MIT — connectors and core pipeline.

The commercial detector engine (sentinel-engine) is a separate private package. Free users get all 8 core detectors. See sentinelrca.com for the hosted version.

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
.github		.github
assets		assets
connectors		connectors
docs		docs
examples		examples
infra		infra
proto/sentinel/v1		proto/sentinel/v1
services		services
tests		tests
tools/cli		tools/cli
.coderabbit.yml		.coderabbit.yml
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
Taskfile.yml		Taskfile.yml
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Why SentinelRCA

Quickstart

Detectors — 8 open source, all free

Architecture

Self-hosting

Data privacy

Roadmap

Community

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Why SentinelRCA

Quickstart

Detectors — 8 open source, all free

Architecture

Self-hosting

Data privacy

Roadmap

Community

Contributing

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages