Plugin: AI chat over the vault — keyword RAG V1 (#70) by thetechjon · Pull Request #145 · ipapakonstantinou/noteser

thetechjon · 2026-06-07T06:04:39Z

Closes #70.

Summary

Adds noteser-ai-chat as a reference plugin under public/plugins/,
built on the v1.2 plugin API (fullscreen view + vault.read.all +
vault.write + VNode events). Brings noteser its first conversational
RAG surface without touching core.

Provider choice

Bring-your-own-key for both OpenAI (gpt-4o-mini default, gpt-4o
opt-in) and Anthropic (claude-sonnet-4-6 default,
claude-haiku-4-5 opt-in). The plugin posts directly to
api.openai.com/v1/chat/completions and api.anthropic.com/v1/messages
from the Worker via the ambient fetch global. No noteser-hosted
inference; nothing routes through any noteser server. Aligns with the
positioning note on #70 ("private, opt-in, bring-your-own-key" — the
issue's defining constraint).

Streaming is Server-Sent Events for both providers. The Worker
consumes the ReadableStream from response.body, decodes the
data: …\n\n events, and pushes one setFullscreenContent per token
delta so the user watches the response render in real time.

RAG approach

Keyword BM25-lite for V1. The flow on each Send:

Extract a bag-of-words from the prompt (extractKeywords).
Lowercase, strip punctuation, drop ~50 inline stopwords, dedupe.
Snapshot the vault via ctx.vault.read.getAllNotes(), cached
per-session for 30 seconds.
Score each note with Σ idf(term) × tf / (tf + 1). Title hits
weigh 2× body hits.
Top 5 notes' bodies, truncated to ~500 chars at a word boundary,
get stitched into a system prompt asking the model to cite by
title in [brackets].

Toggle in Settings to disable RAG; off means a plain LLM call with
no vault content sent.

V2 roadmap (in README): swap keyword scoring for embeddings via the
existing src/utils/embeddings.ts + src/utils/aiClient.ts once a
plugin-side embedding capability lands.

Key storage caveat

API key lives unencrypted in localStorage via the plugin's
per-plugin settings namespace (ctx.setSetting('apiKey', …)). Display
masks all but the last 4 chars. The audit trail (Settings → Plugins
→ Audit log) records vault.write calls only — it never sees the
key, the prompt, or the response. README ships an explicit warning:
"Do not paste a key on a shared machine."

V2 roadmap

Embeddings-based retrieval (using the shipped embeddings.ts /
aiClient.ts).
Citations as clickable VNode link nodes back to source notes.
Per-conversation system-prompt overrides.
network.fetch permission once v1.3 surfaces it, so the install
modal can list api.openai.com / api.anthropic.com explicitly.

Test plan

Ships a v1.2 plugin that opens a fullscreen chat panel, runs a keyword BM25-lite retrieval pass over the vault, and streams an OpenAI or Anthropic completion back into the modal. Bring-your-own-key only; nothing leaves the browser until the user enters a key and hits Send. Key lives in the per-plugin settings namespace, displayed masked, and is never recorded in the audit trail (which logs vault writes only). V1 RAG is keyword-only — extract bag-of-words from the prompt, score each note with `Σ idf × tf/(tf+1)` (title hits weigh 2×), inject the top 5 bodies (truncated to ~500 chars) as a system message. Embeddings are a V2 roadmap item documented in the plugin README. Tests cover the pure surface (keyword extraction, scoring, request shape per provider, SSE parsing for both shapes, key masking, markdown serialisation). No real API calls in tests.

vercel Bot deployed to Preview June 7, 2026 06:06 View deployment

thetechjon merged commit ddc140c into dev Jun 7, 2026
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Plugin: AI chat over the vault — keyword RAG V1 (#70)#145

Plugin: AI chat over the vault — keyword RAG V1 (#70)#145
thetechjon merged 1 commit into
devfrom
feat/ai-chat-plugin-70

thetechjon commented Jun 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

thetechjon commented Jun 7, 2026

Summary

Provider choice

RAG approach

Key storage caveat

V2 roadmap

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant