245 features · 57 skills · 378 tests · 34K+ lines of code
| # | Feature |
|---|---|
| 1 | Push-to-talk via configurable hotkeys (F13/F18/F16) |
| 2 | Wake word detection ("Hey CODEC" + 6 configurable phrases) |
| 3 | Wake energy auto-clamping (50-1500 range with auto-warn on misconfiguration) |
| 4 | WebSocket real-time voice pipeline (PCM16 streaming) |
| 5 | WebSocket auto-reconnect (exponential backoff 1s-30s, max 5 retries) |
| 6 | WebSocket ping/heartbeat (15s keepalive) |
| 7 | Whisper STT integration (whisper-large-v3-turbo, multilingual auto-detect) |
| 8 | Kokoro TTS with warm-up (+ macOS say fallback + disabled mode) |
| 9 | Streaming LLM responses with real-time transcript chunks |
| 10 | Voice interrupt detection (mic RMS threshold cancels TTS) |
| 11 | Screenshot vision context during calls (screencapture + Qwen Vision) |
| 12 | Document input (file picker: .txt, .md, .csv, .json, .py, .pdf, images) |
| 13 | Draft detection and screen-aware reply composition |
| 14 | Live mic energy ring visualization (RMS-driven CSS animations) |
| 15 | Audio playback queue with sequential buffer decoding |
| 16 | Call timer display |
| 17 | State-reactive UI (idle/listening/speaking/processing/analyzing_screen) |
| 18 | Hold-to-talk indicator on avatar |
| 19 | Server webcam photo capture + analysis |
| 20 | Live webcam MJPEG PIP (drag, expand, snapshot) |
| 21 | Targeted memory context injected per voice turn |
| 22 | Skill triggering during voice calls (lazy-loaded from SkillRegistry) |
| 23 | Echo cooldown after TTS (1.2s mic silence post-speak) |
| 24 | Noise word filtering (40+ common Whisper artifacts) |
| 25 | Max context turns limiting (20 turns to keep LLM fast) |
| 26 | Vision Mouse Control — see screen, click UI elements by voice via UI-TARS |
| # | Feature |
|---|---|
| 1 | 250K context window deep chat |
| 2 | Persistent conversation history with sidebar |
| 3 | Multi-file upload (text, PDF extraction, image analysis) |
| 4 | Drag-and-drop file attachment |
| 5 | Image upload + Vision API analysis |
| 6 | Markdown rendering (code blocks, bold, italic, links) |
| 7 | Copy-to-clipboard on any message |
| 8 | Voice input via Web Speech API |
| 9 | Chat mode / Agent mode toggle |
| 10 | 12 pre-built agent crews (dropdown selection) |
| 11 | Custom Agent Builder (name, role, tools, max iterations) |
| 12 | Save/load custom agent configurations |
| 13 | Agent crew scheduling from chat (daily cron) |
| 14 | Web search toggle (auto-trigger on query detection) |
| 15 | Streaming typing indicator |
| 16 | Session auto-save to server |
| 17 | FTS5 memory integration (conversations indexed for search) |
| 18 | Toast notification system |
| 19 | Notification bell with unread count (30s polling) |
| 20 | Server webcam photo + live PIP |
| 21 | Light/dark theme toggle (persistent localStorage) |
| 22 | Session lock (logout) button |
| 23 | Flash Chat (quick command panel with filtered history) |
| 24 | Flash Chat Enter-key send + source-filtered messages |
| # | Feature |
|---|---|
| 1 | FastAPI web dashboard (port 8090) with 75+ API endpoints |
| 2 | PWA manifest (installable on mobile/desktop) |
| 3 | Flash Chat panel |
| 4 | History panel (session browser with full conversation replay) |
| 5 | Audit log panel (filterable by 16 event categories) |
| 6 | Settings panel with full config editing |
| 7 | Config input validation rules |
| 8 | Sensitive field masking (API keys as ****) |
| 9 | Skills list display |
| 10 | Stats grid (system metrics) |
| 11 | Touch ID biometric authentication |
| 12 | PIN code authentication (SHA-256) |
| 13 | PIN brute-force rate limiting (5 attempts + 5-min lockout) |
| 14 | TOTP 2FA (setup, confirm, verify, enable, disable + QR code) |
| 15 | Session management with configurable expiry |
| 16 | Persistent auth sessions across PM2 restarts |
| 17 | CSRF protection (double-submit cookie pattern) |
| 18 | Content Security Policy middleware |
| 19 | E2E encryption (ECDH P-256 + AES-256-GCM) |
| 20 | E2E key persistence across restarts (~/.codec/.e2e_keys.json) |
| 21 | Client-side E2E auto-renegotiation on 428 (all 5 HTML pages) |
| 22 | CORS middleware with restricted origins |
| 23 | Notification system with persistent storage |
| 24 | File upload with drag-and-drop |
| 25 | Voice input (mic button) |
| 26 | Live voice call button |
| 27 | Health check endpoints (/api/health, /api/status) |
| 28 | Cortex neural map (28 nodes, 7 zones, live activity feed) |
| 29 | Editable voice trigger manager (custom triggers per skill) |
| 30 | Keyboard shortcuts reference panel |
| 31 | Trigger persistence (~/.codec/custom_triggers.json) |
| 32 | Screenshot + webcam capture buttons |
| # | Feature |
|---|---|
| 1 | Monaco Editor (full IDE, same engine as VS Code v0.45.0) |
| 2 | Multi-language support (16+ languages + auto-detect) |
| 3 | AI chat panel for vibe coding |
| 4 | Voice input for code descriptions |
| 5 | Code execution (Run button with output console) |
| 6 | Live Preview panel (sandboxed iframe) |
| 7 | Inspect mode for element inspection |
| 8 | Save file to disk |
| 9 | Copy code to clipboard |
| 10 | Save as CODEC Skill |
| 11 | Test Skill (invoke run() function) |
| 12 | Skill Forge modal (3 modes: Paste Code, GitHub URL, Describe) |
| 13 | Project management sidebar (sessions) |
| 14 | Resizable panels (drag handle) |
| 15 | Output console panel |
| 16 | DOMPurify sanitization on all rendered content |
| 17 | Server webcam photo + live PIP |
| 18 | Light/dark theme toggle (syncs Monaco theme) |
| 19 | Skill review + approval workflow (human-in-the-loop) |
| 20 | URL import in Skill Forge (fetch code from GitHub raw URLs) |
| # | Feature |
|---|---|
| 1 | Local multi-agent framework (zero external dependencies, ~800 lines) |
| 2 | Agent dataclass (name, role, tools, max_tool_calls, thinking mode) |
| 3 | Async agent execution with tool-call loop |
| 4 | Tool-call input validation (regex + length checks) |
| 5 | 7 built-in tools (web_search, web_fetch, file_read, file_write, google_docs, shell_execute, image_generate) |
| 6 | Lazy skill tool loading via SkillRegistry |
| 7 | HTTP connection pooling (httpx reuse) |
| 8 | Dangerous command blocking (via is_dangerous()) with audit logging |
| 9 | Google Docs creation with rate-limiting/dedup (60s cooldown) |
| 10 | File path traversal prevention |
| 11 | Output truncation (10K tools, 5K shell) |
| 12 | Structured audit logging |
| 13 | Tasks page: Schedules tab |
| 14 | Tasks page: History tab |
| 15 | Tasks page: Reports tab |
| 16 | Tasks page: Heartbeat tab |
| 17 | Crew status polling (4s interval) |
| 18 | Custom agent creation via API |
| 19 | 12 pre-built crews (Deep Research, Daily Briefing, Competitor Analysis, Trip Planner, Email Handler, Social Media, Code Review, Data Analysis, Content Writer, Meeting Summarizer, Invoice Generator, Custom) |
| 20 | Agent crew scheduling (cron-like, day/hour/minute selection) |
| # | Feature |
|---|---|
| 1 | SkillRegistry with AST-based lazy loading (parse metadata without importing) |
| 2 | Skill dispatch with fallback (match_all_triggers, try-next-on-None) |
| 3 | Skill Marketplace (install, search, list, update, remove, publish) |
MCP tool name shown where it differs from the file name.
| Category | Skills |
|---|---|
| Google Workspace (8) | google_calendar, google_docs, google_drive, google_gmail, google_keep, google_sheets, google_slides, google_tasks |
| Chrome Automation (10) | chrome_automate, chrome_click_cdp, chrome_close, chrome_extract, chrome_fill, chrome_open, chrome_read, chrome_scroll, chrome_search, chrome_tabs |
| System Control (9) | app_switch, brightness, clipboard, file_ops, file_search, network_info, process_manager, system (system_info), terminal, volume_brightness (volume) |
| Vision & Mouse (2) | mouse_control (UI-TARS vision click), screenshot_text |
| AI & Content (6) | AI_News_Digest (ai_news_digest), create_skill, skill_forge, translate, web_search, memory_search |
| Memory Layer (3 — new in v2.2) | memory_search (FTS5 conversations), memory_history (temporal facts), memory_entities (CCF compression map) |
| Utilities (11) | bitcoin_price, calculator, json_formatter, notes, password_generator, pomodoro, qr_generator, reminders, time (time_date), timer, weather |
| Smart Home (1) | philips_hue |
| Media (1) | music (Spotify + Apple Music) |
| Delegation (1) | delegate (lucy — AI persona + task orchestrator) |
| Dev Tools (5) | ax_control, pm2_control, python_exec, scheduler (scheduler_skill), codec (meta-dispatcher) |
| # | Feature |
|---|---|
| 1 | Centralized config system (~/.codec/config.json) |
| 2 | Configurable LLM provider (MLX, OpenAI-compatible, Ollama, cloud) |
| 3 | Configurable Vision model (Qwen VL, port 8082) |
| 4 | UI-TARS integration (dedicated UI-specialist vision, port 8083) |
| 5 | Configurable TTS (Kokoro, macOS say, disabled) |
| 6 | Configurable STT (Whisper HTTP, multilingual auto-detect) |
| 7 | Configurable hotkeys (F-keys or laptop mode) |
| 8 | Dangerous command pattern detection (46+ patterns) |
| 9 | Draft/screen keyword detection |
| 10 | Whisper transcript post-processing (hallucination/stutter removal) |
| 11 | Session runner with resource limits (120s CPU, 512MB RAM) |
| 12 | Session command preview dialog (Allow/Deny) |
| 13 | Context compaction (LLM-based summarization) |
| 14 | MCP Server (skills as MCP tools for Claude Desktop, Claude Code, Cursor, VS Code) |
| 15 | MCP input validation (type checks, 5KB task / 10KB context limits, audit logging) |
| 16 | MCP opt-in/opt-out tool exposure per skill (blocklist for python_exec, terminal, pm2_control, process_manager) |
| 17 | MCP full tool exposure — all 57 skills available as mcp__codec__* tools |
| 18 | MCP tool-name sanitization preserving original SKILL_NAME for registry lookup |
| 19 | MCP memory search + recent memory tools |
| 20 | Tiered Memory Loading — identity.txt L0/L1 boot payload injected into every session (<200 tokens) |
| 21 | Temporal Fact Store — facts table with valid_from/valid_until/superseded_by; auto-supersession on key conflict |
| 22 | CCF Compression — rule-based entity abbreviation + filler stripping for recalled memory blocks (~65% token reduction) |
| 23 | Active facts injection — currently-valid temporal facts auto-added to system prompt on every session build |
| 18 | Search result TTL caching (5-min TTL, 100 entries, thread-safe) |
| 19 | Dual search backends (DuckDuckGo + Serper.dev) |
| 20 | FTS5 full-text search memory (BM25 ranking, injection prevention) |
| 21 | SQLite WAL mode with busy timeout |
| 22 | Heartbeat system (5 parallel service health checks) |
| 23 | Daily database backup with 7-day rotation |
| 24 | Scheduler (cron-like crew scheduling with dedup) |
| 25 | Audit logging across 16 categories (50MB rotation, JSON-line) |
| 26 | Process watchdog (auto-kills stuck processes >500MB RAM, <0.5% CPU) |
| 27 | iMessage agent (wake word trigger, vision, voice notes, 3 smart agents) |
| 28 | Telegram bot (DM support, conversation memory, markdown, voice notes) |
| 29 | AppKit overlay notifications (float above fullscreen, tkinter fallback) |
| 30 | AppleScript paste integration (reliable cross-app clipboard paste) |
| # | Feature |
|---|---|
| 1 | Hold Cmd+R to record, release to paste at cursor |
| 2 | Live typing mode (press L — words appear at cursor in real-time) |
| 3 | Multilingual transcription (Whisper auto-detect, 99 languages) |
| 4 | Draft detection and LLM refinement (grammar, tone, meaning) |
| 5 | Floating recording overlay (orange border, pulsing red dot) |
| 6 | Processing indicator overlay (blue) |
| 7 | Live typing overlay (green, real-time transcript) |
| 8 | Hallucination filter (blocks Whisper noise artifacts) |
| 9 | atexit + SIGTERM cleanup (prevents orphaned subprocesses) |
| 10 | AppleScript paste (replaces unreliable pyautogui on macOS) |
| 11 | PTT lock mode (double-tap for hands-free) |
| 12 | Configurable recording hotkey |
| 13 | Sox audio capture with auto-PATH resolution |
| 14 | Whisper HTTP endpoint integration (port 8084) |
| 15 | PM2 managed service with crash recovery |
| # | Feature |
|---|---|
| 1 | System-wide right-click AI services (works in any app) |
| 2 | Proofread (grammar, spelling, punctuation) |
| 3 | Elevate (make text professional) |
| 4 | Explain (simplify + voice-over via Kokoro TTS) |
| 5 | Translate (any language to English + voice-over via Kokoro TTS) |
| 6 | Reply (compose contextual response with :tone syntax) |
| 7 | Prompt (optimize AI prompts) |
| 8 | Read Aloud (TTS via Kokoro) |
| 9 | Save (save to Apple Notes + local backup) |
| 10 | Clipboard integration (read selection, write result) |
| 11 | AppleScript paste for reliable cross-app insertion |
| 12 | TTS spawned as separate subprocess (survives parent exit) |
| Product | Features |
|---|---|
| CODEC Core | 26 |
| CODEC Chat | 24 |
| CODEC Dashboard | 32 |
| CODEC Vibe | 20 |
| CODEC Agents | 20 |
| CODEC Skills | 60 |
| CODEC Infrastructure | 36 |
| CODEC Dictate | 15 |
| CODEC Instant | 12 |
| TOTAL | 245 |
245 features · 57 skills · 378 tests · 34K+ lines of code
- Live MCP bridge — CODEC exposed as MCP server to Claude Desktop / Claude Code / Cursor. All 57 skills callable from any MCP-compatible client.
- Memory Layer upgrade — three-tier memory with identity boot payload, temporal fact tracking, and CCF rule-based compression.
- MCP audit fixes — skill-name sanitization bug fixed (AI_News_Digest loads), pomodoro stop/status, reminders/notes read path, file_search word-boundary parsing, network_info multi-interface detection.
- F5 live dictation — hands-free typing mode with pipelined audio capture and focus-preserving paste.
Built by AVA Digital LLC · MIT License