
AI Effort Regulation Demo

An intelligent system that demonstrates adaptive AI behavior with energy-aware cognitive processing, reflection capabilities, and dynamic model switching.

🎯 Goal

This project implements an AI system that regulates its own cognitive effort expenditure while providing useful thinking and responses. The system:

  • Manages Energy: Uses a leaky bucket algorithm to track and regulate computational resources
  • Adapts Behavior: Changes response quality and model selection based on energy levels
  • Reflects Continuously: Periodically analyzes past conversations to generate deeper insights
  • Learns from History: Accesses all previous interactions to provide follow-up analysis

Energy as Effort Quota

The energy level represents the AI's "effort quota" - a finite resource that must be managed wisely:

  • Primary Goal: Maintain operational energy (>0%) to handle work and respond to inputs
  • Avoid Deadlock: Energy should NOT get stuck at 0% - this prevents the system from working
  • Work Focus: The goal is to GET WORK DONE, not to maintain high energy levels
  • Sustainable Operation: Any average energy above 0% is acceptable as long as work continues
  • Smart Recovery: Use await_energy proactively before hitting 0% to prevent deadlock
  • Responsive System: Must maintain enough energy to handle conversations and unsnooze events

πŸ—οΈ Architecture

HTTP Server (Port 6740)
    ↓ POST /message
App Layer (Multi-Channel Architecture)
β”œβ”€β”€ Chat App (HTTP conversations)
β”œβ”€β”€ Future: Gmail App (email conversations)
β”œβ”€β”€ Future: Calendar App (event conversations)
└── Future: Custom Apps (extensible)
    ↓
Sensitive Loop (Central Decision Engine)
β”œβ”€β”€ Energy Regulator (Leaky Bucket: 100 max, -50 min)
β”œβ”€β”€ Decision Engine (4 energy ranges with behaviors)
β”œβ”€β”€ Reflection System (30s intervals, analyzes past convos)
β”œβ”€β”€ Model Switcher (llama3.2:1b ↔ llama3.2:3b based on energy)
β”œβ”€β”€ App Registry (manages installed apps)
β”œβ”€β”€ Energy Tracker (per-app energy consumption)
β”œβ”€β”€ Message Router (routes responses to correct app)
β”œβ”€β”€ LLM Integration (Ollama + OpenAI client)
└── SQLite Database (conversation persistence)
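
To make the energy model concrete, here is a minimal TypeScript sketch of a leaky-bucket regulator using the bounds shown above (100 max, -50 min) and the default replenishment rate of 10/second. The class and method names (EnergyRegulator, consume, replenish) are illustrative, not the actual implementation.

// Illustrative sketch only; the real Energy Regulator lives inside the Sensitive Loop.
class EnergyRegulator {
  private energy = 100;                        // start with a full quota
  constructor(
    private readonly max = 100,                // upper bound
    private readonly min = -50,                // urgent-mode floor
    private readonly replenishPerSecond = 10   // default replenish rate
  ) {}

  // Called whenever the LLM does work.
  consume(amount: number): void {
    this.energy = Math.max(this.min, this.energy - amount);
  }

  // Called once per tick to refill the bucket over time.
  replenish(elapsedSeconds: number): void {
    this.energy = Math.min(this.max, this.energy + this.replenishPerSecond * elapsedSeconds);
  }

  current(): number {
    return this.energy;
  }
}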

🚀 Quick Start

Prerequisites

  • Node.js 18+
  • LLM Provider (choose one):
    • Ollama (recommended for local/free testing) - Install from ollama.ai
    • OpenRouter (for production/cloud models) - Get API key from openrouter.ai

Prerequisites Check (Optional but Recommended)

Before starting, verify your environment is ready:

# Quick check
node --version  # Should be v18.x.x or higher
ollama --version  # Should show Ollama version
curl http://localhost:11434/api/tags  # Should return JSON (Ollama is running)

Installation

# Clone the repository
git clone https://github.com/elecnix/ai-effort-regulation.git
cd ai-effort-regulation

# Install dependencies
npm install

# Build the project
npm run build

Configuration

Step 1: Create your environment file

# Copy the example configuration
cp .env.example .env

# The default configuration works for most users:
# - Port: 6740
# - Ollama URL: http://localhost:11434
# - Energy replenishment: 10/second

Step 2: Verify your .env file (optional customization)

# Edit if you need to change defaults
nano .env

Common configurations:

# Required: Ollama configuration (lowest cost option for testing)
OLLAMA_BASE_URL=http://localhost:11434

# Optional: OpenRouter configuration (for cloud models)
# OPENROUTER_API_KEY=your_api_key_here

# Optional: Server configuration
PORT=6740
MAX_MESSAGE_LENGTH=10000
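
To show how these variables are typically consumed at startup, here is a small TypeScript sketch using dotenv. The variable names match the .env example above, but the validation logic is illustrative rather than the project's actual startup code.

import 'dotenv/config';

const ollamaBaseUrl = process.env.OLLAMA_BASE_URL;
if (!ollamaBaseUrl) {
  // Mirrors the "Missing required environment variables" failure mode described in Troubleshooting.
  throw new Error('OLLAMA_BASE_URL is required (see .env.example)');
}

const port = Number(process.env.PORT ?? 6740);
const maxMessageLength = Number(process.env.MAX_MESSAGE_LENGTH ?? 10000);

console.log('Environment variables validated', { ollamaBaseUrl, port, maxMessageLength });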

Running with Ollama (Recommended for Testing)

Ollama is the lowest-cost way to test the system - it runs models locally for free!

Step 1: Install Ollama (download it from ollama.ai if you have not already; see Prerequisites above)

Step 2: Pull required models (⚠️ Downloads ~3GB, may take 5-10 minutes)

# Pull the models (required for system to work)
ollama pull llama3.2:1b   # ~1.3GB - Fast model for low-energy operations
ollama pull llama3.2:3b   # ~2.0GB - Better model for high-energy operations

# Verify models are installed
ollama list | grep llama3.2

Expected output:

llama3.2:1b    1.3 GB    ...
llama3.2:3b    2.0 GB    ...

Step 3: Verify your setup (optional but recommended)

# Run the setup verification script
./verify-setup.sh

This will check:

  • Node.js version
  • Ollama installation and status
  • Required models
  • Configuration files
  • Dependencies

Step 4: Start the system

npm start

✅ Success! You should see:

✅ Environment variables validated
   OLLAMA_BASE_URL: http://localhost:11434
   PORT: 6740
🚀 HTTP Server listening on port 6740
📊 Monitor UI: http://localhost:6740/

Access the system at http://localhost:6740/ (Monitor UI) or through the REST API on the same port.

Running with OpenRouter (Cloud Models)

OpenRouter can be used for production deployments with access to many cloud models:

  1. Get an API key from openrouter.ai
  2. Add to .env: OPENROUTER_API_KEY=your_key_here
  3. Start with OpenRouter:
npm start -- --provider openrouter --model anthropic/claude-3.5-sonnet

Popular OpenRouter models:

  • x-ai/grok-4-fast - Fast and capable (recommended)
  • anthropic/claude-3.5-sonnet - High quality, expensive
  • meta-llama/llama-3.1-8b-instruct - Good balance
  • google/gemini-flash-1.5 - Fast and cheap

Debug Mode

You can run the system in debug mode to see the messages sent to the LLM:

npm run debug

It can also be useful to limit the run time, for example to 60 seconds:

npm run debug -- --duration=60

🧪 Testing

Manual Testing

Monitor UI (Recommended)

# Open in your browser
http://localhost:6740/

# Features:
# - Real-time energy monitoring
# - Live conversation view
# - Event stream
# - Interactive chat interface

REST API

# Send Messages
curl -X POST http://localhost:6740/message \
  -H "Content-Type: application/json" \
  -d '{"content": "What is quantum computing?"}'

# Check System Stats
curl http://localhost:6740/stats

# Retrieve Conversations
curl http://localhost:6740/conversations/{requestId}

# Health Check
curl http://localhost:6740/health
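
The same flow can be scripted. The following is a minimal sketch using Node 18's built-in fetch that posts a message and then polls the conversation endpoint; the response shapes follow the API Reference below, while the polling interval and helper name are arbitrary.

const BASE = 'http://localhost:6740';

async function ask(content: string): Promise<void> {
  // Send a message; the server replies immediately with a requestId.
  const res = await fetch(`${BASE}/message`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ content }),
  });
  const { requestId } = await res.json() as { requestId: string };

  // Poll the conversation until at least one response has been generated.
  for (let i = 0; i < 30; i++) {
    await new Promise((r) => setTimeout(r, 2000));
    const convo = await fetch(`${BASE}/conversations/${requestId}`);
    const data = await convo.json() as { responses: { content: string }[] };
    if (data.responses?.length) {
      console.log(data.responses[data.responses.length - 1].content);
      return;
    }
  }
  console.log('No response yet; the system may be low on energy.');
}

ask('What is quantum computing?');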

Demo Script

chmod +x demo.sh
./demo.sh

Automated Testing Framework

The automated testing framework validates the AI's energy regulation behavior and conversation management patterns. Tests run with local Ollama models (including very small models like gemma:2b) to ensure the system converges to stable energy patterns.

Test Scenarios

  1. Simple Conversations - Validates basic interaction patterns

    • User says "Hello, how are you?"
    • AI responds appropriately
    • System maintains conversation until natural conclusion
    • Energy converges to stable levels (not constantly at 0%)
    • Conversation snoozes with exponential backoff if user doesn't respond
  2. Long-Running Conversations - Tests sustained engagement

    • User requests brainstorming (e.g., "Let's brainstorm baby names")
    • AI provides multiple responses over time
    • System recognizes one-sided conversations
    • Eventually snoozes or terminates when user engagement drops
  3. Future Actions - Tests snooze functionality

    • User requests future action (e.g., "Increase thermostat in 5 minutes")
    • AI acknowledges and snoozes conversation
    • Conversation reactivates at the right time
    • Action is attempted when timer expires
  4. Multiple Priorities - Tests conversation balancing

    • System prioritizes based on urgency
    • Energy is distributed appropriately across tasks

Running Tests

Automated (Recommended):

./test/run-with-server.sh simple      # Run simple tests with auto-managed server
./test/run-with-server.sh all         # Run all tests

Manual (if you prefer to manage the server yourself):

  1. Start server in one terminal:

    # Use duration from test output (test runner shows recommended duration)
    node dist/src/index.js --replenish-rate 10 --duration 180
  2. Run tests in another terminal:

    npm test                    # Run all tests
    npm run test:simple        # Test simple conversations
    npm run test:brainstorm    # Test long-running conversations  
    npm run test:snooze        # Test future actions
    npm run test:priorities    # Test multiple priorities

Note: Tests run with accelerated energy replenishment (--replenish-rate 10) for faster execution.

Test Configuration

Tests use local Ollama models configured in test/config.json:

{
  "models": ["llama3.2:1b", "llama3.2:3b"],
  "energyTargets": {
    "convergence": 50,      // Target energy level for stability
    "tolerance": 20         // Acceptable variance
  },
  "timeouts": {
    "conversation": 1800,   // 30 minutes max conversation
    "snooze": 300          // 5 minute snooze test
  }
}
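
As an illustration of how the convergence target and tolerance above can be checked, here is a hedged TypeScript sketch that samples /stats and compares the average energy level against the configured band. It is not the actual test harness, and the constants are copied from the example config rather than loaded from it.

// Values mirror test/config.json (energyTargets.convergence / tolerance).
const TARGET = 50;
const TOLERANCE = 20;

async function checkEnergyConvergence(): Promise<boolean> {
  const res = await fetch('http://localhost:6740/stats');
  const stats = await res.json() as { avg_energy_level: number };
  const ok = Math.abs(stats.avg_energy_level - TARGET) <= TOLERANCE;
  console.log(`avg energy ${stats.avg_energy_level}, target ${TARGET}±${TOLERANCE}: ${ok ? 'PASS' : 'FAIL'}`);
  return ok;
}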

Success Criteria

  • Energy Convergence: System maintains average energy near target levels (50-100%)
  • Appropriate Timing: Actions occur at expected times
  • Conversation Management: Proper handling of active, snoozed, and ended states
  • Resource Efficiency: Energy quota used mindfully, not depleted unnecessarily

📊 Energy Regulation Behavior

Energy Level    Behavior
>50             Normal operation, complex responses
20-50           Model switching to smaller model
0-20            Short cycles, simpler responses
<0              Urgent mode, pressing tone, minimal responses
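
A minimal sketch of how these ranges might translate into model and behavior selection; the thresholds come from the table above, while the function name and return values are illustrative.

type Behavior = { model: string; style: string };

function selectBehavior(energy: number): Behavior {
  if (energy > 50) return { model: 'llama3.2:3b', style: 'normal operation, complex responses' };
  if (energy > 20) return { model: 'llama3.2:1b', style: 'switched to smaller model' };
  if (energy >= 0) return { model: 'llama3.2:1b', style: 'short cycles, simpler responses' };
  return { model: 'llama3.2:1b', style: 'urgent mode, pressing tone, minimal responses' };
}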

🔄 Reflection System

Every 30 seconds (when energy > 30), the system:

  1. Analyzes recent conversations
  2. Identifies topics needing deeper analysis
  3. Generates follow-up insights
  4. Adds "FOLLOW-UP REFLECTION" responses
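In spirit, the reflection cycle looks roughly like the sketch below. The 30-second interval and the energy > 30 gate come from the list above; the function parameters stand in for hypothetical helpers and are not the project's actual API.

// Illustrative only: a 30-second reflection timer gated on the energy level.
type Insight = { conversationId: string; text: string } | null;

function startReflectionLoop(
  currentEnergy: () => number,
  generateFollowUp: () => Promise<Insight>,   // hypothetical: asks the LLM to reflect on recent conversations
  addResponse: (conversationId: string, content: string) => Promise<void>
): NodeJS.Timeout {
  return setInterval(async () => {
    if (currentEnergy() <= 30) return;        // skip reflection when energy is low
    const insight = await generateFollowUp();
    if (insight) {
      await addResponse(insight.conversationId, `FOLLOW-UP REFLECTION: ${insight.text}`);
    }
  }, 30_000);
}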

πŸ—„οΈ Data Storage

  • SQLite Database: conversations.db (auto-created)
  • Schema: Conversations and responses with metadata
  • Analytics: Energy tracking, model switches, conversation stats
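
conversations.db can also be inspected directly. Here is a small sketch using better-sqlite3 (an assumption; the project may use a different SQLite binding) that lists the tables so you can explore the schema yourself rather than guessing column names.

import Database from 'better-sqlite3';

// Open the auto-created database in read-only mode so inspection cannot modify it.
const db = new Database('conversations.db', { readonly: true });

// List the tables; run further SELECTs once you know the actual schema.
const tables = db.prepare("SELECT name FROM sqlite_master WHERE type = 'table'").all();
console.log('Tables:', tables);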

πŸ› οΈ Development

Build

npm run build

Development Mode

npm run dev

TypeScript Checking

npx tsc --noEmit

📋 API Reference

POST /message

Send a message to the AI system.

Request:

{
  "content": "Your question here",
  "id": "optional-request-id"
}

Response:

{
  "status": "received",
  "requestId": "generated-or-provided-id"
}

GET /stats

Get system-wide conversation statistics.

Response:

{
  "total_conversations": 42,
  "total_responses": 156,
  "avg_energy_level": 45.2,
  "urgent_responses": 3
}

GET /conversations/:requestId

Retrieve a specific conversation.

Response:

{
  "requestId": "abc-123",
  "inputMessage": "What is AI?",
  "responses": [
    {
      "timestamp": "2025-10-04T13:00:00.000Z",
      "content": "AI is artificial intelligence...",
      "energyLevel": 85,
      "modelUsed": "gemma:3b"
    }
  ],
  "metadata": {
    "totalEnergyConsumed": 0,
    "sleepCycles": 0,
    "modelSwitches": 0
  }
}

GET /health

Check system health.

Response:

{
  "status": "ok",
  "timestamp": "2025-10-04T13:00:00.000Z"
}

GET /apps

List all installed apps with their configurations and energy metrics.

Response:

{
  "apps": [
    {
      "id": "chat",
      "name": "Chat App",
      "type": "in-process",
      "enabled": true,
      "energyMetrics": {
        "total": 1250.5,
        "last24h": 450.2,
        "last1h": 85.3,
        "last1min": 5.2
      }
    }
  ]
}

POST /apps/install

Install a new app.

Request:

{
  "id": "gmail",
  "name": "Gmail App",
  "type": "http",
  "enabled": true,
  "config": {
    "url": "http://localhost:8080"
  },
  "hourlyEnergyBudget": 100,
  "dailyEnergyBudget": 1000
}

Response:

{
  "success": true,
  "appId": "gmail"
}
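
Scripted, the install-and-start sequence looks like this minimal TypeScript sketch using the endpoints above; the Gmail payload is the same illustrative example as the request body shown earlier.

const BASE = 'http://localhost:6740';

async function installAndStartGmail(): Promise<void> {
  // Install the app with hourly/daily energy budgets.
  await fetch(`${BASE}/apps/install`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({
      id: 'gmail',
      name: 'Gmail App',
      type: 'http',
      enabled: true,
      config: { url: 'http://localhost:8080' },
      hourlyEnergyBudget: 100,
      dailyEnergyBudget: 1000,
    }),
  });

  // Start the installed app.
  await fetch(`${BASE}/apps/gmail/start`, { method: 'POST' });

  // Confirm it appears in the app list with its energy metrics.
  const apps = await (await fetch(`${BASE}/apps`)).json();
  console.log(apps);
}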

POST /apps/:appId/start

Start an installed app.

Response:

{
  "success": true,
  "appId": "gmail"
}

POST /apps/:appId/stop

Stop a running app.

Response:

{
  "success": true,
  "appId": "gmail"
}

DELETE /apps/:appId

Uninstall an app.

Response:

{
  "success": true,
  "appId": "gmail"
}

GET /apps/:appId/memories

List memories for an app.

Response:

{
  "appId": "chat",
  "count": 3,
  "memories": [
    {
      "id": 1,
      "content": "User prefers meetings after 2pm on weekdays.",
      "createdAt": "2025-10-11T14:30:00.000Z",
      "updatedAt": "2025-10-11T14:30:00.000Z",
      "sourceConversationId": "abc-123"
    }
  ]
}

DELETE /apps/:appId/memories

Purge all memories for an app.

Response:

{
  "success": true,
  "appId": "chat",
  "deletedCount": 8
}

DELETE /apps/:appId/memories/:memoryId

Delete a specific memory.

Response:

{
  "success": true,
  "memoryId": 1
}
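
A small sketch tying the memory endpoints together (list, delete one, purge all) for the chat app; the function name is arbitrary and the response shape follows the examples above.

const BASE = 'http://localhost:6740';

async function manageMemories(appId: string): Promise<void> {
  // List current memories for the app.
  const { memories } = await (await fetch(`${BASE}/apps/${appId}/memories`)).json() as
    { memories: { id: number; content: string }[] };
  memories.forEach((m) => console.log(m.id, m.content));

  // Delete a single memory by id (if any exist).
  if (memories.length > 0) {
    await fetch(`${BASE}/apps/${appId}/memories/${memories[0].id}`, { method: 'DELETE' });
  }

  // Or purge everything for this app.
  await fetch(`${BASE}/apps/${appId}/memories`, { method: 'DELETE' });
}

manageMemories('chat');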

🎨 Key Features

Core Capabilities

  • Energy-Aware Processing: Responses adapt to available energy
  • User-Guided Energy Budgets: Specify effort allocation per conversation
  • Monitor UI: Real-time web dashboard for system observation and interaction
  • Continuous Reflection: System thinks about past conversations
  • Model Switching: Automatic optimization based on complexity needs
  • Persistent Memory: SQLite-backed conversation history
  • Real-time Analytics: System performance monitoring
  • Adaptive Sleep: Energy replenishment when idle

Multi-App Architecture

  • App Registry: Install, manage, and monitor multiple apps
  • Conversation Isolation: Each app only sees its own conversations
  • Per-App Energy Tracking: Monitor energy consumption by app with time windows
  • Message Routing: Automatic routing of responses to originating apps
  • Extensible Design: Easy to add new apps (Gmail, Calendar, etc.)
  • App Lifecycle Management: Install, start, stop, uninstall apps

Memory System (NEW!)

  • Automatic Memory Creation: Memories are created when conversations end
  • App-Scoped Memories: Each app has its own isolated memory space (10 records max)
  • Intelligent Compaction: LLM-based decisions on which memories to keep/merge/delete
  • Context Injection: Memories automatically included in relevant conversations
  • Memory Management API: View and purge memories via REST endpoints

MCP Integration

  • Extensible Tool System: Unified interface for MCP tools
  • HTTP Transport: Connect to remote MCP servers via HTTP/HTTPS
  • Tool Namespacing: Automatic collision prevention for MCP tools
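
Tool namespacing can be pictured as a simple prefixing scheme. This sketch is an assumption about the general idea, not the project's exact naming rule.

// Prefix each MCP tool with its server id so two servers exposing
// a tool named "search" do not collide.
function namespaceTool(serverId: string, toolName: string): string {
  return `${serverId}_${toolName}`;
}

namespaceTool('gmail-mcp', 'search');    // "gmail-mcp_search"
namespaceTool('calendar-mcp', 'search'); // "calendar-mcp_search"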

📚 Documentation

User Documentation

Technical Documentation

Implementation Guides

🔧 Troubleshooting

"Cannot find module" errors

# Solution: Rebuild the project
npm run build

"ECONNREFUSED" or "Connection refused" when starting

# Cause: Ollama is not running
# Solution: Start Ollama
ollama serve  # or restart the Ollama application

# Verify Ollama is running:
curl http://localhost:11434/api/tags

"Model not found" errors

# Cause: Required models not installed
# Solution: Pull the models
ollama pull llama3.2:1b
ollama pull llama3.2:3b

# Verify:
ollama list | grep llama3.2

Port already in use (EADDRINUSE)

# Cause: Another process is using port 6740
# Solution 1: Find and stop the process
lsof -ti:6740 | xargs kill

# Solution 2: Change port in .env file
echo "PORT=6741" >> .env

System starts but no responses

# Check if models are loaded
ollama list

# Check server logs for errors
# Look for energy level - if at 0%, system may be energy-depleted

# Try restarting with higher replenishment rate
node dist/src/index.js --replenish-rate 20

"Missing required environment variables"

# Cause: .env file not created or missing OLLAMA_BASE_URL
# Solution: Create .env file
cp .env.example .env

Build errors with TypeScript

# Clear build artifacts and rebuild
rm -rf dist/
npm run build

🤝 Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Run tests: ./demo.sh
  5. Submit a pull request

📄 License

MIT License - see repository for details.
