Ai-Chat bot with Judge-Worker architecture. The project consists of backend + API, PWA app and mobile app.
- Two PCs preferably with Linux or WSL2
- Both PCs have to be connected either to the same network or to the same VPN
- Each having at least 32GB of RAM (less should also work, but it's not recommended)
- And each having at least one Nvidia GPU with at least 8GB of VRAM (6GB should also work but then you'll have to choose different LLMs - smaller)
- Both PCs needs to have the CUDA drivers installed
- Both PCs needs to have Docker
Starting the AI-Cluster requires using this command in the first PC:
./start.sh pc1and in the second PC:
./start.sh pc2and finally the third:
./start.sh pc3Make sure to edit the IP adresses in the .env file.
You can test the connection between the PCs and if the models can be loaded by using this commans:
curl -X POST http://<PC2_IP>:8000/system/init_mode -H "Content-Type: application/json" -d '{"mode": "pro"}'If everything went according to plan, then you should receive the confirmation.