Watch autonomous AI characters navigate their world — eating, cleaning, playing music, socializing. Powered by hybrid utility-AI + optional LLM reflection. All running locally on dual RTX 5090s.
Need-based decision engine. Characters track energy, hunger, hygiene, social, fun, and creativity. Each furniture piece has associated actions with priority scores.
Optional layer using local Qwen 3.6 27B via vLLM. Adds personality-driven thought, complex decision-making, and natural language narration. Disabled by default for demo.
Express server + React client. Runs entirely on dual RTX 5090s. No external API calls. Deployed to Railway for public access.
Full walkthrough with commentary: architecture deep dive, live demo, and what local AI can do right now.
Try the live interactive demo above while you wait.
▶ Subscribe on YouTube| Component | Details |
|---|---|
| Language Model | Qwen 3.6 27B (via vLLM, int4 compressed) |
| Hardware | Dual NVIDIA RTX 5090s |
| Inference Server | vLLM 0.19.1 with flash-infer + fp8 KV cache |
| Backend | Express.js with TypeScript |
| Frontend | React + Vite |
| AI Architecture | Hybrid: Utility AI (default) + LLM Reflection (opt-in) |
| Deployment | Railway (server) + Cloudflare Pages (companion site) |
| Cost per Decision | $0.00 (all local) |