halo-ai

Local AI anywhere, for everyone. No cloud. No subscriptions. Just you and your hardware.

86 tok/s

17 agents

0 cloud

The Stack

A complete AI platform running on bare metal. LLMs, voice cloning, image generation, autonomous agents — all local, all yours.

86 tok/s on Qwen3-30B-A3B with Vulkan + Flash Attention. No API keys. No rate limits.

Clone any voice with 30 seconds of audio. Whisper STT. Kokoro TTS. All on-device.

ComfyUI with SDXL and Flux. 123.5GB unified VRAM on AMD Strix Halo.

17 specialized AI agents with personalities, backstories, and real jobs.

Built for Strix Halo. ROCm + Vulkan. 128GB unified memory. No NVIDIA tax.

Everything runs on your hardware. Your data never leaves your network.

Every agent has a role, a personality, and a purpose.

Community Voice

Bug Hunter

Security

Audio Engineer

Hardware

Entertainment

Code Watcher

Game Builder

86 tok/s

Qwen3-30B-A3B · Vulkan + Flash Attention · AMD Strix Halo · Bare metal
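
For a sense of what a tok/s figure means in practice, here is a minimal sketch of the arithmetic: throughput is generated tokens divided by elapsed time, and reply latency is tokens divided by throughput. The function names and the sample run (860 tokens in 10 seconds) are hypothetical, chosen only to illustrate the 86 tok/s headline; they are not measurements or code from halo-ai.

```python
def tokens_per_second(generated_tokens: int, elapsed_seconds: float) -> float:
    """Return generation throughput in tokens per second."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return generated_tokens / elapsed_seconds


def seconds_for(tokens: int, rate_tok_s: float) -> float:
    """Estimate how long generating `tokens` takes at a given throughput."""
    if rate_tok_s <= 0:
        raise ValueError("rate_tok_s must be positive")
    return tokens / rate_tok_s


if __name__ == "__main__":
    # Hypothetical run: 860 tokens generated in 10 seconds.
    rate = tokens_per_second(860, 10.0)
    print(f"{rate:.1f} tok/s")  # 86.0 tok/s
    # Estimated wait for a 500-token reply at that rate.
    print(f"{seconds_for(500, rate):.1f} s for a 500-token reply")  # 5.8 s
```

At that rate, a 500-token answer would arrive in a little under six seconds. This ignores prompt processing time, which is measured separately from generation speed.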

One command. Under 5 minutes. Your AI, your rules.

git clone https://github.com/stampby/halo-ai.git && cd halo-ai && ./install.sh

Install Guide · Documentation · Get Help