ai-avatar-system

AI Avatar / digital human platform — upload a photo, clone a voice, talk to any face in real time with lip-sync video. Open-source, self…

About ai-avatar-system

AvatarAI is an open-source, production-ready platform for building photorealistic AI avatar conversations. Upload any face photo, clone a voice from a few seconds of audio, and hold a real-time conversation, with lip-sync video generated for every response.

What makes AvatarAI different:
- Zero-shot voice cloning: 10 seconds of audio is all you need (Chatterbox Multilingual)
- Any face, any language: upload a JPEG, pick from 23 languages, start talking
- Token-streaming pipeline: the LLM streams live tokens while TTS + lip-sync run per sentence; the first video chunk plays before the model finishes its reply
- Barge-in: speak (or hit stop) mid-reply and the avatar yields instantly, like a real conversation
- 100% local mode: local storage,…

From the project's README
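
The token-streaming and barge-in ideas described above can be illustrated with a small sketch. This is not taken from the project's code: the function name `sentences_from_tokens`, the `interrupted` callback, and the punctuation-based sentence rule are all assumptions made for illustration. It buffers streamed LLM tokens and hands each sentence to the next stage (TTS and lip-sync, in the README's description) as soon as it is complete, and it stops early if an interruption is signalled.

```python
import re
from typing import Callable, Iterable, Iterator

# Assumed sentence rule: terminal punctuation followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def sentences_from_tokens(
    tokens: Iterable[str],
    interrupted: Callable[[], bool] = lambda: False,
) -> Iterator[str]:
    """Yield complete sentences as soon as they appear in a token stream.

    `interrupted` is a hypothetical hook standing in for barge-in: once it
    returns True, the remainder of the reply is dropped.
    """
    buffer = ""
    for token in tokens:
        if interrupted():
            return  # barge-in: discard whatever is still buffered
        buffer += token
        parts = _SENTENCE_END.split(buffer)
        # Every part except the last is a finished sentence.
        for sentence in parts[:-1]:
            if sentence:
                yield sentence
        buffer = parts[-1]
    # Flush any trailing text once the stream ends normally.
    if buffer.strip():
        yield buffer.strip()


if __name__ == "__main__":
    stream = ["Hel", "lo", " there", ".", " How", " are", " you", "?", " Fine"]
    for sentence in sentences_from_tokens(stream):
        print(sentence)  # each sentence could be sent to TTS here
```

Because the generator yields the first sentence before the stream is exhausted, downstream audio and video work can begin while later tokens are still arriving. The punctuation rule is deliberately naive: abbreviations such as "Dr." would split early, and languages that use other sentence terminators would need a different boundary test.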

ai-avatar-system is an open-source project written primarily in Python, with 295 stars on GitHub. It was last updated in June 2026.

Install

pip install -r requirements.txt

ai-avatar-system vs. the alternatives

Agent	Type	Stars	Language	License	Pricing
ai-avatar-system (this listing)	Agent	295	Python	MIT	Open source
xiaozhi-esp32-server	Infrastructure	10.0k	JavaScript	MIT	Open source
ten-vad	SDK / library	2.2k	C	—	Open source
bailing	Agent	1.7k	Python	MIT	Open source
RCLI	Agent	1.5k	C++	MIT	Open source
CyberVerse	Platform	1.4k	Python	GPL-3.0	Open source