TTS-driven lipsync
Type any text, stream it through OpenAI TTS, and the avatar's mouth animates in
real time. Visemes are extracted in-browser via wawa-lipsync
analysing the audio's frequency bands — no server-side audio processing, no
per-word timing data, no plugins.
live · TTS → viseme → morph