Voila is a family of large voice-language foundation models designed for real-time autonomous interaction and voice role-play. It features an end-to-end architecture enabling full-duplex, low-latency conversations with rich vocal nuances. Voila supports over one million pre-built voices and efficient customization from brief audio samples.
