VoicePersona — AI Microservices

All languages AIM supports, several registers each
Live A/B learning, per contact
0 voices shipped without a domain-term audit

VoicePersona is the service that decides, turn by turn, which voice speaks on the call. It hosts a bank of pre-auditioned voices across every supported language and several registers per language — neutral, warmer, older, younger, with regional accents where they matter — and chooses which one to use for each call the stack places or receives.

The choice is not random. A new contact gets a reasonable default based on language and region. Over time the system A/B tests voices against the metric that matters for the product (pickup-to-booking, pickup-to-close, pickup-to-renewal or pickup-to-registration) and settles on the voice that each individual contact picks up for most often, listens to longest and converts through. Two years into a programme, two contacts will not necessarily hear the same voice.
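As a rough illustration of per-contact convergence, the sketch below uses Thompson sampling over a shortlist of voices. This is a stand-in technique chosen for the example; the page only says the system A/B tests voices, and the class name, method names and voice IDs here are all hypothetical rather than part of VoicePersona's API. Each contact keeps its own conversion counts per voice, and the selector gradually favours the voice that converts best for that contact.

```python
import random
from collections import defaultdict


class ContactVoiceSelector:
    """Per-contact Thompson sampling over voice IDs (illustrative sketch only)."""

    def __init__(self, voices, seed=None):
        if not voices:
            raise ValueError("voices must not be empty")
        self.voices = list(voices)
        self.rng = random.Random(seed)
        # stats[contact_id][voice_id] = [conversions, non_conversions]
        self.stats = defaultdict(lambda: {v: [0, 0] for v in self.voices})

    def choose(self, contact_id):
        """Draw a plausible conversion rate per voice and pick the highest draw."""
        arms = self.stats[contact_id]

        def draw(voice):
            wins, losses = arms[voice]
            # Beta(1, 1) prior: every voice starts out equally plausible.
            return self.rng.betavariate(wins + 1, losses + 1)

        return max(self.voices, key=draw)

    def record(self, contact_id, voice, converted):
        """Store the outcome of one call placed with the given voice."""
        self.stats[contact_id][voice][0 if converted else 1] += 1

    def best(self, contact_id):
        """Voice with the highest posterior mean conversion rate so far."""
        arms = self.stats[contact_id]
        return max(self.voices, key=lambda v: (arms[v][0] + 1) / (sum(arms[v]) + 2))


selector = ContactVoiceSelector(["warm_en_gb", "neutral_en_gb"], seed=7)
voice = selector.choose("contact-42")           # voice for the next call
selector.record("contact-42", voice, converted=True)
```

Per-contact data is sparse, so a real system would probably seed each contact with priors from similar contacts (same language and region, say) instead of starting from a flat prior as this sketch does.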

Every voice is reviewed for cadence, pronunciation of domain terms and handling of pauses. This is not a hobbyist TTS — a B2B sales call cannot mangle a product name, a collections call cannot sound robotic, a healthcare call cannot botch clinical vocabulary. The platform maintains a vetted shortlist per tenant and retires voices that stop performing.
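One way to picture the domain-term check is a gate that scans an outgoing script for industry vocabulary and flags any term that has not yet had a pronunciation audit. The sketch below is an assumption about how such a gate could look, not the platform's actual audit process; the function name, the term lists and the sample script are invented for illustration.

```python
import re


def unaudited_terms(script, domain_terms, audited_terms):
    """Return domain terms that appear in the script but lack a pronunciation audit."""
    audited = {term.lower() for term in audited_terms}
    flagged = []
    for term in domain_terms:
        # Match the term as a whole word, case-insensitively.
        pattern = r"(?<!\w)" + re.escape(term) + r"(?!\w)"
        in_script = re.search(pattern, script, flags=re.IGNORECASE) is not None
        if in_script and term.lower() not in audited:
            flagged.append(term)
    return flagged


script = "Your HbA1c result is ready; please continue metformin as prescribed."
flagged = unaudited_terms(
    script,
    domain_terms=["HbA1c", "metformin", "statin"],  # hypothetical tenant glossary
    audited_terms=["metformin"],                    # terms already reviewed for this voice
)
print(flagged)  # ['HbA1c']
```

A non-empty result would hold the script, or route the term to a reviewer, before the voice is allowed to speak it.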

Capabilities

Everything VoicePersona handles for you

Multilingual bank

Every language AIM supports, each in several registers: formal, warm, technical, concise.

Per-contact A/B

The system converges each contact on the voice that maximises that contact's own pickup-to-conversion rate.

Domain-term hygiene

Pronunciation of industry vocabulary is reviewed per language; new terms are flagged for audit.

Cadence preservation

Natural pauses, soft hand-offs and disfluencies are kept intact, with no uncanny, stacked-syllable delivery.

Quick persona swap

An operator can retire a voice or A/B a new one without touching the rest of the stack.
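The quick persona swap can be sketched as a small per-tenant registry: retiring or activating a voice changes one set and nothing else. The `PersonaBank` class, its methods and the tenant and voice IDs are hypothetical names used only to show the idea.

```python
from dataclasses import dataclass, field


@dataclass
class PersonaBank:
    """Tracks which voice IDs are active for each tenant (illustrative only)."""

    active: dict = field(default_factory=dict)  # tenant_id -> set of voice IDs

    def activate(self, tenant_id, voice_id):
        """Add a voice to the tenant's shortlist, for example to A/B a new one."""
        self.active.setdefault(tenant_id, set()).add(voice_id)

    def retire(self, tenant_id, voice_id):
        """Remove a voice from the tenant's shortlist; unknown voices are ignored."""
        self.active.get(tenant_id, set()).discard(voice_id)

    def shortlist(self, tenant_id):
        """Voices currently eligible for selection, in a stable order."""
        return sorted(self.active.get(tenant_id, set()))


bank = PersonaBank()
bank.activate("tenant-a", "warm_en_gb")
bank.activate("tenant-a", "neutral_en_gb")
bank.retire("tenant-a", "neutral_en_gb")
print(bank.shortlist("tenant-a"))  # ['warm_en_gb']
```

Because callers only ever read `shortlist`, the dialling and agent code would not need to change when a voice is swapped.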

Integrations

The voice layer of every call

VoicePersona is a specialised surface — it renders speech into VoiceDialer, takes instructions from AgentBuilder, and is tuned per tenant by the platform admin.

VoiceDialer plays the rendered audio into the live phone call.
AgentBuilder drives what the voice says and when; persona renders, agent composes.
SuperAdmin manages the persona bank, retires voices and controls which personas are active per tenant.
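The division of labour described above (the agent composes, the persona renders, the dialer plays) can be sketched with three small interfaces. None of these classes or method names come from the actual AgentBuilder, VoicePersona or VoiceDialer APIs; they are placeholders, and the fake implementations exist only so the example runs end to end.

```python
from typing import Protocol


class Agent(Protocol):
    def next_utterance(self, contact_id: str) -> str: ...


class Persona(Protocol):
    def render(self, text: str, voice_id: str) -> bytes: ...


class Dialer(Protocol):
    def play(self, call_id: str, audio: bytes) -> None: ...


def speak_turn(agent, persona, dialer, call_id, contact_id, voice_id):
    """One turn of a call: compose the text, render it, play it into the call."""
    text = agent.next_utterance(contact_id)
    audio = persona.render(text, voice_id)
    dialer.play(call_id, audio)
    return text


# Stand-in implementations, purely for demonstration.
class ScriptedAgent:
    def next_utterance(self, contact_id):
        return f"Hello {contact_id}, this is a reminder about your booking."


class FakePersona:
    def render(self, text, voice_id):
        # A real renderer would return synthesised audio; this just tags the text.
        return f"[{voice_id}] {text}".encode("utf-8")


class RecordingDialer:
    def __init__(self):
        self.played = []

    def play(self, call_id, audio):
        self.played.append((call_id, audio))


dialer = RecordingDialer()
speak_turn(ScriptedAgent(), FakePersona(), dialer, "call-1", "contact-42", "warm_en_gb")
```

Keeping the three roles behind separate interfaces is what would let a voice be swapped without the agent or the dialer noticing.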

Wire VoicePersona into your product today

Book a consultation with our founders and we'll walk you through the whole microservice stack — not just this one — live on your domain.

