Skip to main content
ZeroTwo offers 10 AI voices with distinct tonal qualities and character. All 10 voices use the same underlying model and audio pipeline — the difference is in character, pacing, and style. Try a few to find what fits how you like to work.

Available voices

VoiceCharacterBest for
AlloyBalanced, neutral, versatileGeneral use, professional contexts — a reliable default
AshWarm, conversationalCasual conversation, brainstorming, coaching
BalladExpressive, nuancedCreative work, storytelling, reading aloud
CedarClear, professional, crispBusiness tasks, presentations, formal writing
CoralFriendly, approachable, upbeatEveryday Q&A, learning, customer-facing use
EchoPrecise, sharp, technicalTechnical discussions, step-by-step guides, coding
MarinCalm, smooth, measuredLong sessions, focus work, ambient listening
SageThoughtful, careful, wiseResearch, analysis, nuanced topics, advice
ShimmerBright, energetic, livelyCreative tasks, motivation, brainstorming
VerseNatural, flowing, conversationalGeneral use, dictation, note-taking, storytelling
If you are unsure where to start, try Alloy for general use or Coral for a friendlier tone. Both are popular starting points and work well across a wide range of tasks.

Voice descriptions in detail

Alloy is ZeroTwo’s default voice for most users. It has a neutral, confident tone that works well across a wide range of tasks — technical questions, creative writing, casual chat, or professional discussions. If you are unsure which voice to pick, Alloy is a safe starting point.
Ash has a warmer, more personal quality. It is well-suited for conversations where tone matters — coaching, brainstorming, or extended conversations where you want the AI to feel more like a collaborator than a tool.
Ballad has more dynamic, expressive vocal quality with natural variation in pacing and tone. It stands out when reading stories, working through creative projects, or any content where emotional range adds value.
Cedar is precise and authoritative without being cold. It is ideal for business contexts: drafting emails out loud, summarizing reports, preparing talking points for presentations, or any task where a professional register matters.
Coral has an open, welcoming quality. It works well for everyday tasks, learning new topics, onboarding scenarios, or any situation where you want a non-intimidating conversational partner. Popular for lighter-tone use cases.
Echo prioritizes clarity and precision. It enunciates well and keeps a consistent pace, which makes it excellent for technical explanations, numbered steps, code walkthroughs, or any content where accuracy and following along matter most.
Marin has a gentle, unhurried quality. It is ideal for long listening sessions — hearing lengthy documents read aloud, extended focus work sessions, or situations where a calmer, more measured energy is helpful.
Sage speaks with deliberation and care. It suits topics that benefit from a reflective, considered tone: philosophy, research discussions, ethical questions, detailed analysis, or situations where you want the response to feel carefully weighed.
Shimmer is upbeat and enthusiastic. It works well for motivational contexts, creative brainstorming sessions, generating ideas, or any workflow where a higher-energy voice keeps you engaged and moving.
Verse sounds closest to a natural conversational cadence. It is great for dictation, note-taking, and general-purpose use where the voice should feel unobtrusive and easy to listen to for extended periods.

How to change your voice

1

Open Settings

Click your profile icon or navigate to Settings from the main menu.
2

Go to Preferences

Select the Preferences tab within Settings.
3

Find the Voice setting

Scroll to the Voice section. A dropdown shows all 10 available voices.
4

Select a voice

Click the dropdown and choose a voice. A short preview clip may play to help you decide.
5

Save

Your selection is saved automatically. The new voice applies to all future voice sessions.
Voice preference changes apply to future voice sessions. If you change your voice while a session is active, end the session and restart it to hear the new voice. Your preference syncs across all devices and browsers where you use ZeroTwo.

Audio technical specifications

All ZeroTwo voices use the same audio format optimized for real-time streaming:
PropertyValue
FormatPCM16
Sample rate24 kHz
ChannelsMono
DeliveryWebRTC streaming
TranscriptionWhisper-1
PCM16 at 24 kHz provides clear, intelligible speech with minimal latency. The mono channel is intentional — it reduces bandwidth requirements without meaningfully affecting voice quality for speech-based conversations.

Network and quality notes

Voice quality depends on your network connection:
Connection typeExpected quality
Strong WiFi or wired EthernetFull quality, low latency
4G/5G mobile connectionGenerally good, minor fluctuations possible
Weak or congested WiFiIncreased latency, possible choppiness
VPN (especially remote servers)May add latency affecting real-time feel
If you notice voice quality degrading mid-session, it is almost always a network condition, not a ZeroTwo issue. Switching to a more stable connection resolves it in most cases.
See Voice Troubleshooting for help with specific audio issues.