Available voices
| Voice | Character | Best for |
|---|---|---|
| Alloy | Balanced, neutral, versatile | General use, professional contexts — a reliable default |
| Ash | Warm, conversational | Casual conversation, brainstorming, coaching |
| Ballad | Expressive, nuanced | Creative work, storytelling, reading aloud |
| Cedar | Clear, professional, crisp | Business tasks, presentations, formal writing |
| Coral | Friendly, approachable, upbeat | Everyday Q&A, learning, customer-facing use |
| Echo | Precise, sharp, technical | Technical discussions, step-by-step guides, coding |
| Marin | Calm, smooth, measured | Long sessions, focus work, ambient listening |
| Sage | Thoughtful, careful, wise | Research, analysis, nuanced topics, advice |
| Shimmer | Bright, energetic, lively | Creative tasks, motivation, brainstorming |
| Verse | Natural, flowing, conversational | General use, dictation, note-taking, storytelling |
Voice descriptions in detail
Alloy — Balanced and Neutral
Alloy is ZeroTwo’s default voice for most users. It has a neutral, confident tone that works well across a wide range of tasks — technical questions, creative writing, casual chat, or professional discussions. If you are unsure which voice to pick, Alloy is a safe starting point.
Ash — Warm and Conversational
Ash has a warmer, more personal quality. It is well suited to exchanges where tone matters, such as coaching, brainstorming, or extended sessions where you want the AI to feel more like a collaborator than a tool.
Ballad — Expressive and Nuanced
Ballad has a more dynamic, expressive vocal quality, with natural variation in pacing and tone. It stands out when reading stories, working through creative projects, or handling any content where emotional range adds value.
Cedar — Clear and Professional
Cedar is precise and authoritative without being cold. It is ideal for business contexts: drafting emails out loud, summarizing reports, preparing talking points for presentations, or any task where a professional register matters.
Coral — Friendly and Approachable
Coral has an open, welcoming quality. It works well for everyday tasks, learning new topics, onboarding scenarios, or any situation where you want a non-intimidating conversational partner. It is a popular choice for lighter-toned use cases.
Echo — Precise and Technical
Echo prioritizes clarity and precision. It enunciates well and keeps a consistent pace, which makes it excellent for technical explanations, numbered steps, code walkthroughs, or any content where accuracy matters and the listener needs to follow along closely.
Marin — Calm and Smooth
Marin has a gentle, unhurried quality. It is ideal for long listening sessions — hearing lengthy documents read aloud, extended focus work sessions, or situations where a calmer, more measured energy is helpful.
Sage — Thoughtful and Careful
Sage speaks with deliberation and care. It suits topics that benefit from a reflective, considered tone: philosophy, research discussions, ethical questions, detailed analysis, or situations where you want the response to feel carefully weighed.
Shimmer — Bright and Energetic
Shimmer is upbeat and enthusiastic. It works well for motivational contexts, creative brainstorming sessions, generating ideas, or any workflow where a higher-energy voice keeps you engaged and moving.
Verse — Natural and Flowing
Verse sounds closest to a natural conversational cadence. It is great for dictation, note-taking, and general-purpose use where the voice should feel unobtrusive and easy to listen to for extended periods.
How to change your voice
Select a voice
Click the dropdown and choose a voice. A short preview clip may play to help you decide.
Voice preference changes apply to future voice sessions. If you change your voice while a session is active, end the session and restart it to hear the new voice. Your preference syncs across all devices and browsers where you use ZeroTwo.
Audio technical specifications
All ZeroTwo voices use the same audio format optimized for real-time streaming:
| Property | Value |
|---|---|
| Format | PCM16 |
| Sample rate | 24 kHz |
| Channels | Mono |
| Delivery | WebRTC streaming |
| Transcription | Whisper-1 |
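To make the format values above concrete, here is a minimal Python sketch that works out the raw data rate implied by PCM16 at 24 kHz mono and decodes a buffer of samples. The function names are hypothetical, and the little-endian byte order is an assumption (the table does not state it). The figure is the uncompressed rate only; the bandwidth actually used over WebRTC may differ.

```python
import struct

# Values taken from the specification table.
SAMPLE_RATE_HZ = 24_000   # 24 kHz
BYTES_PER_SAMPLE = 2      # PCM16 = 16-bit signed integers
CHANNELS = 1              # mono


def bytes_per_second() -> int:
    """Raw (uncompressed) data rate of the audio format."""
    return SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS


def duration_seconds(num_bytes: int) -> float:
    """Playback length of a raw PCM16 buffer of the given size."""
    return num_bytes / bytes_per_second()


def pcm16_to_floats(raw: bytes) -> list[float]:
    """Decode PCM16 bytes into floats in [-1.0, 1.0).

    Assumes little-endian samples; any trailing odd byte is ignored.
    """
    count = len(raw) // BYTES_PER_SAMPLE
    samples = struct.unpack(f"<{count}h", raw[: count * BYTES_PER_SAMPLE])
    return [s / 32768.0 for s in samples]
```

Under these assumptions, one second of audio is 48,000 bytes (384 kbit/s) before any transport-level encoding.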
Network and quality notes
Voice quality depends on your network connection:
| Connection type | Expected quality |
|---|---|
| Strong WiFi or wired Ethernet | Full quality, low latency |
| 4G/5G mobile connection | Generally good, minor fluctuations possible |
| Weak or congested WiFi | Increased latency, possible choppiness |
| VPN (especially remote servers) | May add latency affecting real-time feel |
If voice quality degrades mid-session, the cause is almost always the network connection rather than ZeroTwo itself. Switching to a more stable connection resolves it in most cases.

