The first voice connection in a new browser session may take a few seconds to initialize the WebRTC stream. Subsequent connections in the same session start faster.
Steps to start a voice chat
Open any chat
Navigate to an existing conversation or start a new one at
/c. Voice is available in all chat contexts.Click the microphone icon
Find the microphone icon in the prompt bar at the bottom of the chat window. Click it once to activate voice mode.
Grant microphone permission
Your browser will show a permission prompt asking for microphone access. Click Allow. This only appears the first time you use voice on zerotwo.ai (or if you previously denied access).If you accidentally clicked Block or previously denied access, see Voice Troubleshooting to reset permissions in your browser settings.
Wait for the listening indicator
Once the microphone is live, a pulsing indicator appears near the mic icon. This confirms ZeroTwo is actively listening and ready for your voice input.
Speak your message naturally
Talk at a normal pace. Voice Activity Detection (VAD) automatically detects when you start and stop speaking — no button to hold down. When you finish a sentence or pause, ZeroTwo begins processing.
ZeroTwo responds with voice and text
After you finish speaking, ZeroTwo processes your message and responds with a voice reply. The response is also displayed as text in the chat, so you always have a written record.
Continue the conversation
Speak again to keep the conversation going. Take turns naturally — ZeroTwo listens after it finishes speaking, but you can interrupt it at any time (see below).
Interrupting ZeroTwo
You do not need to wait for ZeroTwo to finish speaking. If you want to redirect the conversation, correct something, or ask a follow-up mid-response:- Simply start speaking while ZeroTwo is responding
- ZeroTwo stops its current response immediately
- Your new message is processed and ZeroTwo replies
During a voice chat
Voice mode and text mode are not mutually exclusive. While a voice session is active, you can still:- Type a message in the prompt bar and send it — ZeroTwo responds with voice
- Paste text into the prompt bar — useful for sharing a URL, code snippet, or long passage you don’t want to dictate
- Attach files using the paperclip icon — ZeroTwo reads and responds to them verbally
Session indicators
| Indicator | Meaning |
|---|---|
| Pulsing mic icon | ZeroTwo is actively listening for your voice |
| Solid waveform animation | ZeroTwo is currently speaking |
| Static mic icon | Voice session is not active |
Transcripts
Everything said during a voice session is saved as text in the chat history:- Your spoken messages (what you said, transcribed via Whisper-1)
- ZeroTwo’s responses (what it said, shown as text)
Changing your voice mid-conversation
If you want to switch to a different voice during an active session:- End the current voice session (click the mic icon or press Esc)
- Go to Settings → Preferences → Voice and select a new voice
- Click the mic icon to start a new session — the new voice takes effect immediately
Ending a voice session
To stop voice mode at any time:- Click the microphone icon in the prompt bar
- Press Escape on your keyboard
Browser compatibility
Voice chat uses WebRTC, supported in all major modern browsers:| Browser | Support |
|---|---|
| Chrome | Full support |
| Edge | Full support |
| Firefox | Full support |
| Safari | Full support (macOS and iOS) |

