Skip to main content
ZeroTwo’s audio Studio lets you generate AI audio — including music, sound effects, and narration — from text prompts. Describe what you need and ZeroTwo produces an audio file you can preview in-browser and download.

Accessing audio generation

Navigate to /studio/audio from the topbar at the top of the ZeroTwo app. Click Audio in the topbar to open the audio workspace.

Audio Studio vs. Voice

ZeroTwo has two separate audio features. It’s important to understand the difference:
FeatureAudio StudioVoice
What it doesGenerates audio files (music, sound effects, narration) from text promptsReal-time voice conversation with AI
OutputDownloadable audio file (MP3, WAV, etc.)Live spoken response during a chat session
Use caseBackground music, sound effects, produced audio assetsHands-free chat, accessibility, voice interaction
Location/studio/audioIn-chat voice mode
Audio Studio is for producing audio assets. Voice is for speaking with AI in real time.

What you can create

  • Background music: ambient tracks, genre-specific music, mood-driven compositions for videos, presentations, or apps
  • Sound effects: UI sounds, environmental effects, specific sound descriptions
  • AI narration: spoken audio of written text in various tones and styles
  • Jingles and short musical pieces: branded audio, intros, outros
  • Creative audio: experimental sound design, unique soundscapes

How it works

1

Navigate to audio Studio

Go to /studio/audio via the topbar.
2

Describe your audio

Write a description of what you want: genre, mood, tempo, instruments, duration, and intended use. The more specific, the better.
3

Select a model

Choose an audio generation model. Different models are optimized for different audio types — music, voice, effects.
4

Set duration (if available)

Specify how long the audio should be, if the selected model supports duration control.
5

Generate and preview

Click Generate. When complete, preview the audio in-browser and download the file.

Plan requirements

PlanAudio generation
FreeVery limited or unavailable
ProAvailable
Pro 2xAvailable
Plus UltraUnlimited
Audio generation is primarily a Pro+ feature. Free plan users may have very limited or no access. Upgrade in Settings → Account.

Prompt essentials

Audio prompts follow different best practices depending on what you’re generating: For background music: Include genre, mood, instruments, tempo, and intended duration. Also specify the context (what it’s for) — this helps the model calibrate energy and style: Calm, professional background music for a corporate explainer video, piano and light strings, 60 seconds, no percussion, subtle and understated For narration / text-to-speech: Provide the text you want read, plus a description of the voice — tone, pace, and style: Read the following text in a warm, friendly female voice at a conversational pace: "Welcome to our platform. We're glad you're here." For sound effects: Be specific about the exact sound event, material, and environment: A single wooden door knock, 2 knocks, interior space with light reverb, natural sound For ambient soundscapes: Describe the environment and the mood you want to evoke: Quiet office ambience: distant keyboard typing, subtle air conditioning hum, occasional muffled conversation, focused and productive atmosphere

Frequently asked questions

Yes. Audio Studio generates audio files (music, effects, narration) that you download. Voice mode is real-time spoken conversation with the AI during a chat session. They are separate features.
MP3 and WAV are the primary formats, depending on the selected model. MP3 for general sharing; WAV for professional production. Format availability depends on the model.
Generated audio from ZeroTwo Studio is generally usable for commercial purposes, subject to ZeroTwo’s Terms of Service and the content policies of the underlying model providers. Review the terms for your specific use case.
Free plan access to audio generation is very limited or unavailable. Audio is primarily a Pro+ feature. Upgrade in Settings → Account.

Explore further

Creating audio

Step-by-step guide with prompt examples for music, effects, and narration.

Audio models

Available audio generation models and their best use cases.

Troubleshooting

Fix common audio generation issues.