Skip to main content
ZeroTwo’s video Studio lets you generate short AI videos from text descriptions. Describe a scene, subject, or action and ZeroTwo will produce a video clip using one of its AI video generation models.

Accessing video generation

Navigate to /studio/video from the topbar at the top of the ZeroTwo app. Click Video in the topbar to open the video workspace.
Video generation is Pro+ only. Free plan users cannot access video generation. Upgrade in Settings → Account to unlock this feature.

How it works

  1. Navigate to /studio/video
  2. Write a description of the video you want — subject, action, setting, camera movement, mood
  3. Select a video generation model
  4. Configure output settings: duration, aspect ratio, format
  5. Click Generate
  6. Wait for the generation to complete (typically 30 seconds to several minutes)
  7. Preview the video in-browser, then download or share

Generation time

Video generation is significantly more compute-intensive than image generation. Expect generation times of 30 seconds to several minutes depending on clip length, selected model, and current server load. Do not close the tab while a video is generating.
Typical generation times:
  • Short clips (2–4 seconds): 30 seconds to 2 minutes
  • Longer clips (5–10 seconds): 2–5 minutes or more

Output formats

Generated videos can be downloaded in three formats:
FormatBest for
MP4Universal sharing — works on all platforms and devices
WebMWeb embedding and smaller file sizes
MOVMac and iOS workflows, video editing software
MP4 is recommended for most use cases due to maximum compatibility.

Plan requirements

PlanVideo generation
FreeNot available
ProAvailable
Pro 2xAvailable
Plus UltraUnlimited
BusinessUnlimited

Tips for your first video generation

Getting good results from AI video starts with a well-structured prompt. Here’s what to focus on: 1. Describe the subject clearly Who or what is in the video? What are they doing? Be specific:
  • Vague: a person outdoors
  • Better: a young woman in a red coat walking through a snow-covered park, looking around with curiosity
2. Describe camera movement AI video models respond well to explicit camera direction:
  • slow pan from left to right
  • static camera, no movement
  • zoom out gradually revealing the full scene
  • close-up on subject's face
3. Set the visual style and mood
  • cinematic, warm tones, 24fps film look
  • documentary style, natural lighting
  • dreamy and soft, slightly desaturated
4. Keep it short for best results Start with 2–4 second clips. Short clips generate faster and are more consistent. Chain multiple short clips together in a video editor for longer sequences.

Frequently asked questions

No. Video generation is a Pro+ feature. Free plan users do not have access to video generation. Upgrade in Settings → Account.
Typically 30 seconds to several minutes depending on clip duration, model, and current server load. Do not close the tab while generating — this will interrupt the process.
This depends on the selected model. Most models produce clips of 2–10 seconds. Check the model-specific limits in the model dropdown within the video Studio.
Image-to-video capability depends on the selected model. Some video models support an input image as a starting frame. Check the model controls in the video Studio for this option.
MP4, WebM, and MOV. See Supported formats for guidance on which to choose.

Explore further

Creating videos

Step-by-step guide with prompt tips for better video output.

Video models

Available video generation models and their capabilities.

Supported formats

MP4, WebM, MOV — format details and recommendations.

Troubleshooting

Fix common video generation issues.