The two tiers
Standard models
Standard models have no monthly message cap. You can use them as much as you want on any plan, including Free. Standard models are fast, capable, and suitable for the majority of tasks. They include the smaller and more efficient variants from major providers:- GPT-5-mini
- GPT-4o-mini
- GPT-4.1-nano
- Gemini 2.5 Flash Lite
- Gemini 2.0 Flash
- Mistral Small
- Mistral Nemo
- Grok 4 fast
- Other smaller/faster model variants
Premium (rate-limited) models
Premium models are ZeroTwo’s highest-capability models. They’re counted against your monthly quota and require a Pro+ plan to access. Premium models include:- Claude Opus 4.6
- Claude Sonnet 4.6
- Claude Sonnet 4.5
- Claude Sonnet 4.1 / 4.0
- Claude Haiku 4.5
- Claude 3.5 Sonnet / 3.5 Haiku
- GPT-5
- GPT-4o
- GPT-4.1
- Gemini 3.1 Pro
- Gemini 2.5 Pro
- Gemini 2.5 Flash
- Grok-4
- Cohere Command A
- Mistral Large
- Magistral
- Qwen Max
- Perplexity Sonar Pro
- o3, o4-mini (reasoning models)
- DeepSeek Reasoner
Monthly quotas by plan
| Plan | Premium message quota |
|---|---|
| Free | 0 — standard models only |
| Pro | 555 messages/month |
| Pro 2x | 1,150 messages/month |
| Plus Ultra | Unlimited |
| Business | Unlimited |
The quota resets at the start of each billing cycle. Go to Settings → Account to see your current premium message usage and remaining quota for the month.
What counts as a premium message
A premium message is any message you send in a chat where the selected model is a rate-limited premium model. The following do not count against your premium quota:- Messages sent using standard models
- AI responses (only your sent messages count)
- Messages in Studio (image, video, and audio generation use separate Studio credits, not the premium message quota)
When your quota is exceeded
When you reach your monthly premium message limit, ZeroTwo automatically falls back to a standard model for that message. You’ll see a notification in the chat indicating that the fallback occurred. What you can do when quota is exceeded:- Continue with the fallback standard model — it will be a capable standard model and sufficient for many tasks
- Wait until next month — your quota resets at your billing cycle renewal date
- Upgrade your plan — move from Pro to Pro 2x, or from Pro 2x to Plus Ultra for more or unlimited premium messages
- GPT-5-mini
- GPT-4o-mini
- Gemini Flash Lite
- Mistral Small
- Grok 4 fast
Checking your quota
To see your current premium message usage:- Go to Settings → Account
- Look for the Premium messages section
- Your usage for the current billing cycle and remaining quota are displayed
Strategies for managing your quota
Practical approaches:- Iterate with standard, finalize with premium: Write first drafts with GPT-4o-mini, then switch to Claude Sonnet 4.6 or GPT-5 for the final polished version
- Use premium for hard problems only: Complex coding, detailed analysis, multi-step reasoning — these benefit most from premium models
- Match model to task: Not every task needs the most capable model. Simple questions, reformatting, and quick lookups are just as well handled by standard models
- Check usage mid-month: If you’re burning through quota quickly, check Settings → Account and adjust your habits before hitting the limit

