Every file format ZeroTwo accepts, how it processes each one, and what you can do with it.
ZeroTwo supports a broad range of file types across documents, code, data, media, and notebooks. The processing method depends on the file category — here is a complete breakdown.
ZeroTwo reads the full source file with syntax awareness — it understands language structure, can follow function calls, identify patterns, and reason about logic.
Language
Extensions
JavaScript / JSX
.js, .jsx
TypeScript / TSX
.ts, .tsx
Python
.py
Java
.java
C / C++
.c, .cpp, .h, .hpp
C#
.cs
Go
.go
PHP
.php
Ruby
.rb
Swift
.swift
Kotlin
.kt
Scala
.scala
Rust
.rs
Shell / Bash
.sh, .bash, .zsh
PowerShell
.ps1
SQL
.sql
R
.r, .R
Julia
.jl
LaTeX
.tex
What you can ask:
“Review this file for bugs and security vulnerabilities.”
“Explain what each function does.”
“Refactor this code to follow modern best practices.”
“Write unit tests for the functions in this file.”
Markup and stylesheet files are read as text with structural awareness — ZeroTwo understands HTML semantics, CSS selectors, Markdown formatting, and GraphQL schema syntax.
Format
Extensions
What ZeroTwo Can Do
HTML
.html, .htm
Convert to Markdown, review structure, extract content
CSS
.css, .scss, .less
Review for unused classes, explain selectors, refactor
Markdown
.md, .mdx
Rewrite with better structure, summarize, convert formats
Images are analyzed by ZeroTwo’s vision model, which can describe content, read text (OCR), identify objects, and answer specific visual questions.Supported extensions:.png, .jpg, .jpeg, .gif, .webp, .bmp, .tiffWhat you can ask:
“What is in this image?”
“Extract all the text from this screenshot.”
“Describe the chart in this image and summarize the data shown.”
“What does the error message in this screenshot say?”
“Is there anything unusual or out of place in this photo?”
“Identify all the UI elements in this wireframe.”
Very small or very low-resolution images may produce less detailed analysis. For text extraction (OCR), use clear, high-contrast images at a minimum of 800×600 pixels for best results. Animated GIFs — only the first frame is analyzed.
Audio files are automatically transcribed using Whisper (OpenAI’s speech recognition model). The transcript is then available for discussion, summarization, and analysis.Supported extensions:.mp3, .wav, .m4a, .aac, .ogg, .flac, .opusWhat you can ask:
“Transcribe this meeting recording.”
“Summarize the key decisions from this audio.”
“Pull out all action items and who they were assigned to.”
“What topics were discussed in this conversation?”
Format
Notes
MP3
Most common compressed audio format
WAV
Uncompressed; larger files, no quality loss
M4A
Apple’s AAC format; common for voice memos and iPhone recordings
AAC
Compressed audio, good quality at smaller sizes
OGG
Open format, common in web and game audio
FLAC
Lossless compressed audio
OPUS
Modern codec optimized for speech quality
For non-English audio, specify the language in your prompt for better transcription accuracy: “Transcribe this audio — the speaker is speaking in Japanese.”
Frames are extracted from the video at regular intervals, then ZeroTwo’s vision model analyzes each frame. This allows scene description, object identification, and visual Q&A across the video.Supported extensions:.mp4, .mov, .webm, .avi, .mkvWhat you can ask:
“Describe what happens in this video, step by step.”
“What products are shown in this advertisement?”
“Identify the steps demonstrated in this tutorial video.”
“At what point does the presenter show the dashboard?”
Format
Notes
MP4
Most compatible and widely used video format
MOV
Apple QuickTime; common for screen recordings and iPhone video
WEBM
Web-optimized; common for browser-captured video
AVI
Older Microsoft format; still widely supported
MKV
Matroska container; often used for high-quality video
Video analysis is frame-based — fast-moving content between sampled frames may be missed. For analysis of specific moments, reference timestamps in your prompt: “At around the 2:30 mark, what is shown on screen?” Audio from videos is not automatically transcribed — upload the audio track separately for Whisper transcription.
Notebooks are read in full cell context — code cells, Markdown cells, and cell outputs (including printed results, dataframes, and errors) are all available for analysis.Supported formats:.ipynb (Jupyter Notebook), .Rmd (R Markdown)What you can ask:
“Explain what this Jupyter notebook does, cell by cell.”
“Review the code in this notebook for errors and inefficiencies.”
“Summarize the findings from the output cells.”
“Convert this notebook’s analysis steps into a clean Python script.”
“What statistical method is being used in this R Markdown document?”