Skip to main content
ZeroTwo supports a broad range of file types across documents, code, data, media, and notebooks. The processing method depends on the file category — here is a complete breakdown.

Documents

ZeroTwo extracts full text and structure from all major document formats, enabling Q&A, summarization, translation, and data extraction.
FormatExtensionsHow ZeroTwo Processes ItBest Use Case
PDF.pdfFull text extraction; scanned PDFs use OCRDocument Q&A, summarization, compliance review
Word.docx, .docText and heading structure parsed; tables preservedEditing, summarizing, extracting key data
PowerPoint.pptx, .pptSlide text extracted in order; speaker notes includedPresentation review, slide content extraction
Excel.xlsx, .xlsCell data extracted across all sheets; formula values (not strings) capturedData analysis, table Q&A, number extraction
OpenDocument.odt, .ods, .odpText parsed from OpenDocument equivalentsDocument, spreadsheet, and presentation review
Apple iWork.pages, .numbers, .keynoteAutomatically converted and extractedDocument, spreadsheet, and presentation review
What you can ask:
  • “Summarize the executive summary of this report.”
  • “What are the main arguments in chapter 3?”
  • “Extract all deadlines and owners from this contract.”
  • “Compare these two proposal versions and highlight what changed.”
  • “Translate this document from Spanish to English.”

Code Files

ZeroTwo reads the full source file with syntax awareness — it understands language structure, can follow function calls, identify patterns, and reason about logic.
LanguageExtensions
JavaScript / JSX.js, .jsx
TypeScript / TSX.ts, .tsx
Python.py
Java.java
C / C++.c, .cpp, .h, .hpp
C#.cs
Go.go
PHP.php
Ruby.rb
Swift.swift
Kotlin.kt
Scala.scala
Rust.rs
Shell / Bash.sh, .bash, .zsh
PowerShell.ps1
SQL.sql
R.r, .R
Julia.jl
LaTeX.tex
What you can ask:
  • “Review this file for bugs and security vulnerabilities.”
  • “Explain what each function does.”
  • “Refactor this code to follow modern best practices.”
  • “Write unit tests for the functions in this file.”
  • “Convert this Python script to TypeScript.”
  • “Add type annotations and docstrings throughout.”

Data Formats

Structured data files are parsed and made available for analysis, statistics, schema description, and visualization prompts.
FormatExtensionsNotes
CSV.csvMost common tabular format; headers detected automatically
TSV.tsvTab-separated values; same processing as CSV
JSON.jsonNested structures fully parsed; arrays of objects treated as tables
YAML.yaml, .ymlConfiguration and structured data
TOML.tomlConfiguration files
XML.xmlHierarchical data parsed; large files may be summarized
Parquet.parquetColumnar binary format; schema and column data extracted
Feather.featherColumnar binary format; schema and column data extracted
Arrow.arrowApache Arrow columnar format
Avro.avroSchema-aware serialization; data and schema read together
What you can ask:
  • “What are the column names and data types in this dataset?”
  • “Which product had the highest sales in Q3?”
  • “Describe the schema of this JSON file.”
  • “Find any rows with missing values.”
  • “Show the revenue trend by month as a bar chart.”

Markup Files

Markup and stylesheet files are read as text with structural awareness — ZeroTwo understands HTML semantics, CSS selectors, Markdown formatting, and GraphQL schema syntax.
FormatExtensionsWhat ZeroTwo Can Do
HTML.html, .htmConvert to Markdown, review structure, extract content
CSS.css, .scss, .lessReview for unused classes, explain selectors, refactor
Markdown.md, .mdxRewrite with better structure, summarize, convert formats
GraphQL.graphql, .gqlExplain schema, write queries, identify unused types

Images

Images are analyzed by ZeroTwo’s vision model, which can describe content, read text (OCR), identify objects, and answer specific visual questions. Supported extensions: .png, .jpg, .jpeg, .gif, .webp, .bmp, .tiff What you can ask:
  • “What is in this image?”
  • “Extract all the text from this screenshot.”
  • “Describe the chart in this image and summarize the data shown.”
  • “What does the error message in this screenshot say?”
  • “Is there anything unusual or out of place in this photo?”
  • “Identify all the UI elements in this wireframe.”
Very small or very low-resolution images may produce less detailed analysis. For text extraction (OCR), use clear, high-contrast images at a minimum of 800×600 pixels for best results. Animated GIFs — only the first frame is analyzed.

Audio

Audio files are automatically transcribed using Whisper (OpenAI’s speech recognition model). The transcript is then available for discussion, summarization, and analysis. Supported extensions: .mp3, .wav, .m4a, .aac, .ogg, .flac, .opus What you can ask:
  • “Transcribe this meeting recording.”
  • “Summarize the key decisions from this audio.”
  • “Pull out all action items and who they were assigned to.”
  • “What topics were discussed in this conversation?”
FormatNotes
MP3Most common compressed audio format
WAVUncompressed; larger files, no quality loss
M4AApple’s AAC format; common for voice memos and iPhone recordings
AACCompressed audio, good quality at smaller sizes
OGGOpen format, common in web and game audio
FLACLossless compressed audio
OPUSModern codec optimized for speech quality
For non-English audio, specify the language in your prompt for better transcription accuracy: “Transcribe this audio — the speaker is speaking in Japanese.”

Video

Frames are extracted from the video at regular intervals, then ZeroTwo’s vision model analyzes each frame. This allows scene description, object identification, and visual Q&A across the video. Supported extensions: .mp4, .mov, .webm, .avi, .mkv What you can ask:
  • “Describe what happens in this video, step by step.”
  • “What products are shown in this advertisement?”
  • “Identify the steps demonstrated in this tutorial video.”
  • “At what point does the presenter show the dashboard?”
FormatNotes
MP4Most compatible and widely used video format
MOVApple QuickTime; common for screen recordings and iPhone video
WEBMWeb-optimized; common for browser-captured video
AVIOlder Microsoft format; still widely supported
MKVMatroska container; often used for high-quality video
Video analysis is frame-based — fast-moving content between sampled frames may be missed. For analysis of specific moments, reference timestamps in your prompt: “At around the 2:30 mark, what is shown on screen?” Audio from videos is not automatically transcribed — upload the audio track separately for Whisper transcription.
Maximum size: 200 MB (Pro+), 50 MB (Free plan)

Notebooks

Notebooks are read in full cell context — code cells, Markdown cells, and cell outputs (including printed results, dataframes, and errors) are all available for analysis. Supported formats: .ipynb (Jupyter Notebook), .Rmd (R Markdown) What you can ask:
  • “Explain what this Jupyter notebook does, cell by cell.”
  • “Review the code in this notebook for errors and inefficiencies.”
  • “Summarize the findings from the output cells.”
  • “Convert this notebook’s analysis steps into a clean Python script.”
  • “What statistical method is being used in this R Markdown document?”

File Size Limits Summary

File CategoryMax Size (Free)Max Size (Pro+)
Documents, Code, Data, Images, Audio, Notebooks50 MB50 MB
Video50 MB200 MB
Files per message110

Upload Files

How to attach files to chat messages.

Files Library

Browse and manage all your uploaded files.

Retention and Storage

How files are stored and how long they persist.

Troubleshooting

Fix issues with unsupported formats and failed analysis.