Feature
Upload your TikTok video to Capto. AI transcribes with word-level accuracy, you choose a bold caption style, and Capto exports a burned-in 9:16 MP4 ready to post — no CapCut required.
check_circle3 credits per minutecheck_circleWord-level karaoke highlightcheck_circle9:16 burned-in export
85% of TikTok videos are watched on mute. Captions aren’t optional — they’re what keeps viewers from scrolling past your content. Capto generates accurate, word-level captions using OpenAI Whisper and lets you burn them directly into a 9:16 MP4 in your choice of font, color, and position. The result is a TikTok-ready video with professional captions — without opening a video editor.
Any creator posting video where viewers might have sound off — which is most of TikTok.
85% of TikTok videos are watched on mute. Burned-in captions are the single highest-impact change you can make to a TikTok — they keep viewers in the video even when their phone is silent. Capto generates accurate, word-level captions and bakes them directly into a 9:16 MP4 you can upload immediately. No CapCut, no Premiere Pro, no separate export step.
TikTok's algorithm serves content globally. A single video with Spanish or Portuguese captions can reach millions of new viewers who would have scrolled past an English-only clip. Capto translates your transcript into 60+ languages and exports a separate captioned MP4 for each audience — all from one upload.
Educational TikToks — study tips, tutorials, cooking methods, fitness coaching — perform significantly better with captions. Viewers can follow along silently, pause and reread, and share with friends without needing audio. Capto's word-level timestamps make the karaoke style that stops the scroll in educational content.
All plans accept MP4, MOV, WebM, AVI, and MKV. The burned-in MP4 export costs 2 credits per minute (on top of the 1 credit per minute for transcription). Credits never expire.
| Plan | Max file size | Max video length | Concurrent exports |
|---|---|---|---|
| Essential | 100 MB | 15 min | 1 |
| Creator | 500 MB | 60 min | 2 |
| Pro | 500 MB | 120 min | 3 |
| Growth | 2 GB | 4 hr | 5 |
Need longer videos? View all plans →
Yes — TikTok has a built-in auto-caption feature, but it only works in the TikTok app, can't be styled, and is often inaccurate on accents or fast speech. Capto gives you editable, styleable burned-in captions that work across every platform and look exactly how you want them.
Bold, high-contrast fonts with a dark outline or solid background work best for scroll-stopping captions. Impact, Bebas Neue, and Montserrat are the top performers. White text with a black outline or a semi-transparent stripe background are both effective. Capto offers all of these plus custom font upload.
Yes — Capto translates into 60+ languages including Spanish, Portuguese, French, Hindi, and Arabic. After transcription, click Translate, pick your target language, and export a separate captioned MP4. You can reach Spanish-speaking TikTok audiences without re-recording.
Studies consistently show captioned videos get 40%+ more views and 12% longer watch time on average. 85% of social video is watched on mute — captions are what keep viewers watching past the first few seconds.
Under 2 minutes for most TikTok-length clips (15 seconds to 3 minutes). Transcription takes about 30 seconds; the burned-in MP4 export takes another 60–90 seconds.
Yes — every account gets 5 free minutes, no credit card required. That's enough to add captions to 2–3 TikTok-length clips before you decide whether to buy credits.
Every new account starts with 5 free minutes. No credit card required.
boltStart Free — 5 min includedRelated features