VideoText workflow guide

SDH Subtitles — Create Subtitles for the Deaf and Hard of Hearing

SDH subtitles (Subtitles for the Deaf and Hard of Hearing) include speaker identification and descriptions of non-speech audio alongside standard dialogue text. VideoText generates the dialogue and speaker labels automatically from your video. You can then add sound descriptions (like [music], [applause]) to complete the SDH file. Export as SRT or VTT ready for Netflix, YouTube, or any platform that accepts SDH tracks.

Where subtitle workflows break in real production

  • Reading speed is the invisible subtitle constraint: 14–17 characters per second is the readable range for most viewers; anything above 21 CPS causes comprehension failure even when the transcript text is accurate. Most automated subtitle tools generate lines without CPS checks.
  • Burned captions create a permanent workflow commitment — any timing correction or text fix after encoding requires re-encoding the full video. For long-form content this is hours of lost render time, which is why burn decisions need to happen after QA, not before.
  • Platform subtitle requirements are not interchangeable: TikTok displays a 2-line maximum with specific font rendering; YouTube accepts up to 1,500 SRT blocks and requires UTF-8 encoding; Instagram Reels ignores soft subtitle tracks on autoplay entirely, making burned captions the only reliable option for Reels.

From raw video to export-ready subtitle file

1. Generate subtitle file with word-level timestamps

Upload the video or paste a public URL. Word-level timestamp alignment produces more accurate line breaks than sentence-level alignment — each subtitle break falls at a natural pause rather than a word boundary mid-phrase.

2. Review CPS on every subtitle line

Flag any line above 17 CPS and split it. An 8-word subtitle in a 1.2-second window hits approximately 24 CPS — unreadable for most viewers. Merge subtitle lines shorter than 0.8 seconds, which display too briefly to register.

3. Check for timing overlap and minimum duration

Subtitle overlap — where the next block starts before the previous one ends — causes display flicker in most players. Any gap shorter than 0.1 seconds between adjacent subtitles should be merged or widened to prevent rendering artifacts.

4. Select platform-appropriate export format

Export SRT for YouTube, Vimeo, and video editors. Export VTT for HTML5 web players and streaming platforms that support caption positioning. Choose burned-caption output for Instagram Reels, TikTok clips, and any social context where autoplay without sound is expected.

SRT, VTT, and burned caption outputs

SRT subtitle file

Standard timed subtitle format: sequence number, timestamp pair (00:00:00,000 → 00:00:00,000 with comma separator), and one or two lines of caption text per block. Supported by YouTube, Vimeo, LinkedIn, most video editors, and accessibility workflows.

VTT subtitle stream

WebVTT format with period timestamp separators (00:00:00.000 → 00:00:00.000). Used by HTML5 video players, streaming services, and accessibility platforms. Unlike SRT, VTT supports <cue> positioning metadata, text styling, and karaoke-mode highlighting.

Burned captions

Permanently embedded caption text rendered directly into the video frame — cannot be toggled off by the viewer. Required for platforms that strip external caption tracks. Font rendering, shadow depth, and vertical position all need QA review after encoding because video compression can degrade caption legibility.

Creators and teams running subtitle workflows

Video creators publishing to multiple platforms

Generate SRT for YouTube upload, export burned captions for Instagram Reels, and produce VTT for course platform embeds — all from the same subtitle pass without reformatting timing.

Accessibility compliance teams

Produce synchronized captions that meet WCAG 2.1 timing and reading-speed requirements. CPS validation catches lines that fail accessibility guidelines before the video goes live.

Post-production editors

Export SRT or VTT for import into Premiere, DaVinci Resolve, or Final Cut Pro. Accurate word-level timestamps eliminate the need to manually re-sync caption timing after import.

Subtitle edge cases that cause QA failure

CPS violation — splitting required

Eight words spoken in 1.2 seconds: "we need to fix this before the deadline" at 24 CPS. The line must be split into two display segments within the same timing window to hit the readable 14–17 CPS range.

Subtitle timing overlap

Block ending at 00:01:23,500 overlapping a block starting at 00:01:23,200 causes display flicker and rendering failure in most SRT players. A subtitle validator catches this before upload.

Mobile frame crop

A two-line subtitle with 44+ characters per line at default font sizes clips at the bottom of mobile video frames. The safe area for mobile subtitles is a single line, 38 characters maximum, positioned at 85% vertical height.

Burned caption rendering after encode

Font stroke weight, drop shadow depth, and subtitle vertical position all shift slightly after H.264 encoding. Captions that look correct in preview may become harder to read in the encoded file — requires a post-encode QA pass before publishing.

Platform-specific subtitle requirements

YouTube SRT requirements

UTF-8 encoding required. Maximum 1,500 subtitle blocks per file. Timestamp format: 00:00:00,000 (comma separator, not period). Files over 1,500 blocks need splitting before upload. YouTube auto-syncs uploaded SRT to audio — small timing offsets are corrected automatically.

TikTok caption behavior

TikTok generates its own auto-captions and displays them over uploaded SRT files in most cases. Burned captions are the reliable method for ensuring accurate captions appear on TikTok content. SRT upload works but TikTok may override it.

Instagram Reels caption handling

Instagram does not display soft subtitle tracks during autoplay in Feed or Reels. Burned captions are the only reliable method for ensuring captions appear for silent autoplay viewers. Instagram's built-in auto-caption feature can also be used after upload, but accuracy varies.

VTT vs SRT encoding difference

VTT uses period timestamp separators (00:00:00.000) while SRT uses commas (00:00:00,000). Swapping the separator character breaks parsing in most players. VTT files must begin with the WEBVTT header line — SRT files must not.

Subtitle workflow questions answered

What are SDH subtitles?

SDH stands for Subtitles for the Deaf and Hard of Hearing. SDH subtitles differ from standard subtitles by including speaker identification (e.g., [JOHN:]) and descriptions of non-speech audio (e.g., [upbeat music], [door slams]). Netflix, Amazon Prime, and broadcast platforms require SDH for accessibility compliance.

What is the difference between SDH and closed captions?

SDH and closed captions serve the same accessibility purpose. Closed captions (CC) are a North American broadcast standard embedded in the video signal (CEA-608/708). SDH is an international subtitle standard used for disc and streaming delivery. SDH is styled like regular subtitles (positioned at the bottom of the screen) rather than the white-on-black CC style. Functionally they contain the same information.

Does VideoText automatically generate SDH subtitles?

VideoText generates accurate dialogue subtitles with speaker labels automatically. Sound effect descriptions ([music], [applause], etc.) need to be added manually in a text editor. The generated SRT or VTT file provides the full dialogue base — you then annotate non-speech events.

What format should SDH subtitles be in?

Netflix requires SDH in TTML (IMSC) format for final delivery. For upload to YouTube and Vimeo, SRT or VTT with SDH formatting works correctly. VideoText exports SRT and VTT.

Is VideoText free?

Yes. Free tier includes 3 uploads per day. No credit card required.

Related caption and subtitle tools

Workflow shortcuts

Fix Sdh Subtitles timing and reading-speed issuesExport Sdh Subtitles captions without timestamp drift All Pages Index Tool Alternatives Transcription Tools Subtitle Tools

Primary Transcription & Caption Tools

Video to TranscriptVideo to SubtitlesTranslate SubtitlesFix SubtitlesBurn SubtitlesCompress Video

Find More Tools

Tool Alternatives Transcription Tools Subtitle Tools