Captioning Resources

Last Updated: 02/04/2019


This page is intended to help you create or edit caption files on your own. Please get in touch with any questions or to schedule a training.

Creating Captions

There are 3 steps to creating captions: transcribing the content, breaking it into caption blocks, and aligning the caption blocks to the video. A variety of software programs can help with each step in this process.

Throughout this process, you should review the captions for completeness, accuracy, synchronicity, and placement.

Transcribing the Content

  • If you create your own videos, we recommend writing a script ahead of time so that you can reuse it as the transcript when you create captions later.
  • No software will be consistently accurate enough to transcribe audio on its own. You will need to review and correct the output of any machine-aided transcription.
  • Software and Programs for Machine-Aided Transcription

Breaking into Caption Blocks

  • You will need to split apart your transcript into separate captioning blocks that will appear on-screen, one after the other. For easier reading, try to avoid splitting your captions in the middle of a phrase.
  • If you are manually transcribing your audio in a captioning program like Amara, you can create each caption block in the software as you transcribe.
  • If you upload a transcript to YouTube’s caption editor and choose to “set timings”, it will automatically break the transcript into blocks for you.
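
If you prefer to prepare the blocks yourself before loading them into an editor, the splitting step can be roughly sketched in Python. This is only an illustration, not how Amara or YouTube work internally: the function name `split_into_blocks` and the 32-character line limit are assumptions made for the example, and the only rule it applies is to break at punctuation before wrapping, so that blocks rarely end in the middle of a phrase.

```python
import re
import textwrap

MAX_CHARS_PER_LINE = 32  # assumed limit for this sketch; adjust to your style guide
MAX_LINES_PER_BLOCK = 2  # the two-line limit recommended on this page


def split_into_blocks(transcript):
    """Split a transcript into caption blocks of at most two short lines."""
    # Break after sentence or clause punctuation first, so that blocks
    # rarely end in the middle of a phrase.
    phrases = re.split(r"(?<=[.!?;,:])\s+", transcript.strip())
    blocks = []
    for phrase in phrases:
        if not phrase:
            continue
        # Wrap each phrase to the line limit.
        lines = textwrap.wrap(phrase, width=MAX_CHARS_PER_LINE)
        # A long phrase may wrap to more than two lines; emit two at a time.
        for i in range(0, len(lines), MAX_LINES_PER_BLOCK):
            blocks.append("\n".join(lines[i:i + MAX_LINES_PER_BLOCK]))
    return blocks


if __name__ == "__main__":
    sample = ("Welcome to the course. Today we will cover captions, "
              "which make video accessible.")
    for block in split_into_blocks(sample):
        print(block)
        print("---")
```

The output still needs a human read-through: splitting at every comma can produce very short blocks, and a phrase with no punctuation is broken purely by length.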

Syncing to the Audio

  • This is the process in which the caption blocks are assigned start and end times so that they appear at the correct part of the video. Many subtitle programs require you to do this manually.
  • YouTube will do this process automatically if you have a transcript prepared and select “set timings”. The results may not be perfectly accurate; check any long gaps in time or blocks with non-speech sounds to ensure they are aligned accurately.
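
To make the idea of start and end times concrete, here is a small Python sketch of what a timed caption file looks like once each block has been assigned its times. It assumes you have already chosen the times by hand; the helper names `format_srt_time` and `build_srt` are made up for this example. The output follows the common SRT layout: a cue number, a line with the start and end times, then the caption text.

```python
def format_srt_time(seconds):
    """Convert a time in seconds to the SRT form HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def build_srt(cues):
    """Build SRT text from a list of (start_seconds, end_seconds, text) cues."""
    parts = []
    for index, (start, end, text) in enumerate(cues, start=1):
        timing = f"{format_srt_time(start)} --> {format_srt_time(end)}"
        parts.append(f"{index}\n{timing}\n{text}\n")
    # Cues are separated by a blank line.
    return "\n".join(parts)


if __name__ == "__main__":
    print(build_srt([
        (0.0, 2.5, "Sarah: Welcome to the course."),
        (2.5, 4.0, "[car honks]"),
    ]))
```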

Review for Quality

Once you create your caption file, you should review it for quality. A short summary of quality issues to check for is listed below. Please reference the Captioning Quality Guidelines for a more extensive list.

  • Identify all changes in speaker (e.g. “Sarah:”, “Man”, or “>>” if the speaker’s name is unknown).
  • Add any meaningful non-speech sounds in brackets (e.g. [car honks]).
  • Ensure all spoken content is transcribed exactly, not paraphrased.
  • Do not include more than 2 lines of text per caption block.
  • Ensure the caption blocks appear long enough to be easily read; generally they should appear for at least 1 second.
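
The two measurable items in this list (the line limit and the minimum display time) can be checked automatically. The Python sketch below assumes the captions are already available as (start, end, text) values in seconds; the function name `find_quality_issues` is hypothetical. Speaker identification, non-speech sounds, and verbatim accuracy still require a human reviewer.

```python
MIN_DURATION_SECONDS = 1.0  # minimum on-screen time suggested on this page
MAX_LINES = 2               # maximum lines of text per caption block


def find_quality_issues(cues):
    """Return (cue_number, message) pairs for cues that break either rule.

    cues is a list of (start_seconds, end_seconds, text) tuples.
    """
    issues = []
    for number, (start, end, text) in enumerate(cues, start=1):
        line_count = len(text.splitlines())
        if line_count > MAX_LINES:
            issues.append((number, f"has {line_count} lines of text"))
        duration = end - start
        if duration < MIN_DURATION_SECONDS:
            issues.append((number, f"is on screen for only {duration:.2f} seconds"))
    return issues


if __name__ == "__main__":
    sample = [
        (0.0, 2.0, "Sarah: Hello."),
        (2.0, 2.5, "[car honks]"),
        (2.5, 6.0, "line one\nline two\nline three"),
    ]
    for number, message in find_quality_issues(sample):
        print(f"Caption {number} {message}")
```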

Save or Export Your File

If you create your captions in software other than the media player that will display them, you will need to save or export the captions from the caption editor so that they can be uploaded to the destination media repository.

Captioning files are typically saved with one of the following extensions: .srt, .vtt, .sbv, .dfxp, .sami, or .ttml. SRT is the simplest format and can easily be edited by anyone using a text editor. However, it does not support features like vertical caption placement or text markup. If those features are required, VTT is recommended for ease of editing.
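
Because SRT and VTT are both plain text and closely related, a basic SRT file can be turned into a VTT file with very little work. The Python sketch below is a simplified illustration; the function name `srt_to_vtt` is made up for the example. It relies on two differences between the formats: a VTT file begins with a "WEBVTT" header line, and its timestamps use a period rather than a comma before the milliseconds. It does not add placement or markup; those would be edited into the VTT file afterwards.

```python
import re

# Matches an SRT timestamp such as 00:01:02,500 so the comma can be swapped.
SRT_TIMESTAMP = re.compile(r"(\d{2}:\d{2}:\d{2}),(\d{3})")


def srt_to_vtt(srt_text):
    """Convert simple SRT caption text to WebVTT text."""
    # Normalize line endings and drop a leading byte-order mark, if any.
    body = srt_text.replace("\r\n", "\n").lstrip("\ufeff").strip()
    lines = []
    for line in body.split("\n"):
        # Only timing lines are changed; caption text keeps its commas.
        if "-->" in line:
            line = SRT_TIMESTAMP.sub(r"\1.\2", line)
        lines.append(line)
    # The header must be followed by a blank line.
    return "WEBVTT\n\n" + "\n".join(lines) + "\n"


if __name__ == "__main__":
    srt = (
        "1\n00:00:00,000 --> 00:00:02,500\nHello, world\n\n"
        "2\n00:00:02,500 --> 00:00:04,000\n[car honks]\n"
    )
    print(srt_to_vtt(srt))
```

The numeric cue numbers from the SRT file are left in place, since VTT permits an optional identifier line before each timing line.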

If you create your captions in a captioning editor like YouTube or Amara, you can export your file to a variety of caption formats which can then be uploaded to Kaltura, Vimeo, or any other player that accepts standard caption formats.

Creating Captions with Others

If you would like to work with a group to create a caption file, there are a few ways to do it.

  • Amara Public Editor: Amara lets any user contribute to an existing captioning project, and logs contributions by username. You don’t have final control over which changes are approved; any submission is approved automatically, but you have access to every version that has been uploaded.
  • YouTube community contributions: You can turn on this feature in your video settings, which allows anyone viewing the video to contribute captions. You will have final control over which suggestions are implemented.