How Much Does It Cost to Do Closed Captioning In-House?

January 28, 2016 BY EMILY GRIFFIN
Updated: January 4, 2018

When it comes to closed captioning, cost is always a concern. Often an organization will look within to tackle the task. Maybe an intern or a grad student would be willing and able to transcribe and caption video to make it accessibility.

The assumption is that professional captioning services are expensive, and that it is more cost-effective caption in-house.

Is this really true?

Is it always cheaper and better to caption your own videos instead of sending them to a professional captioning company?

Let’s find out.

In-House Captioning Cost Calculation

Let’s walk through each step in the in-house captioning workflow and estimate its cost.

First, we should define the requirements for a successfully captioned video file. For most web-based video content, a video can be captioned using a small, external file that does not require any additional encoding or authoring of the video itself. That caption file is essentially a transcript that is broken up into caption frames with timecodes to denote when each caption frame should show up.

There are three main components in creating captions for video content: transcribing the video, synchronizing the text, controlling quality, and then managing the overall process.

1. Video Transcription

Let’s start with the first, and most time-consuming, task: video transcription. Traditionally, it takes a trained transcriptionist four to five hours to transcribe one hour of normal audio or video content.

If this task is to be done in-house, only a large corporation will be able to afford to hire and manage professional transcriptionists. More likely, for higher education or government, a student or intern will be available to work on video transcription part-time.

Not only will it take a student longer to transcribe than a professional transcriptionist, but they will also demand training and oversight in order to maintain consistent quality.

A conservative estimate for the transcription portion of our captioning exercise will be five hours. And let’s assume we pay our students $15 per hour.

That’s $75 to transcribe one hour of content.

2. Synchronization

Once you have a video transcript, it needs to be broken up into timed caption frames. There are a number of ways this can be accomplished.

There are free tools that allow a user to create caption frames and transcribe directly into the open fields. Alternatively, you can load a transcript into the tool and pick time points to break up lines.

Automated solutions also exist and can save time, but are extremely dependent on the quality of the audio and the quality of the transcript to properly match and synch the text to the video. YouTube actually offers this for free for any video you upload and have a transcript for.

For analysis purposes, let’s assume the synchronization effort adds 20% to the time requirement. In this case, that would be one more hour, or $15.

We’re now up to $90 per hour.

3. Quality Control

Finally, management and quality control are key factors for an ongoing captioning operation. In order for closed captions to be ADA compliant, they need to be as accurate as possible. If your captions are inaccurate, all that time and money invested in captioning will come up short.

Quality comes into play in two ways: up front training of workers on correct captioning standards and review/error checking after a file is complete. If only a couple videos need to be captioned, these issues may not be as apparent since someone can provide a bit more care and attention without driving up cost too severely. But a continuous workflow absolutely requires these quality considerations.

For a proper review process, it is safe to say that a quality check will take more than the duration of the actual content. So let’s say one and a half hours for the one hour of content. This will likely be done by another student, but at least at the same $15 per hour rate.

That adds $22.50, bringing us up to $112.50 per hour.

4. Operations Management

The last question of management time largely depends on how much content needs to be captioned. That in turn will determine how many students or interns require training and scheduling oversight.

Let’s assume a student or intern can work 20 hours per week. If the fully loaded time to caption one hour of content is 7.5 hours (transcription plus captioning plus QA), then we can’t even get 3 hours of video captioned with one person in a week. Someone has to oversee this growing staff.

Let’s assume we’re dealing with 100 hours of content per month so we can figure out what the management costs might be. One hundred hours per month would require 750 labor hours to complete.

At 20 hours a week, we need 10 people working to complete the task. A single supervisor can likely oversee this group of 10, maybe even 12 to provide some overlap. At $25 per hour for 40 hours per week, a supervisor will cost $16,000 for every 4-month stint – the equivalent of one semester or term of an intern.

The one last piece of management that we haven’t discussed is training.

Transcription and captioning each have a long list of standards that must be followed to produce a consistent output. These standards cover issues such as how to transcribe someone’s false start to a sentence, how to represent numbers and math formulae, and how to identify speaker changes.

Closed captioning has rules about timing and number of characters per line and lines per frame. Student transcriptionists must be trained well in these standards in order to produce adequate captions. A conservative estimate of training time per student worker is $500. Plus, it is likely that a new group of students or interns is coming in every four months and will need training. Total training costs are now $10,000 for two shifts of 10 people.

If we just look at 8 months of the year (one academic year), management and training costs will be $42,000 to cover 800 hours of captioning. Labor fees for the actual transcription and captioning total $90,000.

The total cost of captioning per hour of content is now $165.

Cost Estimate for Closed Captioning Videos In-House

Let’s add up our total costs in the hypothetical scenario outlined above:

In-house captioning costs per house of footage: Transcription: $75; Synchronization: $15; QA: $23;Management: $52

A conservative estimate for an in-house, large-scale captioning operation averages $165 per hour of video. That’s more than double the cost of professional closed captioning, and WAY more hassle.

This model assumes that everything goes smoothly – that 7.5 hours per hour is accurate and that little to no support is required beyond the creation of the files. For example, if it ends up taking 10 hours per hour of content, the cost per hour balloons above $200. At higher scale, management costs also quickly rise.


At lower quantities, in-house captioning may be a good way to save a few dollars.

But at scale, the cost of in-house captioning skyrockets, while quality, consistency, and efficiency become harder to maintain.

Compare your in-house costs to professional closed captioning services: download our pricing and discounts form.

Pricing & Discounts: free download

Read the free report: 2017 State of Captioning.

The closed caption CC icon shown in the middle of a TV.