3 Innovative Ways to Approach English Descriptive Audio

August 10, 2018 BY ELISA LEWIS
Updated: October 14, 2022

3.5% of the world’s population lives with vision impairment, making audio description, also referred to as English Descriptive Audio, an important component of society’s fast-growing body of video content. A major barrier to audio description is cost, since traditional methods require hiring voice actors to record the audio in a studio. Here are three innovative ways to approach English Descriptive Audio.

1. Synthesized speech

Traditional description requires human labor for the entire workflow. Once the description transcript is written, human voice actors are hired to record it. This method is extremely costly and time-consuming. Synthesized speech removes the need for voice recording and editing entirely. Skipping these tedious steps allows audio description to be created much faster and at a lower cost.

3Play Media takes this approach to description, using a combination of humans and technology. Certified human describers write high-quality descriptions, and synthesized speech is then used to vocalize them. Combining human writing and editing with advanced technology significantly decreases the cost of audio description without sacrificing quality. In fact, synthesized speech has many benefits.

The sound of synthesized speech is familiar to most blind and low vision users who are accustomed to fast-paced and mechanized audio from using screen readers.

Additionally, with synthesized speech, the user is in complete control, as the synthesized speech will vocalize the exact description written by the human describer. Likewise, if you decide that you would like to make a tweak to the description, you can do so without having to re-hire and pay a voice actor for their time in the studio.

2. Human voice narration

There are a number of ways to create audio description on your own. If you are creating a talking-head video with a slide deck, such as a classroom lecture, you can narrate the visual information in the slides or in the background as it appears while you record the original video. Thoroughly describing what is on screen during recording eliminates the need to go back afterwards and add descriptions. This is a great way to cut costs.

Note that if you do choose to go this route, it’s important to consider the best practices for creating accessible video for blind and low vision users. These include verbally covering all displayed visual information, identifying the speaker and speaker changes, explaining any participation by audience members, and taking frequent pauses.

3. WebVTT or text file

Even if you’re not creating a talking-head video, there are still a few ways to create audio description yourself. For instance, you can create a text-only description: a written version of all the visual information in the video. While this is certainly easier, it loses some of the cinematic detail for the viewer, and it doesn’t provide quite the same level of accommodation.

To make this slightly more accessible, you can create a text-only description that is time-coded, just as you would time-code a caption file. When doing this, be sure that each description fits into the natural pauses of the video. Next, you can use this document to create a WebVTT file, which is similar to a caption file but carries description instead. HTML5 supports WebVTT files natively through its track element.
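As a rough illustration of the step above, here is a minimal Python sketch that turns a list of time-coded descriptions into WebVTT text. The `Cue` class, the `format_timestamp` and `build_webvtt` helpers, and the sample descriptions are all hypothetical names and data made up for this example. They are not part of any 3Play Media tool, and the sketch does not check whether a description actually fits the pause it is assigned to.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One time-coded description (hypothetical structure for this sketch)."""
    start: float  # seconds from the start of the video
    end: float    # seconds from the start of the video
    text: str


def format_timestamp(seconds: float) -> str:
    """Format seconds as a WebVTT timestamp, hh:mm:ss.ttt."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def build_webvtt(cues: list[Cue]) -> str:
    """Return the contents of a WebVTT file for the given cues."""
    lines = ["WEBVTT", ""]  # required header, then a blank line
    for cue in sorted(cues, key=lambda c: c.start):
        if cue.end <= cue.start:
            raise ValueError(f"Cue must end after it starts: {cue.text!r}")
        lines.append(f"{format_timestamp(cue.start)} --> {format_timestamp(cue.end)}")
        lines.append(cue.text)
        lines.append("")  # a blank line separates cues
    return "\n".join(lines)


# Made-up sample descriptions placed in pauses of an imaginary lecture video.
sample_cues = [
    Cue(4.0, 7.5, "A title slide reads: Introduction to Biology."),
    Cue(75.5, 79.0, "The instructor points to a diagram of a cell."),
]
vtt_text = build_webvtt(sample_cues)
```

Saving `vtt_text` to a file such as `descriptions.vtt` (an assumed file name) produces something that can be referenced from an HTML5 track element with `kind="descriptions"`. How browsers and players present description tracks varies, so it is worth testing with the player and assistive technology your audience actually uses.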

Similarly, you could create a text-only merged transcript and description: a text document that contains both the transcript of the audio in your video and the description of its visual information. This is a really helpful accommodation for deaf-blind viewers, but it’s not time-synchronized, so it is the equivalent of providing only a transcript for a video instead of captions.
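One way to assemble such a merged document is sketched below in Python, assuming you already have the transcript and the descriptions as lists of (start time in seconds, text) pairs. The function name, the "[Description: ...]" label, and the sample data are illustrative choices for this example, not an established convention.

```python
def merge_transcript_and_description(transcript, descriptions):
    """Interleave transcript lines and descriptions in playback order.

    Both arguments are lists of (start_seconds, text) pairs. Start times
    are used only for ordering; the output is plain text with no timestamps.
    """
    # The middle value breaks ties so a description comes before speech
    # that starts at the same moment.
    entries = [(start, 1, text) for start, text in transcript]
    entries += [(start, 0, f"[Description: {text}]") for start, text in descriptions]
    entries.sort(key=lambda entry: (entry[0], entry[1]))
    return "\n\n".join(text for _, _, text in entries)


# Made-up sample data for an imaginary lecture video.
transcript = [
    (0.0, "Welcome to the lecture."),
    (12.0, "Let's look at the chart."),
]
descriptions = [
    (10.0, "A bar chart appears on the slide."),
]
merged = merge_transcript_and_description(transcript, descriptions)
```

Because the result carries no timestamps, it works as a standalone reading document rather than as something a video player can synchronize.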
