How to Transcribe Audio Content
Updated: June 6, 2019
So you want to transcribe audio content? Well, you’ve come to right place. Whether you go through a third-party transcription service or DIY (do it yourself), it’s important to weigh the pros and cons, and choose which option works best for you.
There are a wide range of benefits to transcribing audio content:
Additionally, based on the Americans with Disabilities Act and Section 504 and 508 of the Rehabilitation Act, many businesses and organizations are legally required to create transcripts for their content. WCAG 2.0 is a set of guidelines put in place by the World Wide Web Consortium to make digital content more accessible for users, including those with disabilities. WCAG 2.0 has three levels of compliance: Level A, AA, and AAA. Section 508 has been revised to be compliant with WCAG 2.0 Level A and AA. According to the lowest level, Level A, transcripts are a recommendation for audio-only content.
We’ll provide the different resources you’ll need to transcribe audio content, and help you determine the most viable choice based off your budget, time, and particular needs. Good luck and happy transcribing!
Manually transcribing audio can be a daunting task, especially when you have longer forms of content. It usually takes 5-6x the actual time of the content. Luckily, there are many free and low-cost tools available to help simplify the process. Before you begin transcribing, make sure you capture clear and loud audio. This will help to reduce red flags and inaudibles in your transcript.
If you host your audio content on YouTube, you can utilize the free automatic video transcript tool. It automatically transcribes audio into text, but keep in mind that it comes with a lot of errors. Transcripts produced by YouTube’s tool are too inaccurate to be used on their own. Therefore, it’s highly recommended to clean them up since they can hurt your video accessibility and ranking on search engine results pages (SERP).
Here’s how to leverage YouTube’s automatic video transcript:
- From the video manager, select your video and click Edit > Subtitles and CC. Select Add Subtitles or CC and choose your language.
- Select Transcribe and Set Timings, and type the transcript in the space provided. YouTube will automatically pause the video as you type so you can transcribe more quickly and accurately.
- Once you are satisfied, select Set Timings. This will sync your transcript with the video. You may always edit once the transcript is published.
Similarly, you can create a transcript beforehand and upload it onto YouTube:
- First, create a transcript with YouTube’s recommendations for formatting.
- Go to the Video Manager in YouTube and click Edit > Subtitles and CC. Select Add Subtitles or CC and choose your language.
- Choose Upload a File, select Transcript, and choose your .txt file for upload.
- Once your transcript has uploaded, click Set Timings to sync your transcript with the video and create closed captions. You may always edit once the transcript is published.
You can also download the transcript file later with timings as a caption file:
- Go to the video that you would like to download the transcript from. Click on the More Actions button (3 horizontal dots). Hint: it’s located next to the share button.
- Select the Transcript option.
- A transcript of the closed captions with the time codes will automatically generate.
Automatic Speech Recognition, otherwise known as ASR, is a technology that picks up on human speech and converts it into text. You can upload your media onto an ASR software, and it will automatically transcribe audio into text. This method still comes with many errors, but it’s much easier and faster to clean up an inaccurate transcript than to start from scratch. There are many options for transcription softwares that are free or for a small cost, such as Express Scribe, EureScribe, Dragon NaturallySpeaking, and many more.
Google offers an awesome feature to transcribe audio content. If you don’t have a Gmail account, you can sign up for one free of charge. If you have an existing account, you already have access to a feature called Google Docs: it’s a word processing tool where you can create text documents right in your web browser. Using voice typing, you can transcribe audio into text. Like many of the other manual transcription tools, there will be errors so make sure to clean it up before using it. Follow these steps to create your transcript:
- Using any browser of your choice, go to the Google Docs website and Start a New Document.
- Click on Tools and select Voice Typing. It will enable voice recognition.
- Click the Microphone icon on the left to activate Voice Typing. Google will transcribe anything being said onto the word document.
Another way to transcribe audio content is by using your smartphone. Similar to Google Docs, the microphone will pick up on the audio and transcribe it into text. Transcribing on your smartphone tends to work a little better than Google Docs since the microphone on your phone picks up less background noise, however, it still doesn’t compare to a high-quality microphone. Recording on your smartphone won’t ensure a high accuracy rate, so you will have to clean up the final transcript when you finish. Here are step-by-step instructions on how to transcribe audio using your smartphone:
- Open up a Word Processing app on your smartphone.
- On the keyboard of your smartphone, select the Microphone button and it will start recording.
- Hold your phone near your computer or other device, and Playback the Video. Your phone will automatically transcribe audio into text.
Pros vs. Cons of DIY Transcripts
Another option to transcribe audio content is to use a third-party transcription service. If you’re looking for high-quality, accurate transcripts, this is definitely the way to go!
3Play Media offers a 3-step transcription process that utilizes both technology and human transcriptionists, ensuring a 99.6% accuracy rate. When the audio file consists of difficult content, has background noise, or accents, the accuracy rate decreases. ASR typically provides 60-70% accuracy, so the use of human transcriptionists is what distinguishes 3Play from the rest.
Our patented technology utilizes ASR to automatically produce a rough transcript, which is useful for creating accurate timings, even if the words are incorrect. Using proprietary software, our extensive database of transcriptionists with backgrounds in a variety of subjects, go through and edit the transcript. All of our transcriptionists are qualified and U.S. based, which is important for understanding all the nuances of your content. Finally, after going through the editing process, your file goes through a final review called quality assurance. This is reviewed by our top editors, and ensures your transcript is virtually flawless.
One feature we also offer is the 3Play Interactive Transcript. It allows users to interact with your video by searching the video, navigating by clicking on any word, and reading along with the audio. Interactive transcripts make your content more accessible and improves the user experience.
Pros vs. Cons of a Transcription Service:
Transcription Best Practices
Now that you have a better understanding on manual transcription vs. a transcription service, you can make an informed decision. No matter which option you choose, it’s important to know how to make the most out of your transcripts.
- Grammar and Punctuation: ensure that there are no errors in your transcript so that it is easy to read.
- Speaker Identification: use speaker labels to identify who is speaking, especially when there are multiple speakers.
- Non-Speech Sounds: communicate non-speech sounds in transcripts. They are typically denoted with [square brackets]
- Verbatim: transcribe audio content as close to verbatim as possible. Leave out filler words such as “um” or “like”, unless they’re intentionally included in the audio.
How to Add Captions and Audio Description to Wistia Videos
Wistia is a video software company that believes when businesses create an immersive viewing experience for viewers, it can build brand affinity. What better way to create an immersive viewing experience than with accessibility? Captioning and audio description are two great accessibility…
How to Design a Sustainable Process for Accessible Web Design
Designing an accessible website is more important than ever before. In fact, 71% of people with disabilities leave a website immediately if it’s not accessible. Slow a steady wins the race, and accessible web design is more than just a one-time auditing…
How to Get Started with 3Play Media’s Audio Description
Audio description is an audio track that narrates the relevant visual information in media and assumes the viewer cannot see the onscreen content. It can be compared to a sports broadcaster narrating the visual action of a game over audio. The descriptions…