[Report] So, What’s the Current State of Automatic Speech Recognition?
We often get the question, “When will ASR technology be good enough to replace humans?” The answer really depends on the particular use case. While the current state of technology may work for Siri and Alexa – when it comes to captioning and transcription, human editing is still critical to accuracy. Our 2019 State of Automatic Captioning report explains the ins and outs of why that is.
About the Report
In order to closely follow trends with captioning accuracy, and because ASR is such a critical part of our process at 3Play Media, we are constantly testing to make sure we are using the best automatic speech recognition (ASR) engine. Our results, which investigate the current state of ASR technology with specific regard to captioning accuracy, will be published annually in the State of ASR report.
Our research tested the most popular ASR technologies across content from eCommerce, higher education, fitness, media and entertainment, and enterprise industries. All testing used real content, and lots of it, reflective of the most common type and volume that we receive at 3Play Media.
Some of the best Automatic Speech Recognition systems can achieve accuracy rates in the ‘80s and low ’90s if all conditions align perfectly. These accuracy levels are sufficient for certain applications, such as with personal assistants, where there are a limited number of inputs and outputs. However, when it comes to captioning and transcription, there will need to be some very fundamental advances in machine learning in order to replicate professional human editors.
3Play Media will continue to monitor the landscape for improvement in these technologies, and share those results in our annual report.
Discover more findings in the full 2019 report.
4 Tips for Combatting Zoom Fatigue (When There’s SO Much Video)
Whether or not you’ve heard the term before, you likely know what Zoom fatigue feels like. The shift to a more remote workforce has resulted in someone joining a Zoom meeting 300 million times every day, meaning so many of us have…
Captions & Interactive Transcripts Boost Student Performance, Study Finds
Instructors often search for out-of-the-box ways to improve student performance in the classroom. These days, due to the pandemic, many classes are conducted virtually and remotely. What strategies or tools can instructors incorporate into their curriculum to support student success and keep…
Audio Description for HBO Max Is Coming Soon
Per a settlement agreement, WarnerMedia Direct, LLC has pledged to increase accessibility for people who are blind and low vision by providing audio description for HBO Max. HBO Max, launched May 27, 2020, is an over-the-top (OTT) American subscription video-on-demand streaming service…