The Current State of Automatic Speech Recognition: A Report
3Play Media conducts annual research – which is then compiled into a report – to provide the most up-to-date findings on the current state of automatic speech recognition (ASR) technologies as it applies to captioning and transcription.
One of the main reasons we produce this research every year is to make sure we are using the state of the art, highest performing ASR technology for our task of transcription.
The state of ASR in 2020 was of particular interest, as the demand for video increased rapidly, and thus, so did the demand for captioning solutions. One way artificial intelligence (AI) is being applied to video is with automatic speech recognition. The 3Play Media captioning process relies on ASR as the first step in our 3-step process, followed by two rounds of human cleanup by professional editors. Having the best possible ASR technology as the starting point allows our editors to focus on the nuances of captions, ultimately leading to a better product.
The results in this report come from research in which we investigate the current state of ASR technology with specific regard to captioning accuracy. Our research tested the most popular ASR technologies across content from eCommerce, higher education, fitness, media and entertainment, and enterprise industries. To test these engines, we used a large dataset representative of 3Play Media’s diverse customer base. In this study we measured both word error rate (WER) and formatting error rate (FER) which are both necessary to gain a complete picture of accuracy for our purposes.
Along with several other key findings, the results from this year’s research revealed a significant reduction in word error rate from Speechmatics V1 API (SMX) to Speechmatics V2 API (SMX+), and has thus led us to make the move from SMX to SMX+.
Read the full 2020 State of Captioning Report to learn more about our findings and discover what you can expect from both 3Play Media and ASR engines in 2021 and beyond.
Gil v. Winn-Dixie: A Landmark Ruling is Reversed
On April 7, 2021, the U.S. Court of Appeals for the 11th Circuit overturned Gil v. Winn-Dixie Stores, Inc., a landmark case for web accessibility. Read on to learn more about the original case, the disappointing appellate opinion, and what this implies…
Video and Closed Captioning Trends for 2021
The 2021 State of Captioning report is now available to read in full. In it, you’ll find some of the top video and closed captioning trends for this year. The State of Captioning (SOC) is an annual report by 3Play Media and…
NAD v. Netflix ADA Lawsuit Requires Captioning for Streaming Video
The internet offers a unique challenge: how do we ensure that all of our digital products, services, and communications are accessible to people with disabilities? What are companies required to do to accommodate such users? Federal disability laws still await comprehensive updates…