Accuracy Still a Problem for Google’s Ears
Updated: January 4, 2018
As we’ve discussed, speech recognition can be a very powerful tool. But it can’t quite complete the transcription process all on its own. There is still a gap between what it can do and what a high-quality, legible transcript requires. Many have tried to conquer this automated linguistic feat, including Google.
On Friday, David Gallagher of the NY Times started a discussion about Google’s new Google Voice app, which lets users have their voicemail transcribed into text automatically. The app uses an automatic speech recognizer to convert the spoken content into a friendly email format. While some might expect Google to be able to put the speech recognition puzzle together, even Google Voice gives us some entertaining reading material. Yesterday, Mr. Gallagher posted the results of his Google Voice testing.
As much as the speech and AI experts try to model a human’s voice, only a human ear can pick up all the tiny nuances of speech. From dialect to tone to context, so much can go wrong so fast with a machine. There is a lot speech recognition can offer, but there has to be a way to allow a human to be part of the process to ensure quality. And if search or ad delivery is part of the equation (as we might guess with our Google friends), you can imagine what happens to those results when you start with a misguided transcript.