オープンデータとプログラミング

音声認識における新たなマイルス トーンに達する 深い学習 技術 IBM を使用して|Using Deep Learning Technologies IBM Reaches a New Milestone in Speech Recognition

オープンデータ関連のニュースです。

https://www.infoq.com/news/2017/03/ibm-speech-recognition

日本語

TranslateApiException: Cannot find an active Azure Market Place Translator Subscription associated with the request credentials. : ID=1250.V2_Json.Translate.52154690
続きを読む…

English

The research team at IBM recently announced they’ve reached a new industry record in speech recognition with a word error rate of 5.5% using the SWITCHBOARD linguistic corpus. This brings it closer to what’s considered to be the human error rate of 5.1%. Humans typically miss one to two words out of every 20 words they hear. In a five-minute conversation, that could be as many as 80 words. The research project includes applying deep learning technologies and incorporating acoustic models. The speech recognition model used Long Short Term Memory (LSTM) and WaveNet language models with a score fusion of three acoustic models.
Read more…

Comments are closed.