abecedarius 3 days ago
For the hard of hearing like me, the killer application would be live transcription in a noisy setting like a meetup or party, with source separation and grouping of speech by speaker. Could be life-changing. (Android's Live Transcribe is very good now, but it doesn't even try to separate which words come from which speaker.)
adolph 3 days ago
*Automatic speech recognition (ASR) systems have progressed to the point where humans can interact with computing devices using speech. However, the distance between a device and the speaker will cause a loss in speech quality and therefore impact the effectiveness of ASR performance. As such, there is a greater need to have reliable voice capture for far-field speech recognition. The launch of Amazon Echo devices prompted the use of far-field ASR in the consumer electronics space, as it allows its users to interact with the device from several meters away by using microphone array processing techniques.* https://assets.amazon.science/da/c2/71f5f9fa49f585a4616e49d5...