Last month, 2Hz introduced an app called krisp which was featured on the Nvidia website. It uses deep learning for noise suppression and is powered by krispNet Deep Neural Network. krispNet is trained to recognize and reduce background noise from real-time audio and yields clear human speech.
2Hz is a company which builds AI-powered voice processing technologies to improve voice quality in communications.
Many edge devices from phones, laptops, to conferencing systems come with noise suppression technologies. Latest mobile phones come equipped with multiple microphones which helps suppress environmental noise when we talk.
Generally, the first mic is placed on the front bottom of the phone to directly capture the user’s voice. The second mic is placed as far as possible from the first mic. After the surrounding sounds are captured by both these mics, the software effectively subtracts them from each other and yields an almost clean voice.
The limitations of multiple mics design:
The traditional Digital Signal Processing (DSP) algorithms also work well only in certain use cases. Their main drawback is that they are not scalable to variety and variability of noises that exist in our everyday environment.
This is why 2Hz has come up with a deep learning solution that uses a single microphone design and all the post processing is handled by a software. This allows hardware designs to be simpler and more efficient.
There are three steps involved in applying deep learning to noise suppression:
krispNet is trained with a very large amount of distinct background noises and clean human voices.
Read more in detail about how we can use deep learning in noise suppression on the Nvidia blog.
Samsung opens its AI based Bixby voice assistant to third-party developers
Voice, natural language, and conversations: Are they the next web UI?
How Deep Neural Networks can improve Speech Recognition and generation