Identify features and uses for speech recognition and synthesis
As we saw in the NLP scenarios section, speech recognition and speech synthesis are NLP tasks that belong to the speech area of AI.
In the following sections, you will explore the AI capabilities of speech recognition and speech synthesis.
Speech recognition
Speech recognition is, simply put, speech-to-text (STT): it uses AI to detect spoken input and output it as written text. It builds on advances in deep learning (DL) techniques and on the availability of large training datasets.
Speech recognition supports uses such as the following:
- Generating text output from users’ spoken input requests
- Generating a text response to a user based on speech input
- Generating audio file narration from a script for a video
- Generating subtitles for an audience
- Generating closed captions for live and recorded videos
- Generating notes from dictation
- Generating text transcripts of audio...
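To make the subtitle and closed-caption uses above more concrete, here is a minimal sketch of what happens after recognition: timed text segments are turned into caption blocks in the widely used SRT subtitle layout. It does not call any speech service. The `Segment` class, the helper functions, and the sample data are all hypothetical and stand in for whatever a real speech-to-text engine would return.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One recognized phrase with timing (hypothetical shape, not a real API)."""
    start: float  # seconds from the start of the audio
    end: float    # seconds from the start of the audio
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used in SRT files."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render recognized segments as numbered SRT caption blocks."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        timing = f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}"
        blocks.append(f"{index}\n{timing}\n{seg.text}")
    # Caption blocks are separated by a blank line.
    return "\n\n".join(blocks) + "\n"


# Made-up sample data standing in for real speech-to-text output.
sample = [
    Segment(0.0, 2.5, "Welcome to the session."),
    Segment(2.5, 6.25, "Today we look at speech recognition."),
]
print(to_srt(sample))
```

In a real solution, the segments would come from a speech recognition service rather than from hard-coded values; the formatting step would stay much the same.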