Cloud Speech API
A powerful API from GCP! This enables the user to convert speech to text by using a neural network model. This API is used to recognize over 100 languages throughout the world. It can also supports filter of unwanted noise/ content from a text, under various types of environments. It supports for context-awareness recognition, works on any device, any platform, anywhere, including IoT. It has features like Automatic Speech Recognition (ASR), Global Vocabulary, Streaming Recognition, Word Hints, Real-Time Audio support, Noise Robustness, Inappropriate Content Filtering and supports for integration with other APIs of GCP.
The architecture of the Cloud Speech API is as follows:
![](https://static.packt-cdn.com/products/9781788839686/graphics/49858c01-fbae-4f71-b6f4-e12c0883610a.png)
In other words, this model enables speech to text conversion by ML.
The components used by the Speech API are:
- REST API or Google Remote Procedure Call (gRPC) API
- Google Cloud Client Library
- JSON API
- Python
- Cloud DataLab
- Cloud Data Storage
- Cloud Endpoints
The applications of the model include:
- Voice user interfaces...