So far, we have only considered machine learning algorithms that mostly operate on numerical inputs. If we want to use text, we must find a way to convert the text into numbers. There are many ways to do this, and we will explore a few common ones in this chapter.
If we consider the sentence "TensorFlow makes machine learning easy", we could convert the words to numbers in the order in which we observe them. The sentence would then become 1 2 3 4 5. When we see a new sentence, "machine learning is easy", we can translate it as 3 4 0 5, using an index of zero for words we haven't seen. With these two examples, we have limited our vocabulary to six numbers: the indices 1 through 5 for the known words, plus 0 for everything else. With large pieces of text, we can choose how many words we want to keep. We usually keep the most frequent words and label everything else with the zero index.
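The indexing scheme described above can be sketched in a few lines of plain Python. This is a minimal illustration, not a TensorFlow API: the helper names `build_vocab` and `encode` and the `max_words` parameter are hypothetical, and lowercasing and splitting on whitespace are assumptions made to keep the example short.

```python
from collections import Counter


def build_vocab(sentences, max_words=None):
    """Map words to indices 1, 2, 3, ... in the order they are first observed.

    If max_words is given, only the max_words most frequent words are kept.
    Index 0 is reserved for words that are not in the vocabulary.
    """
    # Assumption: lowercase and split on whitespace as a simple tokenizer.
    words = [w for s in sentences for w in s.lower().split()]
    counts = Counter(words)
    # most_common(None) returns every word, so no limit is applied by default.
    kept = {w for w, _ in counts.most_common(max_words)}
    vocab = {}
    for w in words:
        if w in kept and w not in vocab:
            vocab[w] = len(vocab) + 1
    return vocab


def encode(sentence, vocab):
    """Translate a sentence into indices, using 0 for unseen words."""
    return [vocab.get(w, 0) for w in sentence.lower().split()]


vocab = build_vocab(["TensorFlow makes machine learning easy"])
print(encode("TensorFlow makes machine learning easy", vocab))  # [1, 2, 3, 4, 5]
print(encode("machine learning is easy", vocab))                # [3, 4, 0, 5]
```

Passing `max_words` mimics keeping only the most frequent words of a larger text: any word outside that set is encoded as 0, exactly like a word that was never seen.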
If the word learning has a numerical...