Identifying the gender of a name is an interesting task in NLP. We will use the heuristic that the last few characters of a name are its defining characteristic. For example, a name ending in la is most likely a female name, such as Angela or Layla. On the other hand, a name ending in im is most likely a male name, such as Tim or Jim. As we aren't sure exactly how many trailing characters to use, we will experiment with different lengths.
Identifying the gender of a name
Getting ready
In this recipe, we will use the names corpus to extract labeled names, and then we will classify the gender of each name based on its final characters.
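As a rough sketch of the idea, the snippet below shows one way the suffix heuristic could be expressed as a feature extractor, together with a loader for labeled names. It assumes that the names corpus referred to here is the one shipped with NLTK (files male.txt and female.txt) and that it has already been downloaded; the function names suffix_features and load_labeled_names are hypothetical and chosen only for illustration.

```python
def suffix_features(name, n):
    """Return a feature dict holding the last n characters of a name.

    n is assumed to be at least 1; if the name is shorter than n,
    the whole name is used. The suffix is lowercased so that "LA"
    and "la" are treated as the same feature value.
    """
    return {"suffix": name[-n:].lower()}


def load_labeled_names():
    """Load (name, label) pairs from NLTK's names corpus.

    Assumption: nltk is installed and nltk.download('names') has been run.
    The import is kept inside the function so that suffix_features can be
    tried out without NLTK being available.
    """
    from nltk.corpus import names

    labeled = [(name, "male") for name in names.words("male.txt")]
    labeled += [(name, "female") for name in names.words("female.txt")]
    return labeled
```

Calling suffix_features with different values of n (for example 1, 2, and 3) on the same labeled names is one simple way to compare how much of the ending is worth keeping.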