Applying suggestions
Once we get the clusters from our data, we can start selecting which suggestions we'll be using in our transformed dataset. We can obtain the clusters by running the following code:
clusters = df.string_clustering("name", "fingerprint")
In the previous example, we're getting clusters
, which is a custom class with a Python dictionary in it. Let's look at a representation of what we're storing in clusters
, as follows:
print(clusters)
This would give us the following output:
{     "name": {         "johndoe": {             "suggestions": [                 "John Doe", "john doe", "doe, john", "jóhn dóe"       ...