Data profiling
There is a handy function in Optimus called profile
that returns useful stats about our dataset. Let's see how to use it:
df.profile(bins=5)
This code will return a dictionary:
{'columns': {'id': {'stats': {'match': 504, 'missing': 0, 'mismatch': 0, 'profiler_dtype': {'dtype': 'int', 'categorical': True}, 'frequency': [{'value': 1, 'count': 1}, {'value': 332, 'count': 1}, {'value': 345, 'count': 1}, {'value': 344, 'count': 1}, {'value': 343, 'count': 1}], 'count_uniques': 504}, 'dtype': 'int64...