Defining a data modelling strategy
I was perhaps too hasty proposing this solution to Mr Clough—he is a great professional, but I have never heard of one of his requests going unsatisfied. Moreover, his words made me think is not excluding the hypothesis of fraud. And this makes me even more nervous, if that's possible.
Nevertheless, we have to do this as if it is business as usual. The point is that we need some kind of data related to default events in the past and the companies that experimented this status. What? They also gave you a dataset about past default events? And you already cleaned it? That is what I call good news. OK, just send it to me and we can start to work on it immediately.
clean_casted_stored_data_validated_complete
, uh? You don't fear long names, do you? Just run glimpse
 on it and see what is inside:
glimpse(clean_casted_stored_data_validated_complete) Observations: 11,523 Variables: 16 $ attr_3 <dbl> 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,...