Feature engineering options
In this section, we will demonstrate how feature engineering can lead to better predictive models. Second only to data cleaning, typically, feature engineering is the most time-consuming of all tasks involved in the modeling process. It can also be the "secret sauce" that makes for a great predictive model.
So, what does feature engineering mean? Put simply, it is how to extract information from raw data into a form that is both usable by the modeling algorithm and interpretable for the problem at hand. For example, a date or date-time object might be represented in data as a string or a number (for example, Unix time is the number of seconds since 00:00:00 UTCÂ on January 1, 1970). Presented with such features, an algorithm is liable to treat dates as levels of a categorical variable or a continuous numeric value. Neither of these forms is very helpful. However, embedded in this raw data is information about not only the day, the month...