Chapter 2. Data Preparation – Select
In this chapter, we will cover:
- Using the Feature Selection node creatively to remove or decapitate perfect predictors
- Running a Statistics node on an anti-join to evaluate the potential missing data
- Evaluating the use of sampling for speed
- Removing redundant variables using correlation matrices
- Selecting variables using the CHAID Modeling node
- Selecting variables using the Means node
- Selecting variables using single-antecedent Association Rules