Creating a bootstrap sample
A bootstrap sample is a random sample with replacement, meaning that each record has an equal chance of being selected; after it has been selected, that record has an equal chance of being selected again. Usually, when we select records for training and testing, we sample without replacement, so that each record will appear in only the training or the testing data set.
In this recipe we learn how to build bootstrap samples, a feature not included in Modeler.
Getting ready
This recipe uses the datafile cup98lrn_reduced_vars3.sav
and the stream Recipe – bootstrap one sample.str
.
How to do it...
To create a bootstrap sample:
Open the stream
Recipe – bootstrap one sample.str
by navigating to File | Open Stream.Make sure the datafile points to the correct path to the datafile
cup98lrn_reduced_vars3.sav
.Open the Derive node ,
recordID
. The formula for the node is just the @INDEX function that returns an integer number indicating the record number. Click on OK.Open the supernode...