Technical requirements
Most of the analysis and modeling in this chapter requires access to the DataRobot software. Some data manipulations were carried out using other tools, including MS Excel. The dataset used in this chapter is the House Dataset.
House Dataset
The House Dataset can be accessed at Eman Hamed Ahmed's GitHub account (https://github.com/emanhamed). Each row in this dataset represents a specific house. The initial feature set describes each house's characteristics, price, and ZIP code, along with images of the bedroom, bathroom, kitchen, and frontal view. There was no missing data. We then developed a text description for each house, based on the number of bedrooms, bathrooms, city, country, state, and actual size of the property. In addition, the ZIP codes were converted into latitude and longitude, which were added to the dataset as columns. More information on the base features is provided at the GitHub link, and the data is provided in .csv format.
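The two enrichment steps described above (building a text description and attaching coordinates) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually used for this chapter: the column names (bedrooms, bathrooms, area, city, state, country, zipcode), the sentence template, and the helper names describe_house and add_coordinates are all hypothetical, and the ZIP lookup table holds a placeholder entry rather than real coordinates. In practice, the lookup would be loaded from a ZIP code reference table of your choice.

```python
def describe_house(row):
    """Build a one-sentence text description from a house record.

    The keys used here are assumed column names, not necessarily
    the ones found in the dataset's .csv file.
    """
    return (
        f"{row['bedrooms']}-bedroom, {row['bathrooms']}-bathroom house "
        f"with an area of {row['area']} in "
        f"{row['city']}, {row['state']}, {row['country']}."
    )


def add_coordinates(row, zip_lookup):
    """Return a copy of the record with latitude and longitude columns.

    zip_lookup maps a ZIP code string to a (latitude, longitude) pair.
    ZIP codes missing from the lookup yield None for both columns.
    """
    latitude, longitude = zip_lookup.get(row["zipcode"], (None, None))
    return {**row, "latitude": latitude, "longitude": longitude}


# Placeholder lookup and record: illustrative values only, not real data.
zip_lookup = {"00000": (0.0, 0.0)}
house = {
    "bedrooms": 3,
    "bathrooms": 2,
    "area": 1500,
    "city": "Sampletown",
    "state": "XX",
    "country": "Exampleland",
    "zipcode": "00000",
}

enriched = add_coordinates(house, zip_lookup)
enriched["description"] = describe_house(enriched)
print(enriched["description"])
```

Keeping the ZIP codes as strings, as in this sketch, avoids losing leading zeros, which spreadsheet tools tend to drop when they treat the column as numeric.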
Dataset Citation...