Search icon CANCEL
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Save more on your purchases!
Savings automatically calculated. No voucher code required
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Microsoft Azure Machine Learning

You're reading from  Microsoft Azure Machine Learning

Product type Book
Published in Jun 2015
Publisher
ISBN-13 9781784390792
Pages 212 pages
Edition 1st Edition
Languages
Authors (2):
Sumit Mund Sumit Mund
Profile icon Sumit Mund
Christina Storm Christina Storm
Profile icon Christina Storm
View More author details

Table of Contents (21) Chapters

Microsoft Azure Machine Learning
Credits
About the Author
Acknowledgments
About the Reviewers
www.PacktPub.com
Preface
Introduction ML Studio Inside Out Data Exploration and Visualization Getting Data in and out of ML Studio Data Preparation Regression Models Classification Models Clustering A Recommender System Extensibility with R and Python Publishing a Model as a Web Service Case Study Exercise I Case Study Exercise II Index

Model development


You have to predict whether the flight would be delayed or not. As you found from the dataset, any flight delayed for more than 15 minutes has been labeled as delayed and the ArrDelay15 corresponding label contains 1. Here, the ArrDelay15 column is the target variable and it only contains 0 and 1. Clearly, it's a two-class classification problem.

As you have already explored, there are several two-class classification algorithms available in ML Studio. For simplicity, we would just build the model here with the Two-Class Boosted Decision Tree module with the following parameters:

  • The Maximum number of leaves per tree option is set at 128

  • The Minimum number of samples per leaf node option is set at 50

  • The Learning rate option is set at 0.2

  • The Number of trees constructed option is set at 500

You are encouraged to try out different algorithms and also use the Sweep Parameters module to choose the optimum parameters.

To train the model, you need to split the dataset and use one...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at €14.99/month. Cancel anytime}