Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Healthcare Analytics Made Simple Techniques in healthcare computing using machine learning and Python

Product type Paperback

Published in Jul 2018

Publisher Packt

ISBN-13 9781787286702

Length 268 pages

Edition 1st Edition

Languages

Python

Tools

Scikit-learn

Concepts

Data Analysis

Authors (2):

Vikas (Vik) Kumar

Shameer Khader

View More author details

Table of Contents (11) Chapters

Preface

1. Introduction to Healthcare Analytics

2. Healthcare Foundations FREE CHAPTER

3. Machine Learning Foundations

4. Computing Foundations – Databases

5. Computing Foundations – Introduction to Python

6. Measuring Healthcare Quality

7. Making Predictive Models in Healthcare

8. Healthcare Predictive Models – A Review

9. The Future – Healthcare and Emerging Technologies

10. Other Books You May Enjoy

Leave a review - let other readers know what you think

Splitting the data into train and test sets

Now that we have our response variable, the next step is to split the dataset into train and test sets. In data science, the training set is the data that is used to determine the model coefficients. In the training phase, the model takes into account the predictor variable values together with the response value to "discover" the rules and the weights that will guide the prediction of new data. The testing set is then used to measure our model performance, as we discussed in Chapter 3, Machine Learning Foundations. Typical splits use 70-80% for the training data and 20-30% for the testing data (unless the dataset is very large, in which case a smaller percentage can be allotted toward the testing set).

Some practitioners also have a validation set that is used to train model parameters, such as the tree size in the random...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (2)

Khader

See other products by Khader

Kumar

Ashish Kumar is a seasoned data science professional, a publisher author and a thought leader in the field of data science and machine learning. An IIT Madras graduate and a Young India Fellow, he has around 7 years of experience in implementing and deploying data science and machine learning solutions for challenging industry problems in both hands-on and leadership roles. Natural Language Procession, IoT Analytics, R Shiny product development, Ensemble ML methods etc. are his core areas of expertise. He is fluent in Python and R and teaches a popular ML course at Simplilearn. When not crunching data, Ashish sneaks off to the next hip beach around and enjoys the company of his Kindle. He also trains and mentors data science aspirants and fledgling start-ups.

See other products by Kumar