Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
The Machine Learning Workshop

You're reading from   The Machine Learning Workshop Get ready to develop your own high-performance machine learning algorithms with scikit-learn

Arrow left icon
Product type Paperback
Published in Jul 2020
Publisher Packt
ISBN-13 9781839219061
Length 286 pages
Edition 2nd Edition
Languages
Tools
Arrow right icon
Author (1):
Arrow left icon
Hyatt Saleh Hyatt Saleh
Author Profile Icon Hyatt Saleh
Hyatt Saleh
Arrow right icon
View More author details
Toc

3. Supervised Learning – Key Steps

Activity 3.01: Data Partitioning on a Handwritten Digit Dataset

Solution:

  1. Import all the required elements to split a dataset, as well as the load_digits function from scikit-learn to load the digits dataset. Use the following code to do so:
    from sklearn.datasets import load_digits
    import pandas as pd
    from sklearn.model_selection import train_test_split
    from sklearn.model_selection import KFold
  2. Load the digits dataset and create Pandas DataFrames containing the features and target matrices:
    digits = load_digits()
    X = pd.DataFrame(digits.data)
    Y = pd.DataFrame(digits.target)
    print(X.shape, Y.shape)

    The shape of your features and target matrices should be as follows, respectively:

    (1797, 64) (1797, 1)
  3. Perform the conventional split approach, using a split ratio of 60/20/20%.

    Using the train_test_split function, split the data into an initial train set and a test set:

    X_new, X_test, \
    Y_new, Y_test = train_test_split(X, Y, test_size...
lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime