Packt+ | Advance your knowledge in tech

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Free Learning

IBM SPSS Modeler Essentials

You're reading from IBM SPSS Modeler Essentials Effective techniques for building powerful data mining and predictive analytics solutions

Product type Paperback

Published in Dec 2017

Publisher Packt

ISBN-13 9781788291118

Length 238 pages

Edition 1st Edition

Tools

IBM SPSS

Concepts

Data Mining

Authors (2):

Keith McCormick

Jesus Salcedo

View More author details

Table of Contents (19) Chapters

Title Page

Credits

About the Authors

About the Reviewer

www.PacktPub.com

Customer Feedback

Dedication

Preface

1. Introduction to Data Mining and Predictive Analytics FREE CHAPTER

2. The Basics of Using IBM SPSS Modeler

3. Importing Data into Modeler

4. Data Quality and Exploration

5. Cleaning and Selecting Data

6. Combining Data Files

7. Deriving New Fields

8. Looking for Relationships Between Fields

9. Introduction to Modeling Options in IBM SPSS Modeler

10. Decision Tree Models

11. Model Assessment and Scoring

Identifying and removing duplicate cases

Datasets may contain duplicate records that often must be removed before data mining can begin. For example, the same individual may appear multiple times in a dataset with different addresses. The Distinct node finds or removes duplicate records in a dataset. The Distinct node, located in the Record Ops palette, checks for duplicate records and identifies the cases that appear more than once in a file so they can be reviewed and/or removed.

A duplicate case is defined by having identical data values on one or more fields that are specified. Any number or combination of fields may be used to specify a duplicate:

Place a Distinct node from the Record Ops palette onto the canvas.
Connect the Sort node to the Distinct node.
Edit the Distinct node.

The Distinct node can be a bit tricky to use; this is why we will run this node a couple of times, and hopefully in this way its options will become well-defined. The Mode option controls how the Distinct node is...

The rest of the chapter is locked

Register for a free Packt account to unlock a world of extra content!

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (2)

Keith McCormick

Keith McCormick

Keith McCormick is an independent data miner, trainer, conference speaker, and author. He has been using statistics software tools since the early 90s, and has been conducting training since 1997. He has been data mining and using IBM SPSS Modeler since its arrival in North America in the late 90s. He is also an expert in other packages, IBM's SPSS software suite, including IBM SPSS Statistics, AMOS, and Text Mining. He blogs and reviews related books as well.

See other products by Keith McCormick

Jesus Salcedo

Jesus Salcedo

Jesus Salcedo has a PhD in psychometrics from Fordham University. He is an independent statistical consultant and has been using SPSS products for over 20 years. He is a former SPSS Curriculum Team Lead and Senior Education Specialist who has written numerous SPSS training courses and trained thousands of users.

See other products by Jesus Salcedo

Other recommended products

Related to this chapter

Machine Learning for Data Mining

Machine Learning for Data Mining

Most data mining opportunities involve machine learning and often come with greater financial rewards. This book will help you bring the power of machine learning techniques into your data mining work. By the end of the book, you will be able to create accurate predictive models for data mining.

Apr 2019 8h 24m

Machine Learning for Data Mining

Machine Learning for Data Mining

Most data mining opportunities involve machine learning and often come with greater financial rewards. This book will help you bring the power of machine learning techniques into your data mining work. By the end of the book, you will be able to create accurate predictive models for data mining.

Apr 2019 8h 24m

Learning Alteryx

Learning Alteryx

Alteryx, as a leading data blending and advanced data analytics platform, has taken self-service data analytics to the next level. This book will set you on a self-service data analytics journey that will help you create efficient workflows using Alteryx, without any coding involved. It will empower you and your organization to take well-informed decisions with the help of deeper business insights from the data. You will see how to use the unique features of Alteryx to perform common tasks such as data preparation and blending, and also delve into the more advanced concepts such as performing predictive analytics, before sharing the insights gained with the relevant decision makers. Whether you are a novice with Alteryx or an experienced data analyst keen to explore Alteryx's self-service analytics features, this guide will be the perfect companion for you.

Dec 2017 7h 36m

Learning Alteryx

Learning Alteryx

Alteryx, as a leading data blending and advanced data analytics platform, has taken self-service data analytics to the next level. This book will set you on a self-service data analytics journey that will help you create efficient workflows using Alteryx, without any coding involved. It will empower you and your organization to take well-informed decisions with the help of deeper business insights from the data. You will see how to use the unique features of Alteryx to perform common tasks such as data preparation and blending, and also delve into the more advanced concepts such as performing predictive analytics, before sharing the insights gained with the relevant decision makers. Whether you are a novice with Alteryx or an experienced data analyst keen to explore Alteryx's self-service analytics features, this guide will be the perfect companion for you.

Dec 2017 7h 36m

Learning Alteryx

Learning Alteryx

Alteryx, as a leading data blending and advanced data analytics platform, has taken self-service data analytics to the next level. This book will set you on a self-service data analytics journey that will help you create efficient workflows using Alteryx, without any coding involved. It will empower you and your organization to take well-informed decisions with the help of deeper business insights from the data. You will see how to use the unique features of Alteryx to perform common tasks such as data preparation and blending, and also delve into the more advanced concepts such as performing predictive analytics, before sharing the insights gained with the relevant decision makers. Whether you are a novice with Alteryx or an experienced data analyst keen to explore Alteryx's self-service analytics features, this guide will be the perfect companion for you.

Dec 2017 7h 36m

Learning Alteryx

Learning Alteryx

Alteryx, as a leading data blending and advanced data analytics platform, has taken self-service data analytics to the next level. This book will set you on a self-service data analytics journey that will help you create efficient workflows using Alteryx, without any coding involved. It will empower you and your organization to take well-informed decisions with the help of deeper business insights from the data. You will see how to use the unique features of Alteryx to perform common tasks such as data preparation and blending, and also delve into the more advanced concepts such as performing predictive analytics, before sharing the insights gained with the relevant decision makers. Whether you are a novice with Alteryx or an experienced data analyst keen to explore Alteryx's self-service analytics features, this guide will be the perfect companion for you.

Dec 2017 7h 36m

Learning Alteryx

Learning Alteryx

Alteryx, as a leading data blending and advanced data analytics platform, has taken self-service data analytics to the next level. This book will set you on a self-service data analytics journey that will help you create efficient workflows using Alteryx, without any coding involved. It will empower you and your organization to take well-informed decisions with the help of deeper business insights from the data. You will see how to use the unique features of Alteryx to perform common tasks such as data preparation and blending, and also delve into the more advanced concepts such as performing predictive analytics, before sharing the insights gained with the relevant decision makers. Whether you are a novice with Alteryx or an experienced data analyst keen to explore Alteryx's self-service analytics features, this guide will be the perfect companion for you.

Dec 2017 7h 36m

Learning Alteryx

Learning Alteryx

Alteryx, as a leading data blending and advanced data analytics platform, has taken self-service data analytics to the next level. This book will set you on a self-service data analytics journey that will help you create efficient workflows using Alteryx, without any coding involved. It will empower you and your organization to take well-informed decisions with the help of deeper business insights from the data. You will see how to use the unique features of Alteryx to perform common tasks such as data preparation and blending, and also delve into the more advanced concepts such as performing predictive analytics, before sharing the insights gained with the relevant decision makers. Whether you are a novice with Alteryx or an experienced data analyst keen to explore Alteryx's self-service analytics features, this guide will be the perfect companion for you.

Dec 2017 7h 36m

Learning Alteryx

Learning Alteryx

Alteryx, as a leading data blending and advanced data analytics platform, has taken self-service data analytics to the next level. This book will set you on a self-service data analytics journey that will help you create efficient workflows using Alteryx, without any coding involved. It will empower you and your organization to take well-informed decisions with the help of deeper business insights from the data. You will see how to use the unique features of Alteryx to perform common tasks such as data preparation and blending, and also delve into the more advanced concepts such as performing predictive analytics, before sharing the insights gained with the relevant decision makers. Whether you are a novice with Alteryx or an experienced data analyst keen to explore Alteryx's self-service analytics features, this guide will be the perfect companion for you.

Dec 2017 7h 36m

Learning Alteryx

Learning Alteryx

Alteryx, as a leading data blending and advanced data analytics platform, has taken self-service data analytics to the next level. This book will set you on a self-service data analytics journey that will help you create efficient workflows using Alteryx, without any coding involved. It will empower you and your organization to take well-informed decisions with the help of deeper business insights from the data. You will see how to use the unique features of Alteryx to perform common tasks such as data preparation and blending, and also delve into the more advanced concepts such as performing predictive analytics, before sharing the insights gained with the relevant decision makers. Whether you are a novice with Alteryx or an experienced data analyst keen to explore Alteryx's self-service analytics features, this guide will be the perfect companion for you.

Dec 2017 7h 36m

Data Analysis with IBM SPSS Statistics

Data Analysis with IBM SPSS Statistics

SPSS Statistics is a software package used for logical batched and non-batched statistical analysis. Analytical tools such as SPSS can readily provide even a novice user with an overwhelming amount of information and a broad range of options for analyzing patterns in the data. This book will have a comprehensive coverage of IBM's premier statistics and data analysis tool – IBM SPSS Statistics. It is designed for business professionals who wish to analyze their data. By the end of this book, you will have a firm understanding of the various statistical analysis techniques offered by SPSS Statistics, and be able to master its use for data analysis with ease.

Sep 2017 14h 52m

Data Analysis with IBM SPSS Statistics

Data Analysis with IBM SPSS Statistics

SPSS Statistics is a software package used for logical batched and non-batched statistical analysis. Analytical tools such as SPSS can readily provide even a novice user with an overwhelming amount of information and a broad range of options for analyzing patterns in the data. This book will have a comprehensive coverage of IBM's premier statistics and data analysis tool – IBM SPSS Statistics. It is designed for business professionals who wish to analyze their data. By the end of this book, you will have a firm understanding of the various statistical analysis techniques offered by SPSS Statistics, and be able to master its use for data analysis with ease.

Sep 2017 14h 52m

Personalised recommendations for you

Based on your interests and search pattern

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m