You're reading from Applied Computational Thinking with Python Design algorithmic solutions for complex and challenging real-world problems

Product type Paperback

Published in Nov 2020

Publisher Packt

ISBN-13 9781839219436

Length 420 pages

Edition 1st Edition

Languages

Python

Concepts

Design

Authors (2):

Dayrene Martinez

Sofía De Jesús

View More author details

Table of Contents (21) Chapters

Preface

1. Section 1: Introduction to Computational Thinking

2. Chapter 1: Fundamentals of Computer Science FREE CHAPTER

3. Chapter 2: Elements of Computational Thinking

4. Chapter 3: Understanding Algorithms and Algorithmic Thinking

5. Chapter 4: Understanding Logical Reasoning

6. Chapter 5: Exploring Problem Analysis

7. Chapter 6: Designing Solutions and Solution Processes

8. Chapter 7: Identifying Challenges within Solutions

9. Section 2:Applying Python and Computational Thinking

10. Chapter 8: Introduction to Python

11. Chapter 9: Understanding Input and Output to Design a Solution Algorithm

12. Chapter 10: Control Flow

13. Chapter 11: Using Computational Thinking and Python in Simple Challenges

14. Section 3:Data Processing, Analysis, and Applications Using Computational Thinking and Python

15. Chapter 12: Using Python in Experimental and Data Analysis Problems

16. Chapter 13: Using Classification and Clusters

17. Chapter 14: Using Computational Thinking and Python in Statistical Analysis

18. Chapter 15: Applied Computational Thinking Problems

19. Chapter 16: Advanced Applied Computational Thinking Problems

20. Other Books You May Enjoy

Preprocessing data

Preprocessing data is a technique that transforms raw data into a useable and efficient format. It is, in fact, the most important step in the data mining and machine learning process.

When we are preprocessing data, we are really cleaning it, transforming it, or doing a data reduction. In this section, we will take a look at what these all mean.

Data cleaning

Data cleaning refers to the process of making our dataset more efficient. If we go through data cleaning in really large datasets, we can expedite the algorithm, avoid errors, and get better results. There are two things we deal with when data cleaning:

Missing data: This can be fixed by ignoring the data or manually entering a value for the missing data.
Noisy data: This can be fixed/improved by using binning, regression, or clustering, among other processes.

We're going to look at each of these things in more detail.