Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Hands-On Data Science with SQL Server 2017 Perform end-to-end data analysis to gain efficient data insight

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781788996341

Length 506 pages

Edition 1st Edition

Languages

Python

Tools

SQL Server

Concepts

Data Analysis

Authors (2):

Vladimír Mužný

Marek Chmel

View More author details

Table of Contents (14) Chapters

Preface

1. Data Science Overview FREE CHAPTER

2. SQL Server 2017 as a Data Science Platform

3. Data Sources for Analytics

4. Data Transforming and Cleaning with T-SQL

5. Data Exploration and Statistics with T-SQL

6. Custom Aggregations on SQL Server

7. Data Visualization

8. Data Transformations with Other Tools

9. Predictive Model Training and Evaluation

10. Making Predictions

11. Getting It All Together - A Real-World Example

12. Next Steps with Data Science and SQL

13. Other Books You May Enjoy

Leave a review - let other readers know what you think

Categorization, missing values, and normalization

For correct and accurate predictions calculated with machine learning models, the incoming data should be presented in the ideal format. The ideal format means that all values are present in a dataset, numerical data is used in numerical features and not categories or labels, or the distribution of features is even (Gaussian). However, many presumptions are not always true in the real world. For this reason, after basic transformations, such as joining or merging data, are done, we should undertake statistical research that shows the real format of data. Based on statistical research, we will know the difference between the ideal and real format of incoming data. This section will describe techniques used to transform data from its real format to its ideal, comparable, and meaningful format.

...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (2)

Vladimír Mužný

Vladimír Mužný has been a freelance developer and consultant since 1997. He has been a Data Platform MVP since 2017, and he has earned certifications such as MCSE: Data Management and Analytics and MCT. His first steps with SQL Server were done on version 6.5, and from that time on, he has worked with all following versions of SQL Server. Now Vladimir teaches Microsoft database courses, participates in SQL Server adoption at various companies, and collaborates on projects for production tracking and migrations.

See other products by Vladimír Mužný

Marek Chmel

Marek Chmel is a senior CSA at Microsoft, specializing in data and AI. He is a speaker and trainer with more than 15 years' experience. He has been a Data Platform MVP since 2012. He has earned numerous certifications, including Azure Architect, Data Engineer and Scientist Associate, Certified Ethical Hacker, and several eLearnSecurity certifications. Marek earned his master's degree in business and informatics from Nottingham Trent University. He started his career as a trainer for Microsoft Server courses and later worked as SharePoint team lead and principal database administrator. He has authored two books, Hands-On Data Science with SQL Server 2017 and SQL Server 2017 Administrator's Guide.

See other products by Marek Chmel