Bioinformatics with Python Cookbook

You're reading from Bioinformatics with Python Cookbook: Use modern Python libraries and applications to solve real-world computational biology problems

Product type Paperback

Published in Sep 2022

Publisher Packt

ISBN-13 9781803236421

Length 360 pages

Edition 3rd Edition

Languages

Python

Tools

Dask

Concepts

Bioinformatics

Author (1):

Tiago Antao

Table of Contents (15) Chapters

Preface

1. Chapter 1: Python and the Surrounding Software Ecology

2. Chapter 2: Getting to Know NumPy, pandas, Arrow, and Matplotlib

3. Chapter 3: Next-Generation Sequencing

4. Chapter 4: Advanced NGS Data Processing

5. Chapter 5: Working with Genomes

6. Chapter 6: Population Genetics

7. Chapter 7: Phylogenetics

8. Chapter 8: Using the Protein Data Bank

9. Chapter 9: Bioinformatics Pipelines

10. Chapter 10: Machine Learning for Bioinformatics

11. Chapter 11: Parallel Processing with Dask and Zarr

12. Chapter 12: Functional Programming for Bioinformatics

13. Index

14. Other Books You May Enjoy

Introducing scikit-learn with a PCA example

Principal component analysis (PCA) is a statistical procedure that reduces the dimensionality of a dataset by transforming a set of possibly correlated variables into a smaller set of linearly uncorrelated variables, the principal components. In Chapter 6, we saw a PCA implementation based on an external application. In this recipe, we will implement the same PCA for population genetics, but with the scikit-learn library. Scikit-learn is one of the fundamental Python libraries for machine learning, and this recipe serves as an introduction to it. PCA is a form of unsupervised machine learning: we don’t provide information about the class of each sample. We will discuss supervised techniques in the other recipes of this chapter.

As a reminder, we will compute PCA for 11 human populations from the HapMap project.
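To make the idea concrete, here is a minimal sketch of PCA with scikit-learn on a small synthetic genotype matrix. It is not the recipe's code: the data, the two simulated populations, and the helper `simulate_genotypes` are assumptions made up for illustration. The only library features relied on are `sklearn.decomposition.PCA` and NumPy's random generator.

```python
import numpy as np
from sklearn.decomposition import PCA


def simulate_genotypes(n_samples, allele_freqs, rng):
    """Hypothetical helper: draw 0/1/2 allele counts per sample and SNP."""
    return rng.binomial(2, allele_freqs, size=(n_samples, len(allele_freqs)))


rng = np.random.default_rng(42)
n_snps = 200

# Two made-up populations with different allele frequencies at each SNP.
freqs_a = rng.uniform(0.05, 0.95, n_snps)
freqs_b = rng.uniform(0.05, 0.95, n_snps)

# Rows are samples, columns are SNPs.
genotypes = np.vstack([
    simulate_genotypes(30, freqs_a, rng),
    simulate_genotypes(30, freqs_b, rng),
])

# Unsupervised: PCA sees only the genotype matrix, never the population labels.
pca = PCA(n_components=2)
components = pca.fit_transform(genotypes)

print(components.shape)               # one row per sample, one column per component
print(pca.explained_variance_ratio_)  # share of variance captured by each component
```

Plotting the two columns of `components` against each other, colored by population, is the usual way to inspect the result. With real HapMap data, the matrix would come from the PLINK file rather than a simulation.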

Getting ready

You will need to run the first recipe from Chapter 6 in order to generate the hapmap10_auto_noofs_ld_12 PLINK file (with alleles recorded as 1 and 2). From a population...
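As an illustration of what "alleles recorded as 1 and 2" means in practice, the sketch below converts one line of a PLINK text PED file into per-SNP counts of allele 2. It assumes the standard PED layout (six leading columns, then two allele columns per SNP, with 0 marking a missing call). The sample line and the function name `ped_line_to_counts` are made up, and this is not necessarily how the recipe loads the data.

```python
def ped_line_to_counts(line):
    """Hypothetical helper: turn a 1/2-coded PED line into allele-2 counts."""
    fields = line.split()
    family_id, sample_id = fields[0], fields[1]
    # Columns 3 to 6 (parents, sex, phenotype) are skipped; genotypes start at index 6.
    alleles = fields[6:]
    if len(alleles) % 2 != 0:
        raise ValueError("expected two allele columns per SNP")

    counts = []
    for i in range(0, len(alleles), 2):
        pair = alleles[i:i + 2]
        if "0" in pair:
            counts.append(None)             # missing genotype
        else:
            counts.append(pair.count("2"))  # 0, 1, or 2 copies of allele 2
    return family_id, sample_id, counts


# Made-up line: four SNPs coded 1 1, 1 2, 2 2, and a missing call 0 0.
example = "FAM1 IND1 0 0 1 -9 1 1 1 2 2 2 0 0"
family_id, sample_id, counts = ped_line_to_counts(example)
print(sample_id, counts)  # IND1 [0, 1, 2, None]
```

A count matrix built this way, with one row per individual, has the samples-by-features shape that scikit-learn estimators expect, although missing values would need to be handled before fitting.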

Authors (1)

Tiago Antao

Tiago Antao is a bioinformatician currently working in the field of genomics. A former computer scientist, Tiago moved into computational biology with an MSc in Bioinformatics from the Faculty of Sciences at the University of Porto (Portugal) and a PhD on the spread of drug-resistant malaria from the Liverpool School of Tropical Medicine (UK). As a postdoctoral researcher, Tiago worked with human datasets at the University of Cambridge (UK) and with mosquito whole genome sequencing data at the University of Oxford (UK), before helping to set up the bioinformatics infrastructure at the University of Montana. He currently works as a data engineer in the biotechnology field in Boston, MA. He is one of the co-authors of Biopython, a major bioinformatics package written in Python.
