Bioinformatics with Python Cookbook

You're reading from Bioinformatics with Python Cookbook: Learn how to use modern Python bioinformatics libraries and applications to do cutting-edge research in computational biology

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781789344691

Length 360 pages

Edition 2nd Edition

Languages

Python

Tools

Biopython

Concepts

Bioinformatics

Author (1):

Tiago Antao

Table of Contents (12) Chapters

Preface

1. Python and the Surrounding Software Ecology

2. Next-Generation Sequencing

3. Working with Genomes

4. Population Genetics

5. Population Genetics Simulation

6. Phylogenetics

7. Using the Protein Data Bank

8. Bioinformatics Pipelines

9. Python for Big Genomics Datasets

10. Other Topics in Bioinformatics

11. Advanced NGS Processing

Exploring the data with standard statistics

Now that the decision tree has given us a compass, let's explore the data for further insights that might help us filter it better. You can find this content in Chapter11/Exploration.ipynb.

How to do it…

We start, as usual, with the necessary imports:

import gzip
import pickle
import random

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
from pandas.plotting import scatter_matrix

%matplotlib inline

Then we load the data. We will use pandas to navigate it:

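# Load the balanced dataset (a gzip-compressed NumPy array)
with gzip.open('balanced_fit.npy.gz', 'rb') as f:
    fit = np.load(f)
# Recent NumPy versions refuse pickled content unless allow_pickle=True;
# list() keeps the concatenation below valid if an array comes back
with open('ordered_features', 'rb') as f:
    ordered_features = list(np.load(f, allow_pickle=True))
num_features = len(ordered_features)
fit_df = pd.DataFrame(fit, columns=ordered_features + ['pos', 'error'])
num_samples = 80
del fit  # drop the extra reference; we work with the DataFrame from here on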
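The layout of fit_df can be tried out without the recipe's data files. The following is only a sketch under stated assumptions: the annotation names DP, QD and MQ are hypothetical placeholders for the contents of ordered_features, the values are random, and treating pos as a position and error as a 0/1 flag is an assumption based on the column names alone.

```python
import numpy as np
import pandas as pd

# Hypothetical annotation names standing in for the contents of 'ordered_features'
ordered_features = ['DP', 'QD', 'MQ']

# Synthetic stand-in for balanced_fit.npy.gz: one column per annotation,
# followed by a position column and a 0/1 error flag (both assumptions)
rng = np.random.default_rng(42)
num_rows = 1000
features = rng.normal(size=(num_rows, len(ordered_features)))
pos = np.arange(num_rows).reshape(-1, 1)
error = rng.integers(0, 2, size=(num_rows, 1))
fit = np.hstack([features, pos, error])

# Same construction as in the recipe: feature columns first, then pos and error
fit_df = pd.DataFrame(fit, columns=ordered_features + ['pos', 'error'])

# Standard statistics per annotation: count, mean, std, min, quartiles, max
summary = fit_df[ordered_features].describe()
print(summary)
```

Calling describe() on the annotation columns is one quick way to get standard statistics before plotting anything; it is not necessarily the approach the recipe takes next.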

Let's ask pandas to show a histogram of all annotations:

fig,ax = plt.subplots(figsize=(16,9))
fit_df.hist(column=ordered_features, ax=ax)

The following histogram is generated:

Histogram of all annotations for a dataset...
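The same histogram call can be exercised on synthetic data. This is a minimal sketch, not the recipe's output: the four annotation names are hypothetical, the values are random draws, and the bins argument is an addition for illustration.

```python
import matplotlib
matplotlib.use('Agg')  # headless backend; drop this line inside a notebook
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Hypothetical annotation names and random values, for illustration only
feature_names = ['DP', 'QD', 'MQ', 'FS']
rng = np.random.default_rng(0)
demo_df = pd.DataFrame(rng.normal(size=(500, len(feature_names))),
                       columns=feature_names)

fig, ax = plt.subplots(figsize=(16, 9))
# With several columns, pandas clears the figure that owns `ax` and draws
# one subplot per column on it (it typically emits a warning when doing so)
axes = demo_df.hist(column=feature_names, ax=ax, bins=30)
```

Passing a single ax mainly serves to control the figure size here. Calling demo_df.hist(column=feature_names, figsize=(16, 9)) without ax should give an equivalent grid without the warning.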

Authors (1)

Tiago Antao

Tiago Antao is a bioinformatician currently working in the field of genomics. A former computer scientist, Tiago moved into computational biology with an MSc in Bioinformatics from the Faculty of Sciences at the University of Porto (Portugal) and a PhD on the spread of drug-resistant malaria from the Liverpool School of Tropical Medicine (UK). Postdoctoral, Tiago has worked with human datasets at the University of Cambridge (UK) and with mosquito whole genome sequencing data at the University of Oxford (UK), before helping to set up the bioinformatics infrastructure at the University of Montana. He currently works as a data engineer in the biotechnology field in Boston, MA. He is one of the co-authors of Biopython, a major bioinformatics package written in Python.
