You're reading from R Bioinformatics Cookbook Utilize R packages for bioinformatics, genomics, data science, and machine learning

Product type Paperback

Published in Oct 2023

Publisher Packt

ISBN-13 9781837634279

Length 396 pages

Edition 2nd Edition

Languages

Tools

ChatGPT

Concepts

Bioinformatics

Author (1):

Dan MacLean

View More author details

Table of Contents (16) Chapters

Preface

1. Chapter 1: Setting Up Your R Bioinformatics Working Environment

2. Chapter 2: Loading, Tidying, and Cleaning Data in the tidyverse FREE CHAPTER

3. Chapter 3: ggplot2 and Extensions for Publication Quality Plots

4. Chapter 4: Using Quarto to Make Data-Rich Reports, Presentations, and Websites

5. Chapter 5: Easily Performing Statistical Tests Using Linear Models

6. Chapter 6: Performing Quantitative RNA-seq

7. Chapter 7: Finding Genetic Variants with HTS Data

8. Chapter 8: Searching Gene and Protein Sequences for Domains and Motifs

9. Chapter 9: Phylogenetic Analysis and Visualization

10. Chapter 10: Analyzing Gene Annotations

11. Chapter 11: Machine Learning with mlr3

12. Chapter 12: Functional Programming with purrr and base R

13. Chapter 13: Turbo-Charging Development in R with ChatGPT

14. Index

Why subscribe?

15. Other Books You May Enjoy

Clustering with k-means and hierarchical clustering

It is common in bioinformatics to want to classify things into groups without first knowing what or how many groups there may be. This process is usually known as clustering and is a type of unsupervised ML. This is commonly used in genomics experiments, particularly RNAseq and related count-based technologies. In this recipe, we’ll start with a large gene expression dataset with around 150 samples. We’ll learn how to estimate how many groups of samples there are and apply a method to cluster them based on the reduction of dimensionality with PCA followed by a k-means cluster.

Getting ready

We’ll need the factoextra, RColorBrewer, and Bioconductor biobase libraries. We’ll also use the modencodefly_eset object from the rbioinfcookbook package.

How to do it…

We can cluster with the following code

Load the data and run a PCA:

library(factoextra)library(Biobase)library(rbioinfcookbook...

The rest of the chapter is locked

You're reading from R Bioinformatics Cookbook Utilize R packages for bioinformatics, genomics, data science, and machine learning

Table of Contents (16) Chapters

Clustering with k-means and hierarchical clustering

Getting ready

How to do it…

Authors (2)

Personalised recommendations for you

You're reading from R Bioinformatics Cookbook Utilize R packages for bioinformatics, genomics, data science, and machine learning

Table of Contents (16) Chapters

Clustering with k-means and hierarchical clustering

Getting ready

How to do it…

Unlock this book and the full library FREE for 7 days

Authors (2)

Personalised recommendations for you