Introducing IPython and the Jupyter Notebook
The Jupyter Notebook is a web-based interactive environment that combines code, rich text, images, videos, animations, mathematical equations, plots, maps, interactive figures and widgets, and graphical user interfaces, into a single document. This tool is an ideal gateway to high-performance numerical computing and data science in Python, R, Julia, or other languages. In this book, we will mostly use the Python language, although there are recipes introducing R and Julia.
In this recipe, we give an introduction to IPython and the Jupyter Notebook.
Getting ready
This chapter's introduction gives the instructions to install the Anaconda distribution, which comes with Jupyter and almost all Python libraries we will be using in this book.
Once Anaconda is installed, download the code from the book's website and open a Terminal in that folder. In the Terminal, type jupyter notebook
. Your default web browser should open automatically and load the address http://localhost:8888
(a server that runs on your computer). You're ready to get started!
How to do it...
- Let's create a new Jupyter notebook using an IPython kernel. We type the following command in a cell, and press Shift + Enter to evaluate it:
>>> print("Hello world!") Hello world!
A notebook contains a linear succession of cells and output areas. A cell contains Python code, in one or multiple lines. The output of the code is shown in the corresponding output area.
Tip
In this book, the prompt
>>>
means that you need to type everything that starts after it. The>>>
characters themselves should not be typed. - Now, we do a simple arithmetic operation:
>>> 2 + 2 4
The result of the operation is shown in the output area. More precisely, the output area not only displays text that is printed by any command in the cell, but it also displays a text representation of the last returned object. Here, the last returned object is the result of
2 + 2
, that is,4
. - In the next cell, we can recover the value of the last returned object with the
_
(underscore) special variable. In practice, it might be more convenient to assign objects to named variables such as inmyresult = 2 + 2
.>>> _ * 3 12
- IPython not only accepts Python code, but also shell commands. These commands are provided by the operating system. We first type
!
in a cell before typing the shell command. Here, assuming a Linux or macOS system, we get the list of all the notebooks in the current directory:>>> !ls my_notebook.ipynb
On Windows, one may replace
ls
bydir
. - IPython comes with a library of magic commands. These commands are convenient shortcuts to common actions. They all start with
%
(the percent character). We can get the list of all magic commands with%lsmagic
:>>> %lsmagic Available line magics: %alias %alias_magic %autocall %automagic %autosave %bookmark %cat %cd %clear %colors %config %connect_info %cp %debug %dhist %dirs %doctest_mode %ed %edit %env %gui %hist %history %killbgscripts %ldir %less %lf %lk %ll %load %load_ext %loadpy %logoff %logon %logstart %logstate %logstop %ls %lsmagic %lx %macro %magic %man %matplotlib %mkdir %more %mv %notebook %page %pastebin %pdb %pdef %pdoc %pfile %pinfo %pinfo2 %popd %pprint %precision %profile %prun %psearch %psource %pushd %pwd %pycat %pylab %qtconsole %quickref %recall %rehashx %reload_ext %rep %rerun %reset %reset_selective %rm %rmdir %run %save %sc %set_env %store %sx %system %tb %time %timeit %unalias %unload_ext %who %who_ls %whos %xdel %xmode Available cell magics: %%! %%HTML %%SVG %%bash %%capture %%debug %%file %%html %%javascript %%js %%latex %%markdown %%perl %%prun %%pypy %%python %%python2 %%python3 %%ruby %%script %%sh %%svg %%sx %%system %%time %%timeit %%writefile Automagic is ON, % prefix IS NOT needed for line magics.
Cell magics have a
%%
prefix; they target entire code cells. - For example, the
%%writefile
cell magic lets us create a text file. This magic command accepts a filename as an argument. All the remaining lines in the cell are directly written to this text file. Here, we create atest.txt
file and writeHello world!
into it:>>> %%writefile test.txt Hello world! Writing test.txt >>> # Let's check what this file contains. with open('test.txt', 'r') as f: print(f.read()) Hello world!
- As we can see in the output of
%lsmagic
, there are many magic commands in IPython. We can find more information about any command by adding?
after it. For example, to get some help about the%run
magic command, we type%run?
as shown here:>>> %run?
The pager (a text area at the bottom of the screen) opens and shows the help of the
%run
magic command. - We covered the basics of IPython and the Notebook. Let's now turn to the rich display and interactive features of the Notebook. Until now, we have only created code cells (containing code). Jupyter supports other types of cells. In the Notebook toolbar, there is a drop-down menu to select the cell's type. The most common cell type after the code cell is the Markdown cell.
Markdown cells contain rich text formatted with Markdown, a popular plain text- formatting syntax. This format supports normal text, headers, bold, italics, hypertext links, images, mathematical equations in LaTeX (a typesetting system adapted to mathematics), code, HTML elements, and other features, as shown here:
Running a Markdown cell (by pressing Shift + Enter, for example) displays the output, as shown in the bottom panel of the preceding screenshot.
By combining code cells and Markdown cells, we create a standalone interactive document that combines computations (code), text, and graphics.
- The Jupyter Notebook also comes with a sophisticated display system that lets us insert rich web elements in the Notebook. Here, we show how to add HTML, Scalable Vector Graphics (SVG), and even YouTube videos in a notebook. First, we need to import some classes:
>>> from IPython.display import HTML, SVG, YouTubeVideo
- We create an HTML table dynamically with Python, and we display it in the (HTML-based) notebook.
>>> HTML(''' <table style="border: 2px solid black;"> ''' + ''.join(['<tr>' + ''.join([f'<td>{row},{col}</td>' for col in range(5)]) + '</tr>' for row in range(5)]) + ''' </table> ''')
- Similarly, we create an SVG image dynamically:
>>> SVG('''<svg width="600" height="80">''' + ''.join([f'''<circle cx="{(30 + 3*i) * (10 - i)}" cy="30" r="{3. * float(i)}" fill="red" stroke-width="2" stroke="black"> </circle>''' for i in range(10)]) + '''</svg>''')
- We display a YouTube video by giving its identifier to
YoutubeVideo
:>>> YouTubeVideo('VQBZ2MqWBZI')
There's more...
Notebooks are saved as structured text files (JSON format), which makes them easily shareable. Here are the contents of a simple notebook:
{ "cells": [ { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Hello world!\n" ] } ], "source": [ "print(\"Hello world!\")" ] } ], "metadata": {}, "nbformat": 4, "nbformat_minor": 2 }
Jupyter comes with a special tool, nbconvert, which converts notebooks to other formats such as HTML and PDF (https://nbconvert.readthedocs.io/en/stable/).
Another online tool, nbviewer (http://nbviewer.jupyter.org), allows us to render a publicly-available notebook directly in the browser.
We will cover many of these possibilities in subsequent chapters, notably in Chapter 3, Mastering the Jupyter Notebook.
There are other implementations of Jupyter Notebook frontends that offer different ways of interacting with the same notebook documents. JupyterLab, an IDE for interactive computing and data science, is the future of the Jupyter Notebook. It is introduced in Chapter 3, Mastering the Jupyter Notebook. nteract is a desktop application that lets the user open a notebook file by double-clicking on it, without using the Terminal and using a web browser. Hydrogen is a plugin of the Atom text editor that provides rich interactive capabilities when opening notebook files. Juno is a Jupyter Notebook client for iPad.
Here are a few references about the Notebook:
- Installing Jupyter, available at http://jupyter.org/install.html
- Documentation of the Notebook available at http://jupyter.readthedocs.io/en/latest/index.html
- Security in Jupyter notebooks, at https://jupyter-notebook.readthedocs.io/en/stable/security.html#Security-in-notebook-documents
- User-curated gallery of interesting notebooks available at https://github.com/jupyter/jupyter/wiki/A-gallery-of-interesting-Jupyter-Notebooks
- JupyterLab at https://github.com/jupyterlab/jupyterlab
- nteract at https://nteract.io
- Hydrogen at
https:/
- Juno at https://juno.sh/
See also
- The Getting started with exploratory data analysis in the Jupyter Notebook recipe
- The Introducing JupyterLab recipe in Chapter 3, Mastering the Jupyter Notebook