You're reading from Big Data Analytics Real time analytics using Apache Spark and Hadoop

Product type Paperback

Published in Sep 2016

Publisher Packt

ISBN-13 9781785884696

Length 326 pages

Edition 1st Edition

Tools

Hadoop

Concepts

Big Data

Author (1):

Venkat Ankam


Table of Contents (12) Chapters

Preface

1. Big Data Analytics at a 10,000-Foot View

2. Getting Started with Apache Hadoop and Apache Spark

3. Deep Dive into Apache Spark

4. Big Data Analytics with Spark SQL, DataFrames, and Datasets

5. Real-Time Analytics with Spark Streaming and Structured Streaming

6. Notebooks and Dataflows with Spark and Hadoop

7. Machine Learning with Spark and Hadoop

8. Building Recommendation Systems with Spark and Mahout

9. Graph Analytics with GraphX

10. Interactive Analytics with SparkR

Index

Conventions

In this book, you will find a number of text styles that distinguish between different kinds of information. Here are some examples of these styles and an explanation of their meaning.

Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: "Spark's default OFF_HEAP (experimental) storage is Tachyon."

Most of the examples are executed in Scala, Python and Mahout shells. Any command-line input is written as follows:

[root@myhost ~]# pyspark --master spark://sparkmasterhostname:7077 --total-executor-cores 4

A block of Python code executed in the PySpark shell is shown as follows:

>>> myList = ["big", "data", "analytics", "hadoop", "spark"]
>>> myRDD = sc.parallelize(myList)
>>> myRDD.getNumPartitions()

A block of code written in a Python application is shown as follows:

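from pyspark import SparkConf, SparkContext
# Configure the application: cluster master URL, application name, and executor memory.
conf = (SparkConf()
        .setMaster("spark://masterhostname:7077")
        .setAppName("My Analytical Application")
        .set("spark.executor.memory", "2g"))
# Create the SparkContext from the configuration above.
sc = SparkContext(conf=conf)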

New terms and important words are shown in bold. Words that you see on the screen, for example, in menus or dialog boxes, appear in the text like this: "In the case of VMware Player, click on Open a Virtual Machine, and point to the directory where you have extracted the VM."

Note

Warnings or important notes appear in a box like this.

Tip

Tips and tricks appear like this.
