In this book, you will find a number of text styles that distinguish between different kinds of information. Here are some examples of these styles and an explanation of their meaning.
Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: "We also appended the magic column row_id, which uniquely identifies each row in the dataset."
A block of code is set as follows:
import org.apache.spark.ml.feature.StopWordsRemover
val stopWords = StopWordsRemover.loadDefaultStopWords("english") ++ Array("ax", "arent", "re")
When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:
val MIN_TOKEN_LENGTH = 3
val toTokens= (minTokenLen: Int, stopWords: Array[String],
Any command-line input or output is written as follows:
tar -xvf spark-2.1.1-bin-hadoop2.6.tgz
export SPARK_HOME="$(pwd)/spark-2.1.1-bin-hadoop2.6"
New terms and important words are shown in bold.
Words that you see on the screen, for example, in menus or dialog boxes, appear in the text like this: "Download the DECLINED LOAN DATA as shown in the following screenshot."