Directory organization and convention
One handy convention is to download and install software in the /opt directory and to keep a generic soft link to Spark that points to the current version. For example, /opt/spark points to /opt/spark-2.0.0 after you run the following command from /opt (the -n option makes ln replace an existing spark link instead of following it):
sudo ln -f -s -n spark-2.0.0 spark
Later, if you upgrade, say to Spark 2.1, you can change the soft link.
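As a minimal sketch of that upgrade step, the following script repoints the link from one version to the next. It uses a scratch directory (the hypothetical PREFIX variable) in place of /opt and empty directories in place of real distributions, so it can be tried without root access; the version number 2.1.0 is only an illustration.

```shell
#!/bin/sh
# Sketch only: PREFIX is a scratch directory standing in for /opt (an assumption for this demo).
PREFIX="$(mktemp -d)"

# Empty directories standing in for two unpacked Spark distributions.
mkdir "$PREFIX/spark-2.0.0" "$PREFIX/spark-2.1.0"

cd "$PREFIX" || exit 1

# Initial setup: spark -> spark-2.0.0
ln -s -f -n spark-2.0.0 spark
readlink spark    # prints spark-2.0.0

# Upgrade: repoint the same link. Without -n, ln would follow the existing
# link and create the new link inside spark-2.0.0 instead of replacing it.
ln -s -f -n spark-2.1.0 spark
readlink spark    # prints spark-2.1.0
```

On a real system, the same two ln commands would be run with sudo from /opt, and anything that refers to /opt/spark keeps working without changes.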
However, remember to copy any configuration changes and old logs when you change to a new distribution. A more flexible way is to move the configuration directory to /etc/opt/spark and the log files to /var/log/spark/. In this way, these files stay independent of distribution updates. More details are available at https://spark.apache.org/docs/latest/configuration.html#overriding-configuration-directory and https://spark.apache.org/docs/latest/configuration.html#configuring-logging.
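One way to point Spark at these locations is through environment variables. The sketch below assumes the two paths suggested above and that the directories already exist and are readable and writable by the user running Spark. SPARK_CONF_DIR is the variable described in the configuration documentation linked above; SPARK_LOG_DIR is read by the daemon launch scripts under sbin/, so it mainly affects standalone master and worker logs.

```shell
# Sketch only: the paths are the ones suggested in the text, not Spark defaults.

# Directory from which Spark reads spark-defaults.conf, spark-env.sh, and the
# log4j configuration, instead of the conf/ directory inside the distribution.
export SPARK_CONF_DIR=/etc/opt/spark

# Directory where the sbin/ daemon scripts write their log files
# (they fall back to the logs/ directory inside the distribution when unset).
export SPARK_LOG_DIR=/var/log/spark
```

SPARK_CONF_DIR has to be set in the environment that launches Spark (for example, a shell profile), because spark-env.sh itself is looked up in that directory; SPARK_LOG_DIR can then be set inside spark-env.sh.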