References
"MapReduce Implementation of Variational Bayesian Probabilistic Matrix Factorization Algorithm". In: IEEE Conference on Big Data. pp. 145-152. 2013
Dean J. and Ghemawat S. "MapReduce: Simplified Data Processing on Large Clusters". Communications of the ACM 51 (1). pp. 107-113. 2008
https://github.com/jeffreybreen/tutorial-rmr2-airline/blob/master/R/1-wordcount.R
Chowdhury M., Das T., Dave A., Franklin M.J., Ma J., McCauley M., Shenker S., Stoica I., and Zaharia M. "Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing". In: NSDI. 2012
Amazon Elastic Compute Cloud (EC2) User Guide, Kindle e-book by Amazon Web Services, updated April 9, 2014
Spark documentation for AWS at http://spark.apache.org/docs/latest/ec2-scripts.html
AWS documentation for Spark at http://aws.amazon.com/elasticmapreduce/details/spark/
Microsoft Virtual Academy website at http://www.microsoftvirtualacademy.com/training-courses/getting-started-with-microsoft-azure-machine-learning...