Recommended reading Zaharia et al., Apache Spark: A Unified Engine For Big Data Processing, Communications of the ACM, Volume 59 Issue 11, November 2016 Additional explorations Meng et al., MLlib: Machine Learning in Apache Spark, Journal of Machine Learning Research 17, 2016 Apache Spark web page Jacek Laskowski, Mastering Spark 2.3.2, Gitbooks