From f6ba9ea149ca77fd112ebe0ae7404246ee80079d Mon Sep 17 00:00:00 2001 From: PeninaR Date: Thu, 5 Jul 2018 18:06:14 +0300 Subject: [PATCH] test --- site/content/howto/spark.markdown | 2 ++ 1 file changed, 2 insertions(+) diff --git a/site/content/howto/spark.markdown b/site/content/howto/spark.markdown index 993523e9a..9741f9293 100644 --- a/site/content/howto/spark.markdown +++ b/site/content/howto/spark.markdown @@ -10,6 +10,8 @@ weight: 1 {{%exurl "Apache Spark""https://spark.apache.org/"%}} is a powerful open-source processing engine built around speed, ease of use, and sophisticated analytics, with APIs in Java, Scala, Python, R, and SQL. Spark runs programs up to 100x faster than Apache Hadoop MapReduce in memory, or 10x faster on disk. It can be used to build data applications as a library, or to perform ad-hoc data analysis interactively. + + Spark powers a stack of libraries including SQL, DataFrames, and Datasets, MLlib for machine learning, GraphX for graph processing, and Spark Streaming. You can combine these libraries seamlessly in the same application. As well, Spark runs on a laptop, Apache Hadoop, Apache Mesos, standalone, or in the cloud. It can access diverse data sources including HDFS, Apache Cassandra, Apache HBase, and S3.