Skip to content

Commit

Permalink
test
Browse files Browse the repository at this point in the history
  • Loading branch information
PeninaR committed Jul 5, 2018
1 parent a616858 commit f6ba9ea
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions site/content/howto/spark.markdown
Original file line number Diff line number Diff line change
Expand Up @@ -10,6 +10,8 @@ weight: 1
{{%exurl "Apache Spark""https://spark.apache.org/"%}} is a powerful open-source processing engine built around speed, ease of use, and sophisticated analytics, with APIs in Java, Scala, Python, R, and SQL.
Spark runs programs up to 100x faster than Apache Hadoop MapReduce in memory, or 10x faster on disk.
It can be used to build data applications as a library, or to perform ad-hoc data analysis interactively.


Spark powers a stack of libraries including SQL, DataFrames, and Datasets, MLlib for machine learning, GraphX for graph processing, and Spark Streaming. You can combine these libraries seamlessly in the same application.
As well, Spark runs on a laptop, Apache Hadoop, Apache Mesos, standalone, or in the cloud. It can access diverse data sources including HDFS, Apache Cassandra, Apache HBase, and S3.

Expand Down

0 comments on commit f6ba9ea

Please sign in to comment.