Scala, Spark, Visualization example

This example analyze crime data. The goal is to understand which districts of this city are more prone to which crime.

The example load data and manipulate it using spark. The visualization is being done by vegas.

Vegas aims to be the missing MatPlotLib for the Scala and Spark world. Vegas wraps around Vega-Lite but provides syntax more familiar (and type checked) for use within Scala.

The output of this example are:

csv file
html visualization file.

Requerments:

Use for the building & code executing:

jdk 1.8
Scala 2.11.8
sbt 0.13.8

Instructions:

Option 1:

run the following command:

./scripts/run_container.bash

Option 2:

Step 1:

Download this csv file.

Step 2:

locate the uber jar & csv file/s in accessibly location for the spark job. e.g. shared storage (s3, hdfs, NFS)/ on each server in the cluster.

Run the following command (Review and set the script parameters.):

./scripts/run.bash

if you are running it via IDE at local mode add the following JVM parameter: -Dspark.master=local[*] And add the following args: [input file/dir] [output_dir] [app_name] e.g. ./src/main/resources/data/Crimes.csv ./target/output/ crime_analyzer

You can find more visualization example here

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
project		project
scripts		scripts
src/main		src/main
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
build.sbt		build.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scala, Spark, Visualization example

Requerments:

Instructions:

Option 1:

Option 2:

Step 1:

Step 2:

About

Releases

Packages

Languages

asnir/scala_spark_visualization_example

Folders and files

Latest commit

History

Repository files navigation

Scala, Spark, Visualization example

Requerments:

Instructions:

Option 1:

Option 2:

Step 1:

Step 2:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages