Skip to content

PavelPenkov/sbt-spark-submit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

sbt-spark-submit

Conviniently submit Spark jobs. This plugin makes it easier to submit Spark jobs to remote servers, it requires ssh and scp command line tools installed.

Usage

Add addSbtPlugin("me.penkov" % "sbt-spark-submit" % "0.1.18") to project/spark-submit.sbt.

Configure Spark submit settings in build.sbt

mainClass in sparkSubmit := Some("MyClass")

sparkMaster in sparkSubmit := Yarn(
  numExecutors = Some(100),
  executorCores = Some(1),
  queue = Some("production")
)

sparkConf := Map("spark.dynamicAllocation.enabled" -> "false")

sparkArgs := Seq("--date", "2019-12-12")

sshHost in sparkSubmit := Some("somehost")
sshUser in sparkSubmit := "ppenkov" // Default is current user

Run sbt sparkSubmit. The plugin will build a thin JAR (fat JARs not supported yet), copy it to remote host and run spark-submit with appropriate arguments.

About

sbt plug-in to conveniently submit Spark jobs

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages