[SPARK-20242][Web UI] Add spark.ui.stopDelay #17551
Conversation
This property can be used to keep the UI running when an application has finished. This can be very useful for debugging.
I would oppose this change. This is what the history server is for — or profiling tools, or the simple debugging mechanisms you allude to. It doesn't belong as yet another flag in Spark.
Can one of the admins verify this patch?
Agree with @srowen, the proposed solution overlaps the key functionality of the history server. Usually we should stop the app and release the resources as soon as the application finishes. This solution unnecessarily blocks the whole stop chain and potentially introduces issues when trying to visit objects that have already been stopped.
I agree with @srowen and @jerryshao; this is what the history server is for.
Our use case involves jobs running in a remote cluster without a Spark master. I agree that the history server is the better way to solve this, but we'd like a solution that doesn't depend on a distributed filesystem. Would you be willing to consider a pull request for https://issues.apache.org/jira/browse/SPARK-19802 (sending events to a remote history server) instead?
It's still running your code, right? Why can't you add a configuration to your own code that tells it to wait some time before shutting down the SparkContext? That would achieve the same goal without changing Spark.
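The suggestion above can be sketched in application code without any Spark change. This is a minimal illustration, assuming a PySpark-style driver; the helper name and the delay value are hypothetical, and `sc` only needs a `stop()` method:

```python
import time

def stop_after_delay(sc, delay_seconds):
    """Keep the driver (and therefore its web UI) alive for
    delay_seconds before stopping the context. In a real driver,
    `sc` would be the SparkContext; here it is anything with stop()."""
    time.sleep(delay_seconds)
    sc.stop()
```

The delay could itself be read from an application-level config or environment variable, which keeps the behavior entirely in user code.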
We're trying to support arbitrary jobs running on the cluster, to make it easy for users to inspect the jobs that they run there. This was a quick way to achieve that, but I agree with the other commenters that this is quite hacky, and that the history server would be a nicer solution. Our problem with the history server right now is that while the current driver-side
@barnardb IIRC it is only in Spark standalone mode that the HistoryServer is embedded into the Master process, for convenience. You can always start a standalone HistoryServer process. Also
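Running a standalone history server involves two pieces: the application writes event logs, and a separate history-server process reads them. A sketch, where the log directory path is an example (any path both sides can reach works):

```shell
# Application side: write event logs via --conf flags
# (or equivalently in spark-defaults.conf).
bin/spark-submit \
  --conf spark.eventLog.enabled=true \
  --conf spark.eventLog.dir=file:///tmp/spark-events \
  ...   # your application jar and arguments

# Separately, start a standalone history server
# pointed at the same directory:
SPARK_HISTORY_OPTS="-Dspark.history.fs.logDirectory=file:///tmp/spark-events" \
  sbin/start-history-server.sh
```

The UI is then served by the history server (port 18080 by default) and survives after the application has stopped, which is the behavior this PR tries to get from the live UI.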
This PR proposes to close stale PRs. Currently, we have 400+ open PRs, including some stale PRs whose JIRA tickets have already been closed and some whose JIRA tickets do not exist (and which do not seem to be minor issues).

// Open PRs whose JIRA tickets have already been closed
Closes apache#11785 Closes apache#13027 Closes apache#13614 Closes apache#13761 Closes apache#15197 Closes apache#14006 Closes apache#12576 Closes apache#15447 Closes apache#13259 Closes apache#15616 Closes apache#14473 Closes apache#16638 Closes apache#16146 Closes apache#17269 Closes apache#17313 Closes apache#17418 Closes apache#17485 Closes apache#17551 Closes apache#17463 Closes apache#17625

// Open PRs whose JIRA tickets do not exist and which are not minor issues
Closes apache#10739 Closes apache#15193 Closes apache#15344 Closes apache#14804 Closes apache#16993 Closes apache#17040 Closes apache#15180 Closes apache#17238

N/A
Author: Takeshi Yamamuro <[email protected]>
Closes apache#17734 from maropu/resolved_pr.
Change-Id: Id2e590aa7283fe5ac01424d30a40df06da6098b5
What changes were proposed in this pull request?
Adds a spark.ui.stopDelay configuration property that can be used to keep the UI running when an application has finished. This is very useful for debugging, especially when the driver application is running remotely.
How was this patch tested?
This patch was tested manually, e.g. by checking the UI after running:
bin/run-example --conf spark.ui.stopDelay=30s SparkPi 100
[screenshot omitted]