-
Notifications
You must be signed in to change notification settings - Fork 28.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-7997][Core]Remove Akka from Spark Core and Streaming #10854
Conversation
Test build #49828 has finished for PR 10854 at commit
|
retest this please |
Looks weird. Not sure why the following errors didn't happen in #10744
|
Test build #49833 has finished for PR 10854 at commit
|
LGTM |
Test build #49838 has finished for PR 10854 at commit
|
retest this please |
Test build #49860 has finished for PR 10854 at commit
|
Test build #2433 has started for PR 10854 at commit |
@@ -175,7 +175,7 @@ private[netty] class Dispatcher(nettyEnv: NettyRpcEnv) extends Logging { | |||
endpoints.keySet().asScala.foreach(unregisterRpcEndpoint) | |||
// Enqueue a message that tells the message loops to stop. | |||
receivers.offer(PoisonPill) | |||
threadpool.shutdown() | |||
threadpool.shutdownNow() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just found here didn't interrupt threads previously..
It doesn't need since PoisonPill
has been sent. Found an issue in FlumePollingReceiver that could block the message loop.
Looks a netty bug. I just set the netty3 version to 3.8.0.Final and then passed the flume tests locally. Since we have been used 3.8.0.Final for a long time, let's keep it rather than downgrading it. |
Test build #49908 has finished for PR 10854 at commit
|
Test build #2437 has started for PR 10854 at commit |
Test build #49911 has finished for PR 10854 at commit
|
retest this please |
Test build #49921 has finished for PR 10854 at commit
|
I'm going to merge this. Not sure why Jenkins reported timeout. I looked at the Jenkins logs and tests all passed. |
`rpcEnv.awaitTermination()` was not added in #10854 because some Streaming Python tests hung forever. This patch fixed the hung issue and added rpcEnv.awaitTermination() back to SparkEnv. Previously, Streaming Kafka Python tests shutdowns the zookeeper server before stopping StreamingContext. Then when stopping StreamingContext, KafkaReceiver may be hung due to https://issues.apache.org/jira/browse/KAFKA-601, hence, some thread of RpcEnv's Dispatcher cannot exit and rpcEnv.awaitTermination is hung.The patch just changed the shutdown order to fix it. Author: Shixiong Zhu <[email protected]> Closes #11031 from zsxwing/awaitTermination.
spark.akka.frameSize
tospark.rpc.message.maxSize
. I think it's still worth to keep this config because usingDirectTaskResult
orIndirectTaskResult
depends on it.