Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-6236] [core] [wip] caching for blocks over 2GB #8320

Closed
wants to merge 109 commits into from

Conversation

squito
Copy link
Contributor

@squito squito commented Aug 19, 2015

This allows caching of partitions that are greater than 2GB, and also includes error messages for the other cases where we hit the 2GB limit (replication & shuffle).

This should go on top of #5400, posting for now to get other eyes on it. Also it includes even more giant tests, which is even more pressure to turn these off with some mechanism for slow / integration tests that only run periodically.

Conflicts:
	core/src/main/scala/org/apache/spark/storage/BlockManager.scala
squito added 21 commits June 2, 2015 16:10
Conflicts:
	core/src/main/scala/org/apache/spark/storage/BlockManager.scala
	core/src/main/scala/org/apache/spark/storage/BlockObjectWriter.scala
	core/src/main/scala/org/apache/spark/storage/TachyonStore.scala
	core/src/test/scala/org/apache/spark/ShuffleSuite.scala
	streaming/src/main/scala/org/apache/spark/streaming/rdd/WriteAheadLogBackedBlockRDD.scala
	streaming/src/main/scala/org/apache/spark/streaming/receiver/ReceivedBlockHandler.scala
	streaming/src/test/scala/org/apache/spark/streaming/ReceivedBlockHandlerSuite.scala
Conflicts:
	core/src/main/scala/org/apache/spark/util/io/ByteArrayChunkOutputStream.scala
	core/src/test/scala/org/apache/spark/util/io/ByteArrayChunkOutputStreamSuite.scala
	network/common/src/main/java/org/apache/spark/network/buffer/BufferTooLargeException.java
	network/common/src/main/java/org/apache/spark/network/buffer/LargeByteBuffer.java
	network/common/src/main/java/org/apache/spark/network/buffer/LargeByteBufferHelper.java
	network/common/src/main/java/org/apache/spark/network/buffer/WrappedLargeByteBuffer.java
	network/common/src/test/java/org/apache/spark/network/buffer/LargeByteBufferHelperSuite.java
	network/common/src/test/java/org/apache/spark/network/buffer/WrappedLargeByteBufferSuite.java
Conflicts:
	core/src/main/scala/org/apache/spark/storage/TachyonBlockManager.scala
@squito
Copy link
Contributor Author

squito commented Aug 19, 2015

@tgravescs You can try this out for handling caching for over 2GB. I've tried it out some but haven't really stressed it, interested in your feedback.

@SparkQA
Copy link

SparkQA commented Aug 20, 2015

Test build #41283 has finished for PR 8320 at commit 06c8ffa.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • public class LargeByteBufferInputStream extends InputStream
    • public class LargeByteBufferOutputStream extends OutputStream
    • abstract class BlockSizeLimitException(msg: String, cause: BufferTooLargeException)
    • class ReplicationBlockSizeLimitException(cause: BufferTooLargeException)
    • class TachyonBlockSizeLimitException(cause: BufferTooLargeException)
    • class ShuffleBlockSizeLimitException(size: Long)
    • class ShuffleRemoteBlockSizeLimitException(cause: BufferTooLargeException)
    • public class BufferTooLargeException extends IOException
    • public class LargeByteBufferHelper
    • public class WrappedLargeByteBuffer implements LargeByteBuffer

@SparkQA
Copy link

SparkQA commented Aug 20, 2015

Test build #41293 timed out for PR 8320 at commit 200afdc after a configured wait of 175m.

@tgravescs
Copy link
Contributor

Thanks @squito I'll start looking at it and get somethings running. It will probably be next week by the time I have some results.

@SparkQA
Copy link

SparkQA commented Aug 20, 2015

Test build #1677 timed out for PR 8320 at commit 200afdc after a configured wait of 175m.

@rxin
Copy link
Contributor

rxin commented Jun 15, 2016

Thanks for the pull request. I'm going through a list of pull requests to cut them down since the sheer number is breaking some of the tooling we have. Due to lack of activity on this pull request, I'm going to push a commit to close it. Feel free to reopen it or create a new one.

@asfgit asfgit closed this in 1a33f2e Jun 15, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants