[SPARK-7842] [SQL] Makes task committing/aborting in InsertIntoHadoopFsRelation more robust #6378

Closed

liancheng
Contributor

When committing or aborting a write task issued in InsertIntoHadoopFsRelation, an exception thrown from OutputWriter.close() interrupts the commit/abort process and leaves stale output behind (e.g., the _temporary directory created by FileOutputCommitter).

This PR makes these two processes more robust by catching potential exceptions and falling back to normal task commitment/abort.
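
The pattern described above can be sketched roughly as follows. This is a minimal illustration and not the code in this PR: `Writer`, `Committer`, `RecordingCommitter`, and `RobustTaskCommit` are hypothetical stand-ins, not Spark or Hadoop classes. The assumption is that a failing `close()` should not stop the committer from getting a chance to abort and clean up.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for an output writer; not the Spark OutputWriter API.
interface Writer {
    void close() throws Exception;
}

// Hypothetical stand-in for a task-level output committer.
interface Committer {
    void commitTask() throws Exception;
    void abortTask() throws Exception;
}

// Records which committer calls happened, for demonstration only.
final class RecordingCommitter implements Committer {
    final List<String> events = new ArrayList<>();
    @Override public void commitTask() { events.add("commit"); }
    @Override public void abortTask() { events.add("abort"); }
}

final class RobustTaskCommit {
    // Close the writer, then commit. On any failure, abort the task so the
    // committer can clean up, and rethrow with the original cause attached.
    static void commitTask(Writer writer, Committer committer) {
        try {
            writer.close();
            committer.commitTask();
        } catch (Exception cause) {
            try {
                committer.abortTask();
            } catch (Exception abortFailure) {
                // Keep the original failure as the primary cause.
                cause.addSuppressed(abortFailure);
            }
            throw new RuntimeException("Failed to commit task", cause);
        }
    }

    // Try to close the writer, but always run the committer's abort,
    // even if close() throws.
    static void abortTask(Writer writer, Committer committer) throws Exception {
        try {
            writer.close();
        } finally {
            committer.abortTask();
        }
    }

    public static void main(String[] args) {
        RecordingCommitter committer = new RecordingCommitter();
        Writer failing = () -> { throw new java.io.IOException("close failed"); };
        try {
            commitTask(failing, committer);
        } catch (RuntimeException e) {
            System.out.println("commit failed: " + e.getCause().getMessage());
        }
        System.out.println("committer calls: " + committer.events);
    }
}
```

In this sketch a failed `close()` during commit turns into an abort, and a failed `close()` during abort still reaches the committer's own abort through `finally`, so the committer always gets a chance to clean up.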

@liancheng
Contributor Author

@yhuai This is also a partial solution for SPARK-7838.

(Sorry for the confusing PR branch name. I was going to open this PR against SPARK-7838, but later decided to open a new JIRA for it.)

@SparkQA

SparkQA commented May 23, 2015

Test build #33410 has finished for PR 6378 at commit f18253a.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@liancheng
Contributor Author

Merging to master and branch-1.4.

asfgit pushed a commit that referenced this pull request May 25, 2015
…FsRelation more robust

When committing or aborting a write task issued in `InsertIntoHadoopFsRelation`, an exception thrown from `OutputWriter.close()` interrupts the commit/abort process and leaves stale output behind (e.g., the `_temporary` directory created by `FileOutputCommitter`).

This PR makes these two processes more robust by catching potential exceptions and falling back to normal task commitment/abort.

Author: Cheng Lian <[email protected]>

Closes #6378 from liancheng/spark-7838 and squashes the following commits:

f18253a [Cheng Lian] Makes task committing/aborting in InsertIntoHadoopFsRelation more robust

(cherry picked from commit 8af1bf1)
Signed-off-by: Cheng Lian <[email protected]>
@asfgit asfgit closed this in 8af1bf1 May 25, 2015
jeanlyn pushed a commit to jeanlyn/spark that referenced this pull request May 28, 2015
jeanlyn pushed a commit to jeanlyn/spark that referenced this pull request Jun 12, 2015
nemccarthy pushed a commit to nemccarthy/spark that referenced this pull request Jun 19, 2015