SPARK-3794 [CORE] Building spark core fails due to inadvertent dependency on Commons IO #2662
Conversation
QA tests have started for PR 2662 at commit
QA tests have finished for PR 2662 at commit
Test PASSed.
val cutoffTimeInMillis = (currentTimeMillis - (cutoff * 1000))
val newFiles = files.filter { _.lastModified > cutoffTimeInMillis }
newFiles.nonEmpty
throw new IllegalArgumentException("$dir is not a directory!")
this is not string interpolated (missing s)
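To illustrate the reviewer's point: in Scala, `$dir` inside a plain string literal is just four characters, not a substitution. Only a string prefixed with the `s` interpolator substitutes the variable. A minimal sketch (the path used here is made up for illustration):

```scala
val dir = new java.io.File("/tmp/not-a-dir")

// Without the s prefix, "$dir" is literal text, not interpolation:
val wrong = "$dir is not a directory!"

// With the s interpolator, $dir is replaced by dir.toString:
val right = s"$dir is not a directory!"

assert(wrong == "$dir is not a directory!")
assert(right == "/tmp/not-a-dir is not a directory!")
```

So the exception message in the diff above would print the literal text `$dir` rather than the offending path.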
Ack, sorry, look at what happens when I 'improve' a line of code. Anyone feel free to zap it or I'm about to open a related PR anyway that can fix it.
Sorry about that. I think Jenkins should be catching these kinds of build failures, though. Jenkins should attempt to build the project against multiple versions of Hadoop, since contributors may be used to using things like FileUtils and other libraries that can have incompatibility issues. I've opened https://issues.apache.org/jira/browse/SPARK-3819 to consider updating the Jenkins build process. Feel free to discuss there whether such measures are necessary.
@mccheah I agree about Jenkins catching these, but at the same time it's sort of sketchy to rely on transitive dependencies of Hadoop for exactly that reason. commons-io is not an explicit dependency of Spark, so it should be avoided.
Fair enough. I guess I didn't actually check Spark's explicit dependencies before choosing the library, so when it just magically appeared in Eclipse autocomplete I assumed it would be okay to use. Certainly my fault. The bottom line is that we could be more explicit about this. Catching it in the build would certainly be explicit; perhaps something in the documentation as well?
Remove references to Commons IO FileUtils and replace with pure Java version, which doesn't need to traverse the whole directory tree first.
I think this method could be refined further if it would be all right to rename it and its arguments and break it into two methods. I'm starting with a simple recursive version.
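To make the approach concrete, here is a hedged sketch of what such a recursive check might look like. The names `cutoffTimeInMillis` and the exception message come from the diff above; the method name `containsNewerFiles` and the overall shape are illustrative assumptions, not the PR's actual code. Unlike the Commons IO `FileUtils` approach, it short-circuits on the first qualifying file rather than materializing the whole directory tree:

```scala
import java.io.File

// Illustrative sketch (not the PR's actual code): recursively check
// whether dir contains any file modified after cutoffTimeInMillis.
// `exists` short-circuits, so traversal stops at the first hit instead
// of listing the entire tree up front.
def containsNewerFiles(dir: File, cutoffTimeInMillis: Long): Boolean = {
  if (!dir.isDirectory) {
    throw new IllegalArgumentException(s"$dir is not a directory!")
  }
  // listFiles returns null on I/O error; treat that as an empty directory
  val children = Option(dir.listFiles).getOrElse(Array.empty[File])
  children.exists { f =>
    if (f.isDirectory) containsNewerFiles(f, cutoffTimeInMillis)
    else f.lastModified > cutoffTimeInMillis
  }
}
```

Splitting the validation (`isDirectory` check) from the recursive scan would give the two-method shape suggested above.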