[SQL] Improve DataFrame API error reporting #4296

Closed
wants to merge 7 commits into from

rxin commented Jan 31, 2015

  1. Throw UnsupportedOperationException if a Column is not computable.
  2. Perform eager analysis on DataFrame so we can catch errors when they happen (not when an action is run).
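
To illustrate item 2, here is a minimal, self-contained sketch of the difference between lazy and eager analysis. It is a toy model and not Spark's implementation: `Plan`, `LazyFrame`, `EagerFrame`, and `AnalysisError` are hypothetical names invented for this example, and "analysis" is reduced to checking that selected column names exist.

```scala
// Toy model only; every name here is hypothetical and none of it is Spark code.
final class AnalysisError(msg: String) extends RuntimeException(msg)

// A "plan" is reduced to the columns that exist and the columns a query selects.
final case class Plan(available: Set[String], selected: Seq[String]) {
  // Fail on the first selected name that does not resolve.
  def analyze(): Unit =
    selected.find(name => !available.contains(name)).foreach { missing =>
      throw new AnalysisError(s"cannot resolve column '$missing'")
    }
}

// Lazy variant: a bad name is only noticed when an action (collect) runs.
final class LazyFrame(plan: Plan) {
  def select(names: String*): LazyFrame = new LazyFrame(plan.copy(selected = names))
  def collect(): Seq[String] = { plan.analyze(); plan.selected }
}

// Eager variant: the constructor analyzes the plan, so select() itself fails.
final class EagerFrame(plan: Plan) {
  plan.analyze()
  def select(names: String*): EagerFrame = new EagerFrame(plan.copy(selected = names))
  def collect(): Seq[String] = plan.selected
}
```

Under these assumptions, `new LazyFrame(base).select("nonExistentName")` succeeds and only the later `collect()` throws, whereas the same `select` on an `EagerFrame` throws at the call site, which is the behavior item 2 is after.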

SparkQA commented Jan 31, 2015

Test build #26456 has started for PR 4296 at commit 265fde7.

  • This patch merges cleanly.

SparkQA commented Jan 31, 2015

Test build #26456 has finished for PR 4296 at commit 265fde7.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • trait Column extends DataFrame with ExpressionApi
    • class ColumnName(name: String) extends IncomputableColumn(name)
    • trait DataFrame extends DataFrameSpecificApi with RDDApi[Row]
    • class GroupedDataFrame protected[sql](df: DataFrameImpl, groupingExprs: Seq[Expression])
    • protected[sql] class QueryExecution(val logical: LogicalPlan)

@AmplabJenkins
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26456/

This patch changes Column from a concrete implementation to a trait, and provides two
concrete implementations: IncomputableColumn and ComputableColumn.
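
A rough sketch of that split, assuming the simplest possible shape: one trait with an action, one implementation backed by data, and one that only carries a name and rejects actions with `UnsupportedOperationException`. The names `ToyColumn`, `ComputableToyColumn`, and `IncomputableToyColumn` are made up for illustration and are not the classes in this patch.

```scala
// Toy model of the computable / incomputable split; not the classes from the patch.
trait ToyColumn {
  def name: String
  // Stand-in for any action that needs real data behind the column.
  def collect(): Seq[Int]
}

// Backed by data, so actions work.
final class ComputableToyColumn(val name: String, data: Seq[Int]) extends ToyColumn {
  def collect(): Seq[Int] = data
}

// Only a name (an unresolved reference), so any action is rejected up front.
final class IncomputableToyColumn(val name: String) extends ToyColumn {
  def collect(): Seq[Int] =
    throw new UnsupportedOperationException(s"column '$name' is not computable")
}
```

In this sketch a column created from just a name can still be passed around and used to build expressions, but calling `collect()` on it fails with a clear exception type instead of an unrelated downstream error.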

    testData.groupBy('key).agg(Map("nonExistentName" -> "sum"))
  }
  intercept[Exception] {
    testData.groupBy("nonExistentName").agg(Map("key" -> "sum"))

Why isn't this (String, String)*?
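
For context, this is roughly what a `(String, String)*` overload could look like next to the `Map`-based form used in the test snippet. It is a hedged sketch on a hypothetical `ToyGrouped` class that only renders expression strings. It does not show the signature this PR ends up with.

```scala
// Hypothetical stand-in for a grouped frame; it only renders "fn(col)" strings.
final class ToyGrouped {
  // Map-based form, matching the call style in the test snippet.
  def agg(exprs: Map[String, String]): Seq[String] =
    exprs.toSeq.map { case (col, fn) => s"$fn($col)" }

  // Varargs form: requiring a first pair rules out an empty call and keeps
  // the two overloads unambiguous.
  def agg(first: (String, String), rest: (String, String)*): Seq[String] =
    (first +: rest).map { case (col, fn) => s"$fn($col)" }
}
```

With the varargs form a caller can write `g.agg("key" -> "sum", "value" -> "max")` without building a `Map`, and the order of the pairs is preserved.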

SparkQA commented Feb 2, 2015

Test build #26487 has started for PR 4296 at commit 17f6bae.

  • This patch merges cleanly.

SparkQA commented Feb 2, 2015

Test build #26487 has finished for PR 4296 at commit 17f6bae.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • trait Column extends DataFrame with ExpressionApi
    • class ColumnName(name: String) extends IncomputableColumn(name)
    • trait DataFrame extends DataFrameSpecificApi with RDDApi[Row]
    • class GroupedDataFrame protected[sql](df: DataFrameImpl, groupingExprs: Seq[Expression])
    • protected[sql] class QueryExecution(val logical: LogicalPlan)

@AmplabJenkins
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26487/

SparkQA commented Feb 2, 2015

Test build #26553 has started for PR 4296 at commit 5afe1ff.

  • This patch merges cleanly.

SparkQA commented Feb 3, 2015

Test build #26553 has finished for PR 4296 at commit 5afe1ff.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26553/

Davies Liu and others added 3 commits February 2, 2015 16:57
Conflicts:
	sql/core/src/main/scala/org/apache/spark/sql/DataFrame.scala

SparkQA commented Feb 3, 2015

Test build #26564 has started for PR 4296 at commit 6527b86.

  • This patch merges cleanly.

SparkQA commented Feb 3, 2015

Test build #26564 has finished for PR 4296 at commit 6527b86.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class LogisticGradient(numClasses: Int) extends Gradient
    • trait Column extends DataFrame with ExpressionApi
    • class ColumnName(name: String) extends IncomputableColumn(name)
    • trait DataFrame extends DataFrameSpecificApi with RDDApi[Row]
    • class GroupedDataFrame protected[sql](df: DataFrameImpl, groupingExprs: Seq[Expression])
    • protected[sql] class QueryExecution(val logical: LogicalPlan)
    • case class HiveScriptIOSchema (
    • val trimed_class = serdeClassName.split("'")(1)

@AmplabJenkins
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26564/

pwendell commented Feb 3, 2015

Jenkins, test this please

SparkQA commented Feb 3, 2015

Test build #26587 has started for PR 4296 at commit 6527b86.

  • This patch merges cleanly.

pwendell commented Feb 3, 2015

@rxin this test failure was spurious. It is safe to merge.

asfgit closed this in 554403f Feb 3, 2015

SparkQA commented Feb 3, 2015

Test build #26587 has finished for PR 4296 at commit 6527b86.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • trait Column extends DataFrame with ExpressionApi
    • class ColumnName(name: String) extends IncomputableColumn(name)
    • trait DataFrame extends DataFrameSpecificApi with RDDApi[Row]
    • class GroupedDataFrame protected[sql](df: DataFrameImpl, groupingExprs: Seq[Expression])
    • protected[sql] class QueryExecution(val logical: LogicalPlan)

@AmplabJenkins
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26587/

5 participants