-
Notifications
You must be signed in to change notification settings - Fork 28.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-4798][SQL] A new set of Parquet testing API and test suites #3644
Conversation
private[spark] def unsetConf(key: String) { | ||
settings -= key | ||
} | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Used in ParquetTest.withSQLConf
.
Test build #24249 has started for PR 3644 at commit
|
Test build #24250 has started for PR 3644 at commit
|
Test build #24250 has finished for PR 3644 at commit
|
Test FAILed. |
Test build #24249 has finished for PR 3644 at commit
|
Test PASSed. |
Although it passed Jenkins, the first failure is rather weird. It seems that partitions collected via |
retest this please |
Test build #24251 has started for PR 3644 at commit
|
Test build #24251 has finished for PR 3644 at commit
|
Test PASSed. |
ee17d7b
to
3bb8731
Compare
Test build #24304 has started for PR 3644 at commit
|
Test build #24304 has finished for PR 3644 at commit
|
Test FAILed. |
While collecting data from a Parquet based SchemaRDD, the underlying Parquet split may be out of order, thus caused occasional test failures. |
Test build #24314 has started for PR 3644 at commit
|
Test build #24314 has finished for PR 3644 at commit
|
Test PASSed. |
Thanks for doing this! We should use some of these helper functions in the other tests :) Merged to master. |
This is a follow-up of #3367 and #3644. At the time #3644 was written, #3367 hadn't been merged yet, thus `IsNull` and `IsNotNull` filters are not covered in the first version of `ParquetFilterSuite`. This PR adds corresponding test cases. <!-- Reviewable:start --> [<img src="https://reviewable.io/review_button.png" height=40 alt="Review on Reviewable"/>](https://reviewable.io/reviews/apache/spark/3748) <!-- Reviewable:end --> Author: Cheng Lian <[email protected]> Closes #3748 from liancheng/test-null-filters and squashes the following commits: 1ab943f [Cheng Lian] IsNull and IsNotNull Parquet filter test case for boolean type bcd616b [Cheng Lian] Adds Parquet filter pushedown tests for IsNull and IsNotNull
This PR removes the deprecated `ParquetQuerySuite`, renamed `ParquetQuerySuite2` to `ParquetQuerySuite`, and refactored changes introduced in #4115 to `ParquetFilterSuite` . It is a follow-up of #3644. Notice that test cases in the old `ParquetQuerySuite` have already been well covered by other test suites introduced in #3644. <!-- Reviewable:start --> [<img src="https://reviewable.io/review_button.png" height=40 alt="Review on Reviewable"/>](https://reviewable.io/reviews/apache/spark/4116) <!-- Reviewable:end --> Author: Cheng Lian <[email protected]> Closes #4116 from liancheng/remove-deprecated-parquet-tests and squashes the following commits: f73b8f9 [Cheng Lian] Removes deprecated Parquet test suite
This PR removes the deprecated `ParquetQuerySuite`, renamed `ParquetQuerySuite2` to `ParquetQuerySuite`, and refactored changes introduced in apache#4115 to `ParquetFilterSuite` . It is a follow-up of apache#3644. Notice that test cases in the old `ParquetQuerySuite` have already been well covered by other test suites introduced in apache#3644. <!-- Reviewable:start --> [<img src="https://reviewable.io/review_button.png" height=40 alt="Review on Reviewable"/>](https://reviewable.io/reviews/apache/spark/4116) <!-- Reviewable:end --> Author: Cheng Lian <[email protected]> Closes apache#4116 from liancheng/remove-deprecated-parquet-tests and squashes the following commits: f73b8f9 [Cheng Lian] Removes deprecated Parquet test suite
This PR provides a set Parquet testing API (see trait
ParquetTest
) that enables developers to write more concise test cases. A new set of Parquet test suites built upon this API are added and aim to replace the oldParquetQuerySuite
. To avoid potential merge conflicts, old testing code are not removed yet. The following classes can be safely removed after most Parquet related PRs are handled:ParquetQuerySuite
ParquetTestData