-
Notifications
You must be signed in to change notification settings - Fork 236
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for Cloudera CDS-3.3.2 #9396
Add support for Cloudera CDS-3.3.2 #9396
Conversation
Signed-off-by: Rishi <[email protected]>
build |
...n/spark332cdh/scala/com/nvidia/spark/rapids/shims/spark332cdh/SparkShimServiceProvider.scala
Outdated
Show resolved
Hide resolved
...n/spark332cdh/scala/com/nvidia/spark/rapids/shims/spark332cdh/SparkShimServiceProvider.scala
Outdated
Show resolved
Hide resolved
...in/src/main/spark332cdh/scala/com/nvidia/spark/rapids/spark332cdh/RapidsShuffleManager.scala
Outdated
Show resolved
Hide resolved
...32cdh/scala/org/apache/spark/sql/rapids/shims/spark332cdh/RapidsShuffleInternalManager.scala
Outdated
Show resolved
Hide resolved
...n/src/test/spark332cdh/scala/com/nvidia/spark/rapids/shims/spark332cdh/SparkShimsSuite.scala
Outdated
Show resolved
Hide resolved
...src/main/spark350/scala/org/apache/spark/sql/execution/rapids/shims/FilePartitionShims.scala
Show resolved
Hide resolved
Change-Id: I0c4427947d4db88d7d96ccc0f59a8cb5ba4813c6
pom.xml
Outdated
<repositories> | ||
<repository> | ||
<id>cloudera-repo</id> | ||
<url>https://repository.cloudera.com/artifactory/cloudera-repos/</url> | ||
<releases> | ||
<enabled>true</enabled> | ||
</releases> | ||
<snapshots> | ||
<enabled>true</enabled> | ||
</snapshots> | ||
</repository> | ||
</repositories> | ||
<pluginRepositories> | ||
<pluginRepository> | ||
<id>cloudera-repo</id> | ||
<url>https://repository.cloudera.com/artifactory/cloudera-repos/</url> | ||
</pluginRepository> | ||
</pluginRepositories> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We get more and more of these. Let us move repeated repo definitions out of all CDH profiles and have them defined in the global lists at the bottom of the pom like this:
<repositories>
...
<repository>
<id>cloudera-repo</id>
<url>https://repository.cloudera.com/artifactory/cloudera-repos/</url>
<releases>
<enabled>${cloudera.repo.enabled}</enabled>
</releases>
<snapshots>
<enabled>${cloudera.repo.enabled}</enabled>
</snapshots>
</repository>
</repositories>
<pluginRepositories>
...
<pluginRepository>
<id>cloudera-repo</id>
<url>https://repository.cloudera.com/artifactory/cloudera-repos/</url>
<releases>
<enabled>${cloudera.repo.enabled}</enabled>
</releases>
<snapshots>
<enabled>${cloudera.repo.enabled}</enabled>
</snapshots>
</pluginRepository>
</pluginRepositories>
the we can add just a single line to each CDH profile properties:
<cloudera.repo.enabled>true</cloudera.repo.enabled>
override def getShimVersion: ShimVersion = SparkShimServiceProvider.VERSION | ||
|
||
def matchesVersion(version: String): Boolean = { | ||
version.contains(SparkShimServiceProvider.CDH_BASE_VERSION) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess we always do it like this, but wonder if it should be more restrictive like startsWith
or equality after stripping spaces etc.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this could be on a shim specific case depending on what we are expecting. But if we want to look at it, I propose we file a separate issue as it affects more then this shim.
...src/main/spark350/scala/org/apache/spark/sql/execution/rapids/shims/FilePartitionShims.scala
Show resolved
Hide resolved
@sririshindra can you update to resolve the conflicts with the files when you have time? |
Change-Id: I9f05def7dae9b0a6e63b6ac40460f12a7847708c
…332cdh Change-Id: I76ee6e036136919a1cdb0b95cb67bf19c35292bc
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
build |
build |
This PR add support for Cloudera's CDS-3.3.2 spark release. This is quite similar to CDS3.3.0 release.