Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CELEBORN-1002] Add SBT MRClientProject #1930

Closed
wants to merge 8 commits into from

Conversation

cxzl25
Copy link
Contributor

@cxzl25 cxzl25 commented Sep 20, 2023

What changes were proposed in this pull request?

Why are the changes needed?

./build/make-distribution.sh --sbt-enabled -Pmr
./build/make-distribution.sh --sbt-enabled --release

Does this PR introduce any user-facing change?

How was this patch tested?

@codecov
Copy link

codecov bot commented Sep 20, 2023

Codecov Report

Merging #1930 (cbb8a42) into main (84ef527) will not change coverage.
Report is 2 commits behind head on main.
The diff coverage is n/a.

@@           Coverage Diff           @@
##             main    #1930   +/-   ##
=======================================
  Coverage   46.50%   46.50%           
=======================================
  Files         164      164           
  Lines       10341    10341           
  Branches      954      954           
=======================================
  Hits         4808     4808           
- Misses       5218     5219    +1     
+ Partials      315      314    -1     

see 3 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more


(assembly / logLevel) := Level.Info,

// Exclude `scala-library` from assembly.
Copy link
Contributor

@cfmcgrady cfmcgrady Sep 20, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems that this relocation does not take effect.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

then shall we remove this rule?

name.startsWith("guava-") ||
name.startsWith("netty-") ||
name.startsWith("commons-lang3-") ||
name.startsWith("RoaringBitmap-"))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm... it seems we don't need the lz4-java dependency for MR shaded client? cc @FMX

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It may be because in the hadoop MR environment, there is no lz4-java.

https://issues.apache.org/jira/browse/HADOOP-17292

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

shall we also include zstd-jni? cc @FMX

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think Hadoop has zstd-jni already. I'll make a test to verify this later.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like you have already added zstd-jni, I think it's OK.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hadoop uses the native zstd and does not use zstd-ini.

See https://issues.apache.org/jira/browse/HADOOP-13578

@pan3793
Copy link
Member

pan3793 commented Sep 28, 2023

#1933 updates the shading configuration, the SBT part should be updated accordingly

case m if m == "META-INF/LICENSE.txt" => MergeStrategy.discard
case m if m == "META-INF/NOTICE.txt" => MergeStrategy.discard
case m if m == "LICENSE.txt" => MergeStrategy.discard
case m if m == "NOTICE.txt" => MergeStrategy.discard
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

SBT Packaging Flink/Spark Excluding license related files will be implemented in the next PR.

@cxzl25 cxzl25 changed the title [CELEBORN-1002] add SBT MRClientProject [CELEBORN-1002] Add SBT MRClientProject Sep 28, 2023
@cfmcgrady
Copy link
Contributor

SBT and Maven have different guava versions.

./build/mvn dependency:tree -Pmr -am -pl client-mr/mr

[INFO] --- maven-dependency-plugin:3.1.1:tree (default-cli) @ celeborn-client-mr_2.12 ---
[INFO] org.apache.celeborn:celeborn-client-mr_2.12:jar:0.4.0-SNAPSHOT
[INFO] +- org.apache.celeborn:celeborn-common_2.12:jar:0.4.0-SNAPSHOT:compile
[INFO] |  +- com.google.guava:guava:jar:14.0.1:compile
./build/sbt -Pmr celeborn-client-mr/dependencyTree | grep guava
[info]   | +-com.google.guava:guava:14.0.1 (evicted by: 27.0-jre)
[info]   | +-com.google.guava:guava:27.0-jre

@cfmcgrady
Copy link
Contributor

we can align the guava versions in #1928.

@cfmcgrady
Copy link
Contributor

we can align the guava versions in #1928.

addressed in cbb8a42

@cfmcgrady
Copy link
Contributor

thanks, merging to main(v0.4.0).

@cfmcgrady cfmcgrady closed this in 22f5235 Oct 8, 2023
cfmcgrady added a commit that referenced this pull request Nov 6, 2023
…jars

### What changes were proposed in this pull request?
Flink/Spark jars packaged with SBT use the correct LICENSE and NOTICE.

### Why are the changes needed?
#1930 (comment)

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

Closes #1967 from cxzl25/CELEBORN-1031.

Lead-authored-by: sychen <[email protected]>
Co-authored-by: Fu Chen <[email protected]>
Signed-off-by: Fu Chen <[email protected]>
gotikkoxq added a commit to gotikkoxq/celeborn that referenced this pull request Aug 26, 2024
…jars

### What changes were proposed in this pull request?
Flink/Spark jars packaged with SBT use the correct LICENSE and NOTICE.

### Why are the changes needed?
apache/celeborn#1930 (comment)

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

Closes #1967 from cxzl25/CELEBORN-1031.

Lead-authored-by: sychen <[email protected]>
Co-authored-by: Fu Chen <[email protected]>
Signed-off-by: Fu Chen <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants