This repository has been archived by the owner on Sep 18, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 75
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[NSE-186]backport to 1.1 branch batch3 (#252)
* [NSE-229] Fix the deprecated code warning in shuffle_split_test (#230) * fix the deprecated code warning in shuffle_split_test * fix the code style * format update * [NSE-239] Adopt ARROW-7011 (#240) * [NSE-224] update third party code (#242) * update third party code Signed-off-by: Yuan Zhou <[email protected]> * fix format Signed-off-by: Yuan Zhou <[email protected]> * [NSE-176]Arrow install order issue (#231) * Add Arrow install script * [NSE-176]Add Arrow install Script in Arrow Data Source * remove popd parameter for ubuntu * Change variables to BFS * Update Arrow Header and Find Arrow function * Fix one issue with reading wrong Arrow Path * Update ARROW_CSV=ON * Update build_arrow=OFF in Scala UT * Install parent pom file in Scala UT * [NSE-196] clean up native sql options (#215) * clean up native sql options Signed-off-by: Yuan Zhou <[email protected]> * adding more options Signed-off-by: Yuan Zhou <[email protected]> * adding more options Signed-off-by: Yuan Zhou <[email protected]> * adding warning log for running on non-intel cpu Signed-off-by: Yuan Zhou <[email protected]> * [NSE-206]doc update on feature support status (#253) * update operators support status Signed-off-by: Yuan Zhou <[email protected]> * update docs on operators supporting status Signed-off-by: Yuan Zhou <[email protected]> * fix Signed-off-by: Yuan Zhou <[email protected]> * [NSE-241] fix hashagg result length (#249) * fix hashagg result length Signed-off-by: Yuan Zhou <[email protected]> * optimize on getting batch size Signed-off-by: Yuan Zhou <[email protected]> * using fixed sized output len for hashagg Signed-off-by: Yuan Zhou <[email protected]> * fix format Signed-off-by: Yuan Zhou <[email protected]> * [NSE-248] fix arrow dependency order (#259) * Only read .so.300.0.0 * Fix arroow dataset dependency issue * Add ARROW_S3=ON, Add symlink copy in CMakeList. Co-authored-by: JiaKe <[email protected]> Co-authored-by: Hongze Zhang <[email protected]> Co-authored-by: Wei-Ting Chen <[email protected]>
- Loading branch information
1 parent
198629c
commit c96aebb
Showing
22 changed files
with
5,126 additions
and
3,383 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,106 @@ | ||
#!/bin/bash | ||
|
||
set -eu | ||
|
||
NPROC=$(nproc) | ||
|
||
TESTS=OFF | ||
BUILD_ARROW=OFF | ||
STATIC_ARROW=OFF | ||
ARROW_ROOT=/usr/local | ||
|
||
for arg in "$@" | ||
do | ||
case $arg in | ||
-t=*|--tests=*) | ||
TESTS=("${arg#*=}") | ||
shift # Remove argument name from processing | ||
;; | ||
-a=*|--build_arrow=*) | ||
BUILD_ARROW=("${arg#*=}") | ||
shift # Remove argument name from processing | ||
;; | ||
-s=*|--static_arrow=*) | ||
STATIC_ARROW=("${arg#*=}") | ||
shift # Remove argument name from processing | ||
;; | ||
-ar=*|--arrow_root=*) | ||
ARROW_ROOT=("${arg#*=}") | ||
shift # Remove argument name from processing | ||
;; | ||
*) | ||
OTHER_ARGUMENTS+=("$1") | ||
shift # Remove generic argument from processing | ||
;; | ||
esac | ||
done | ||
|
||
echo "CMAKE Arguments:" | ||
echo "TESTS=${TESTS}" | ||
echo "BUILD_ARROW=${BUILD_ARROW}" | ||
echo "STATIC_ARROW=${STATIC_ARROW}" | ||
echo "ARROW_ROOT=${ARROW_ROOT}" | ||
|
||
CURRENT_DIR=$(cd "$(dirname "$BASH_SOURCE")"; pwd) | ||
echo $CURRENT_DIR | ||
|
||
cd ${CURRENT_DIR} | ||
if [ -d build ]; then | ||
rm -r build | ||
fi | ||
|
||
if [ $BUILD_ARROW == "ON" ]; then | ||
echo "Building Arrow from Source ..." | ||
mkdir build | ||
cd build | ||
ARROW_PREFIX="${CURRENT_DIR}/build" # Use build directory as ARROW_PREFIX | ||
ARROW_SOURCE_DIR="${ARROW_PREFIX}/arrow_ep" | ||
ARROW_INSTALL_DIR="${ARROW_PREFIX}/arrow_install" | ||
|
||
echo "ARROW_PREFIX=${ARROW_PREFIX}" | ||
echo "ARROW_SOURCE_DIR=${ARROW_SOURCE_DIR}" | ||
echo "ARROW_INSTALL_DIR=${ARROW_INSTALL_DIR}" | ||
mkdir -p $ARROW_SOURCE_DIR | ||
mkdir -p $ARROW_INSTALL_DIR | ||
git clone https://github.com/oap-project/arrow.git --branch arrow-3.0.0-oap $ARROW_SOURCE_DIR | ||
pushd $ARROW_SOURCE_DIR | ||
|
||
cmake ./cpp \ | ||
-DARROW_BUILD_STATIC=OFF -DARROW_BUILD_SHARED=ON -DARROW_COMPUTE=ON \ | ||
-DARROW_S3=ON \ | ||
-DARROW_GANDIVA_JAVA=ON \ | ||
-DARROW_GANDIVA=ON \ | ||
-DARROW_PARQUET=ON \ | ||
-DARROW_HDFS=ON \ | ||
-DARROW_BOOST_USE_SHARED=OFF \ | ||
-DARROW_JNI=ON \ | ||
-DARROW_DATASET=ON \ | ||
-DARROW_WITH_PROTOBUF=ON \ | ||
-DARROW_WITH_SNAPPY=ON \ | ||
-DARROW_WITH_LZ4=ON \ | ||
-DARROW_WITH_ZSTD=OFF \ | ||
-DARROW_WITH_BROTLI=OFF \ | ||
-DARROW_WITH_ZLIB=OFF \ | ||
-DARROW_WITH_FASTPFOR=ON \ | ||
-DARROW_FILESYSTEM=ON \ | ||
-DARROW_JSON=ON \ | ||
-DARROW_CSV=ON \ | ||
-DARROW_FLIGHT=OFF \ | ||
-DARROW_JEMALLOC=ON \ | ||
-DARROW_SIMD_LEVEL=AVX2 \ | ||
-DARROW_RUNTIME_SIMD_LEVEL=MAX \ | ||
-DARROW_DEPENDENCY_SOURCE=BUNDLED \ | ||
-DCMAKE_INSTALL_PREFIX=${ARROW_INSTALL_DIR} \ | ||
-DCMAKE_INSTALL_LIBDIR=lib | ||
|
||
make -j$NPROC | ||
make install | ||
|
||
cd java | ||
mvn clean install -P arrow-jni -am -Darrow.cpp.build.dir=${ARROW_INSTALL_DIR}/lib -DskipTests -Dcheckstyle.skip | ||
popd | ||
echo "Finish to build Arrow from Source !!!" | ||
else | ||
echo "Use ARROW_ROOT as Arrow Library Path" | ||
echo "ARROW_ROOT=${ARROW_ROOT}" | ||
fi |
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,30 @@ | ||
| No. | Executor | Description | Notes | BOOLEAN | BYTE | SHORT | INT | LONG | FLOAT | DOUBLE | STRING | DECIMAL | DATE | TIMESTAMP | NULL | BINARY | CALENDAR | ARRAY | MAP | STRUCT | UDT | | ||
| --- | --------------------------- | ----------- | ----------------- | ------- | ---- | ----- | --- | ---- | ----- | ------ | ------ | ------- | ---- | --------- | ---- | ------ | -------- | ----- | --- | ------ | --- | | ||
| 1 | CoalesceExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 2 | CollectLimitExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 3 | ExpandExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 4 | FileSourceScanExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 5 | FilterExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 6 | GenerateExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 7 | GlobalLimitExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 8 | LocalLimitExec | | | | | | | | | | | | | | | | | | | | | | ||
| 9 | ProjectExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 10 | RangeExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 11 | SortExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 12 | TakeOrderedAndPorjectExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 13 | UnionExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 14 | CustomShuffleReaderExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 15 | HashAggregateExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 16 | SortAggregateExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 17 | DataWritingCommandExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 18 | BatchScanExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 19 | BroadcastExchangeExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 20 | ShuffleExchangeExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 21 | BroadcastHashJoinExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 22 | BroadcastNestedLoopJoinExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 23 | CartesianProductExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 24 | ShuffledHashJoinExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 25 | SortMergeJoinExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | | ||
| 26 | ArrowEvalPythonExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 27 | WindowINPandasExec | | using row version | | | | | | | | | | | | | | | | | | | | ||
| 28 | WindowExec | | | y | y | y | y | y | y | y | y | y | y | | | | | | | | | |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.