add orca horovod tf2 test to github actions #5601

Merged
merged 34 commits into from
Sep 13, 2022
Changes from 30 commits
6e2f762
add orca horovod tf2 test to github actions
pinggao18 Aug 31, 2022
81d524a
update pull_request
pinggao18 Aug 31, 2022
79c352d
update
pinggao18 Aug 31, 2022
70cd169
add keras
pinggao18 Aug 31, 2022
398c820
update python version
pinggao18 Sep 1, 2022
2ce03f1
update horovod
pinggao18 Sep 1, 2022
aa58b5f
update cmake
pinggao18 Sep 1, 2022
8fd3fd4
install bigdl-orca-spark3
pinggao18 Sep 1, 2022
1c2dab7
update
pinggao18 Sep 1, 2022
0c9ddfd
update mxnet
pinggao18 Sep 1, 2022
7036e26
update
pinggao18 Sep 1, 2022
dc17aaf
update numpy
pinggao18 Sep 1, 2022
4458bba
update
pinggao18 Sep 1, 2022
0aa21a2
update ray version
pinggao18 Sep 2, 2022
3ebd4d6
unintsall intel-tensorflow
pinggao18 Sep 2, 2022
21d09b3
add conda env
pinggao18 Sep 2, 2022
fa0245c
install horovod
pinggao18 Sep 5, 2022
11b5614
remove conda env
pinggao18 Sep 5, 2022
eaeac8e
install bigdl-orca-spark3
pinggao18 Sep 5, 2022
ff69478
split conda env
pinggao18 Sep 7, 2022
d46f182
source conda
pinggao18 Sep 7, 2022
4176900
split conda env
pinggao18 Sep 7, 2022
e26de8f
update
pinggao18 Sep 7, 2022
27b0ec5
update env
pinggao18 Sep 8, 2022
30cb3b4
remove env
pinggao18 Sep 8, 2022
09a66e4
update
pinggao18 Sep 8, 2022
63aee4d
update
pinggao18 Sep 8, 2022
73d8241
uninstall requirements
pinggao18 Sep 8, 2022
8ec9a2e
update
pinggao18 Sep 8, 2022
42a3560
update install whl
pinggao18 Sep 8, 2022
6d00823
update
pinggao18 Sep 9, 2022
bdc0b5b
add remove env
pinggao18 Sep 9, 2022
1d92e93
install bigdl-orca-spark3
pinggao18 Sep 9, 2022
9ab6fa4
update
pinggao18 Sep 13, 2022
@@ -0,0 +1,57 @@
name: 'Run Orca Python Horovod Tf2 Py37 Spark3'
Review comment (Contributor): Will this file be removed later?

Reply (Contributor Author): It will be removed once the test passes.


description: 'Run Orca Python Horovod Tf2 Py37 Spark3'
runs:
  using: "composite"
  steps:
    - name: Setup Env
      shell: bash
      run: |
        apt-get update
        apt-get install -y wget

    - name: Set up Python
      uses: actions/setup-python@v2
      with:
        python-version: '3.7.10'

    - name: Install dependencies
      shell: bash
      run: |
        python -m pip install --upgrade pip
        python -m pip install --upgrade setuptools==58.0.4

    - name: Run Test
      shell: bash
      run: |
        conda remove -n orca-python-horovod-tf2-py37 -y --all
        conda create -n orca-python-horovod-tf2-py37 -y python==3.7.10 setuptools==58.0.4
        conda info -e
        source activate orca-python-horovod-tf2-py37
        pip uninstall -y bigdl-friesian bigdl-friesian-spark3 bigdl-dllib bigdl-dllib-spark3 bigdl-orca pyspark bigdl-orca-spark3 bigdl-chronos bigdl-chronos-spark3 bigdl-friesian bigdl-friesian-spark3
        pip uninstall -y intel-tensorflow
        pip install cmake mxnet==1.6.0
        pip uninstall -r .github/actions/python-requirements/requirements-horovod-tf2.txt -y
        pip install -i https://pypi.tuna.tsinghua.edu.cn/simple -r .github/actions/python-requirements/requirements-horovod-tf2.txt
        pip install pytest
        # Pass the build flags as a command prefix so they reach the pip process;
        # separated by ';' they would stay unexported shell variables.
        HOROVOD_WITH_PYTORCH=1 HOROVOD_WITH_GLOO=1 pip install --no-cache-dir horovod==0.19.2
        pip uninstall -y opencv-python-headless
        pip install opencv-python-headless

        #sed -i "s/'bigdl-core=='+VERSION/'bigdl-core==2.1.0b20220811'/g" python/dllib/src/setup.py
        #bash python/dev/release_default_linux_spark312.sh default false false

        # install dllib
        #pip install python/dllib/src/dist/bigdl_dllib*-py3-none-manylinux1_x86_64.whl

        # install orca
        #pip install python/orca/src/dist/bigdl_orca*-py3-none-manylinux1_x86_64.whl
        pip install --pre --upgrade bigdl-orca-spark3
        export SPARK_LOCAL_HOSTNAME=localhost
        chmod a+x python/orca/dev/test/run-pytests-horovod-tf.sh
        python/orca/dev/test/run-pytests-horovod-tf.sh
        ray stop

        conda remove -n bigdl-init --all
      env:
        BIGDL_ROOT: ${{ github.workspace }}
        ANALYTICS_ZOO_ROOT: ${{ github.workspace }}
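
A note on the Horovod install step: build flags such as `HOROVOD_WITH_GLOO` only take effect if they end up in the environment of the `pip` process. The following is a minimal sketch of the shell behavior involved, not part of the PR. `DEMO_FLAG` is a hypothetical variable name, and `sh -c` stands in for `pip` as the child process.

```shell
#!/bin/sh
# Sketch: what a child process sees for DEMO_FLAG (a hypothetical name)
# depending on how the variable is set in the parent shell.

unset DEMO_FLAG

# 1) Assignment terminated by ';' creates a plain, unexported shell variable,
#    so the child process started afterwards does not inherit it.
DEMO_FLAG=1; with_semicolon=$(sh -c 'echo "${DEMO_FLAG:-unset}"')

unset DEMO_FLAG

# 2) Assignment written as a command prefix is placed in the environment
#    of that one command only.
with_prefix=$(DEMO_FLAG=1 sh -c 'echo "${DEMO_FLAG:-unset}"')

# 3) An exported variable is inherited by every later child process.
export DEMO_FLAG=1
with_export=$(sh -c 'echo "${DEMO_FLAG:-unset}"')

echo "semicolon: $with_semicolon"
echo "prefix:    $with_prefix"
echo "export:    $with_export"
```

Only the prefix and `export` forms make the value visible to the child, which is why build flags for `pip install` are usually written on the same command line without separators.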
@@ -0,0 +1,17 @@
name: 'Run Orca Python Horovod Tf2 Py37 Spark3'
description: 'Run Orca Python Horovod Tf2 Py37 Spark3'
runs:
  using: "composite"
  steps:
    - name: Run Test
      shell: bash
      run: |
        source activate orca-oython-horovod-tf2-py37-spark3
        export SPARK_LOCAL_HOSTNAME=localhost
        chmod a+x python/orca/dev/test/run-pytests-horovod-tf.sh
        python/orca/dev/test/run-pytests-horovod-tf.sh
        ray stop
        source deactivate
      env:
        BIGDL_ROOT: ${{ github.workspace }}
        ANALYTICS_ZOO_ROOT: ${{ github.workspace }}
50 changes: 50 additions & 0 deletions .github/actions/orca/setup-env/setup-horovod-tf2-py37/action.yml
@@ -0,0 +1,50 @@
name: 'Run Orca Python Horovod Tf2 Py37 Spark3'
description: 'Run Orca Python Horovod Tf2 Py37 Spark3'
runs:
  using: "composite"
  steps:
    - name: Set up Python
      uses: actions/setup-python@v2
      with:
        python-version: '3.7.10'

    - name: Install dependencies
      shell: bash
      run: |
        apt-get update
        apt-get install -y wget
        python -m pip install --upgrade pip
        python -m pip install --upgrade setuptools==58.0.4

    - name: Setup env
      shell: bash
      run: |
        if conda info --env | grep "orca-oython-horovod-tf2-py37-spark3"; then
          source activate orca-oython-horovod-tf2-py37-spark3
        else
          conda create -n orca-oython-horovod-tf2-py37-spark3 -y python==3.7.10 setuptools==58.0.4
          conda info --env
          source activate orca-oython-horovod-tf2-py37-spark3
        fi
        pip uninstall -y bigdl-friesian bigdl-friesian-spark3 bigdl-dllib bigdl-dllib-spark3 bigdl-orca pyspark bigdl-orca-spark3 bigdl-chronos bigdl-chronos-spark3 bigdl-friesian bigdl-friesian-spark3
        pip uninstall -y intel-tensorflow
        pip install cmake mxnet==1.6.0
        pip uninstall -r .github/actions/python-requirements/requirements-horovod-tf2.txt -y
        pip install -i https://pypi.tuna.tsinghua.edu.cn/simple -r .github/actions/python-requirements/requirements-horovod-tf2.txt
        pip install pytest
        # Pass the build flags as a command prefix so they reach the pip process;
        # separated by ';' they would stay unexported shell variables.
        HOROVOD_WITH_PYTORCH=1 HOROVOD_WITH_GLOO=1 pip install --no-cache-dir horovod==0.19.2
        pip uninstall -y opencv-python-headless
        pip install opencv-python-headless
        #pip install --pre --upgrade bigdl-orca-spark3
        sed -i "s/'bigdl-core=='+VERSION/'bigdl-core==2.1.0b20220811'/g" python/dllib/src/setup.py
        bash python/dev/release_default_linux_spark312.sh default false false

        # install dllib
        pip install python/dllib/src/dist/bigdl_dllib*-py3-none-manylinux1_x86_64.whl

        # install orca
        pip install python/orca/src/dist/bigdl_orca*-py3-none-manylinux1_x86_64.whl

      env:
        BIGDL_ROOT: ${{ github.workspace }}
        ANALYTICS_ZOO_ROOT: ${{ github.workspace }}
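
The setup step reuses an existing conda environment when one is found and creates it otherwise. A plain `grep` on the listing matches substrings, so a longer environment name containing the target would also count as a hit. Below is a minimal sketch of an exact-name check, offered as one possible approach rather than what the PR does. The helper `env_exists`, the sample listing, and the environment names in it are all hypothetical; the sketch assumes the listing has the environment name in the first column, as `conda info --envs` prints it.

```shell
#!/bin/sh
# env_exists NAME
# Reads a `conda info --envs`-style listing on stdin and succeeds only if
# NAME is exactly the first field of some line (comment lines start with '#').
env_exists() {
  awk -v name="$1" '$1 == name { found = 1 } END { exit !found }'
}

# Hypothetical sample listing, so the sketch runs without conda installed.
sample_envs='# conda environments:
#
base                  *  /opt/conda
orca-horovod-tf2-py37-old    /opt/conda/envs/orca-horovod-tf2-py37-old'

# Exact match: the shorter name is not an environment of its own.
if printf '%s\n' "$sample_envs" | env_exists "orca-horovod-tf2-py37"; then
  exact=yes
else
  exact=no
fi

# Substring match: grep reports a hit because the name is a prefix of another.
if printf '%s\n' "$sample_envs" | grep -q "orca-horovod-tf2-py37"; then
  substring=yes
else
  substring=no
fi

echo "exact=$exact substring=$substring"
```

On a real runner the helper would be fed from conda directly, for example `conda info --envs | env_exists "$ENV_NAME"`.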
33 changes: 33 additions & 0 deletions .github/actions/python-requirements/requirements-horovod-tf2.txt
@@ -0,0 +1,33 @@
torch==1.7.1
torchvision
torchmetrics
albumentations
scikit-learn==0.22.2.post1
opencv-python
matplotlib
tqdm
protobuf==3.15.3
pandas
ray[default]==1.9.2
aiohttp==3.8.1
async-timeout==4.0.1
dm_tree
gym[atari]==0.17.1
atari-py==0.2.5
tabulate==0.8.7
tensorboard==2.3.0
tensorboard-data-server==0.6.1
tensorboard-plugin-wit==1.7.0
tensorboardX==2.1
tensorflow==2.3.0
tensorflow-estimator==2.3.0
xgboost_ray
argparse
Pillow
pyarrow==4.0.1
h5py==2.10.0
fsspec==2021.9.0
scikit-optimize==0.8.1
pandas==1.1.5
Keras-Applications==1.0.8
Keras-Preprocessing==1.1.2
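
In the list above `pandas` appears twice, once unpinned and once as `pandas==1.1.5`. Depending on the pip version, a repeated entry may be rejected or merged, so it can be worth checking for. The following is a rough sketch of such a check, not part of the PR. The function name `duplicate_requirements` is hypothetical, and the sketch compares names case-insensitively without applying pip's full name normalization (for example `-` versus `_`).

```shell
#!/bin/sh
# duplicate_requirements
# Reads requirements lines on stdin and prints each package name that is
# listed more than once. Comments, whitespace, extras such as [default],
# and version specifiers are stripped before comparing.
duplicate_requirements() {
  sed -e 's/#.*//' -e 's/[[:space:]]//g' -e '/^$/d' -e 's/[][<>=!~;].*//' \
    | tr '[:upper:]' '[:lower:]' \
    | sort \
    | uniq -d
}

# Small sample taken from the kinds of entries in the file above.
dups=$(printf '%s\n' 'pandas' 'ray[default]==1.9.2' 'Pillow' 'pandas==1.1.5' \
  | duplicate_requirements)

echo "duplicates: $dups"
```

Against the real file the call would look like `duplicate_requirements < .github/actions/python-requirements/requirements-horovod-tf2.txt`.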
23 changes: 22 additions & 1 deletion .github/workflows/nightly_test.yml
@@ -1,7 +1,7 @@
name: Nightly Test

on:
  #pull_request:
  pull_request:
  schedule:
    - cron: '0 13 * * *' # GMT time, 13:00 GMT == 21:00 China
  workflow_dispatch:
@@ -19,6 +19,7 @@ on:
- Orca-Python-Ray-Py37-Spark3
- Orca-Python-Py37-Spark3
- Orca-Ray-Ctx-Example
- Orca-Python-Horovod-Tf2-Py37-Spark3
- Dllib-Scala-UT
- Friesian-Scala-UT
- PPML-Scala-UT
@@ -29,6 +30,11 @@ permissions:
  packages: write

jobs:
  #env:
  #name: conda env
  #outputs:
  #Orca-Python-Horovod-Tf2-Py37-Spark3: Orca-Python-Horovod-Tf2-Py37-Spark3

  Orca-Python-ExampleTest-Py37-Spark3:
    if: ${{ github.event.schedule || github.event.inputs.artifact == 'Orca-Python-ExampleTest-Py37-Spark3' || github.event.inputs.artifact == 'all' }}
    runs-on: [self-hosted, Gondolin, ubuntu-20.04-lts]
@@ -95,6 +101,21 @@ jobs:
      - name: Run test
        uses: ./.github/actions/orca-python-py37-spark3-action

  Orca-Python-Horovod-Tf2-Py37-Spark3:
    #if: ${{ github.event.schedule || github.event.inputs.artifact == 'Orca-Python-Horovod-Tf2-Py37-Spark3' || github.event.inputs.artifact == 'all' }}
    runs-on: [self-hosted, Gondolin, ubuntu-20.04-lts]

    steps:
      - uses: actions/checkout@v3
      - name: Set up JDK8
        uses: ./.github/actions/jdk-setup-action
      - name: Set up maven
        uses: ./.github/actions/maven-setup-action
      - name: Setup env
        uses: ./.github/actions/orca/setup-env/setup-horovod-tf2-py37
      - name: Run Test
        uses: ./.github/actions/orca/orca-python-horovod-tf2-py37-spark3-action/nightly-test

Review comment (Contributor): Will a remove-env action be added?

Reply (Contributor Author): No. Removing the env would make each test run take too long, so a remove-env action will not be added.

  Dllib-Scala-UT:
    if: ${{ github.event.schedule || github.event.inputs.artifact == 'Dllib-Scala-UT' || github.event.inputs.artifact == 'all' }}
    runs-on: [ self-hosted, Gondolin, ubuntu-20.04-lts ]
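
The `if:` expressions on these jobs gate each job on three inputs: whether the run was triggered by the schedule, and whether the manually chosen `artifact` names this job or is `all`. As an illustration only, the same decision can be written as a small shell function. The function `should_run` and its argument order are hypothetical, and empty strings stand in for values that GitHub leaves unset for a given event.

```shell
#!/bin/sh
# should_run JOB SCHEDULE ARTIFACT
# Mirrors the workflow condition:
#   github.event.schedule || inputs.artifact == JOB || inputs.artifact == 'all'
# An empty SCHEDULE or ARTIFACT stands for a value that is not set.
should_run() {
  [ -n "$2" ] || [ "$3" = "$1" ] || [ "$3" = "all" ]
}

job='Orca-Python-Horovod-Tf2-Py37-Spark3'

# Scheduled nightly run: schedule is set, no manual input.
should_run "$job" '0 13 * * *' '' && nightly=run || nightly=skip

# Manual dispatch selecting this job.
should_run "$job" '' "$job" && manual=run || manual=skip

# Manual dispatch selecting a different job.
should_run "$job" '' 'Dllib-Scala-UT' && other=run || other=skip

# Pull request event: neither a schedule nor a manual input is present.
should_run "$job" '' '' && pr=run || pr=skip

echo "nightly=$nightly manual=$manual other=$other pr=$pr"
```

With the condition in place a job is skipped on pull request events. That is presumably why the new Horovod job has its `if:` commented out while the `pull_request` trigger is enabled for testing.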