Skip to content

Latest commit

 

History

History
27 lines (23 loc) · 1.02 KB

README.md

File metadata and controls

27 lines (23 loc) · 1.02 KB

Download Video from VATEX and Jukinmedia

first, cd into the Crawler folder

cd Crawler

VATEX

run the following script to generate the download cmd string to a file

python download_cmd_generation.py --video_path [VIDEO SAVE FOLDER] --ann_file [VATEX ANNOTATION FILE PATH] --output_filename [CMD FILE OUTPUT PATH]

Then run following script to download the video from Youtube.(PS: This script calls the youtube video download tool yt-dlp. You need to install it first)

python parallel_download_VATEX.py --num_process 32 --cmd_file [THE FILE YOU GENERATE FROM LAST STEP]

Jukin Media

Run the following script to generate the video_id and video information from jukinmedia to a file.

python download_jukin_video_id.py --savefolder [THE VIDEO INFO SAVEFOLDER]

Run the folowing script to download the video

python parallel_download_jukin.py --save_dir [VIDEO SAVE FOLDER] --input_file [THE FILE YOU GENERATE FROM LAST STEP] --num_process 5