Visit the full docs here to see additional examples and the API reference.
Prefect integrations with NASA Earthdata, taking advantage of the earthaccess
library.
prefect-earthdata
provides a Prefect credentials block and a few tasks to interact with
NASA Earthdata. It does so by leveraging the earthaccess
library and its API.
Loading the EarthdataCredentials
block with your credentials, corresponds to calling the earthaccess.login()
function.
After that, all other earthaccess
functions can be directly used, without having to login again.
Nevertheless, a few tasks are provided to help taking full advantage of Prefect's observability features.
from prefect import flow, get_run_logger
from prefect_earthdata.credentials import EarthdataCredentials
from prefect_earthdata.tasks import download, search_data
@flow(log_prints=True)
def example_earthdata_download_flow():
logger = get_run_logger()
earthdata_credentials = EarthdataCredentials.load("earthdata-credentials")
granules = search_data(
earthdata_credentials,
count=1,
short_name="ATL08",
bounding_box=(-92.86, 16.26, -91.58, 16.97),
)
logger.info(f"File URL: {granules[0].data_links()[0]}")
download_path = "/tmp"
logger.info(f"Downloading data to {download_path}")
files = download(
credentials=earthdata_credentials,
granules=granules,
local_path=download_path,
)
logger.info(f"Downloaded files: {files}")
return granules, files
example_earthdata_download_flow()
Output:
21:07:16.603 | INFO | prefect.engine - Created flow run 'cheerful-peacock' for flow 'example-earthdata-download-flow'
21:07:18.121 | INFO | Flow run 'cheerful-peacock' - Created task run 'search_data-0' for task 'search_data'
21:07:18.121 | INFO | Flow run 'cheerful-peacock' - Executing 'search_data-0' immediately...
21:07:19.642 | INFO | Task run 'search_data-0' - You're now authenticated with NASA Earthdata Login
21:07:19.644 | INFO | Task run 'search_data-0' - Using token with expiration date: 09/01/2023
21:07:20.243 | INFO | Task run 'search_data-0' - Using environment variables for EDL
21:07:21.432 | INFO | Task run 'search_data-0' - Granules found: 760
21:07:22.211 | INFO | Task run 'search_data-0' - Finished in state Completed()
21:07:22.225 | INFO | Flow run 'cheerful-peacock' - File URL: https://data.nsidc.earthdatacloud.nasa.gov/nsidc-cumulus-prod-protected/ATLAS/ATL08/005/2018/11/05/ATL08_20181105083647_05760107_005_01.h5
21:07:22.226 | INFO | Flow run 'cheerful-peacock' - Downloading data to /tmp
21:07:22.586 | INFO | Flow run 'cheerful-peacock' - Created task run 'download-0' for task 'download'
21:07:22.587 | INFO | Flow run 'cheerful-peacock' - Executing 'download-0' immediately...
21:07:24.529 | INFO | Task run 'download-0' - We are already authenticated with NASA EDL
21:07:28.677 | INFO | Task run 'download-0' - Getting 1 granules, approx download size: 0.0 GB
QUEUEING TASKS | : 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 2016.49it/s]
PROCESSING TASKS | : 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:11<00:00, 11.45s/it]
COLLECTING RESULTS | : 100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 25115.59it/s]
21:07:40.388 | INFO | Task run 'download-0' - Finished in state Completed()
21:07:40.392 | INFO | Flow run 'cheerful-peacock' - Downloaded files: ['ATL08_20181105083647_05760107_005_01.h5']
21:07:41.243 | INFO | Flow run 'cheerful-peacock' - Finished in state Completed()
For more tips on how to use tasks and flows in a Collection, check out Using Collections!
Install prefect-earthdata
with pip
:
pip install prefect-earthdata
Requires an installation of Python 3.8+.
We recommend using a Python virtual environment manager such as pipenv, conda or virtualenv.
These tasks are designed to work with Prefect 2.0. For more information about how to use Prefect, please refer to the Prefect documentation.
If you encounter any bugs while using prefect-earthdata
, feel free to open an issue in the prefect-earthdata repository.
If you have any questions or issues while using prefect-earthdata
, you can find help in either the Prefect Discourse forum or the Prefect Slack community.
Feel free to star or watch prefect-earthdata
for updates too!
If you'd like to help contribute to fix an issue or add a feature to prefect-earthdata
, please propose changes through a pull request from a fork of the repository.
Here are the steps:
- Fork the repository
- Clone the forked repository
- Install the repository and its dependencies:
pip install -e ".[dev]"
- Make desired changes
- Add tests
- Insert an entry to CHANGELOG.md
- Install
pre-commit
to perform quality checks prior to commit:
pre-commit install
git commit
,git push
, and create a pull request