-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
request review of metadata #1
Comments
I have pushed a new branch that adapts the pyarrow ideas that Vincent shared yesterday: https://github.com/os-climate/wri-gppd-ingestion-pipeline/tree/metadata-v1 Please have a look and comment. |
I have now moved the metadata implementation to DBT pipelines - as OpenMetadata is able to ingest metadata from catalog.json which is generated and versioned when generating DBT documentation |
I just reviewed the notebook and see that it is unchanged since October 2021. It needs a complete overhaul in terms of |
I suggest deprecating the older notebook as I am essentially rebuilding the pipeline from scatch. But keep it around so you can have a look when I complete the end-to-end flow, as you may want to make some functional changes (note: most of the data processing that was in the notebook should be in DBT now). I have already checked in the notebooks for extraction and loading, with the data transformation now being shifted to DBT together with metadata ingestion. |
Please review and comment on the metadata implementation here:
https://github.com/os-climate/wri-gppd-ingestion-pipeline/blob/master/notebooks/WRI-gppd-ingest.ipynb
Relates to: os-climate/os_c_data_commons#48
The text was updated successfully, but these errors were encountered: