Data transfer to MCP #145

slesaad · 2022-06-13T19:49:19Z

Epic

Description

One of the most important tasks in migration of VEDA to MCP is migration of the data files to the MCP s3 bucket.

Based on decisions made in #118, it's been decided that this will be carried out by running the data transformation and ingestion pipeline with a flag set to transfer the data to the MCP s3 bucket.

A role based policy will be used to gain access to the MCP s3 bucket from the ingestion pipeline.

Examples

The current data link points to a UAH bucket. That's where the data files exist.

{
  "assets": {
    "cog_default": {
      "href": "s3://climatedashboard-data/bmhd_30m_monthly_bkp/finalBMHD_ScaledVenice_202203.tif",
      "type": "image/tiff; application=geotiff; profile=cloud-optimized",
      "roles": [
        "data",
        "layer"
      ]
    },
  }
}

At the end of the data migration, the links should look like the following and the data should exist in that link:

{
  "assets": {
    "cog_default": {
      "href": "s3://veda-data-store-staging/nightlights-hd-monthly/finalBMHD_ScaledVenice_202203.tif",
      "type": "image/tiff; application=geotiff; profile=cloud-optimized",
      "roles": [
        "data",
        "layer"
      ]
    },
  }
}

Acceptance Criteria:

All the data is transferred to the MCP bucket
All the data link points to the MCP s3 url
The asset/data links from the staging catalog can still be viewed in the cog viewer after the transfer

Checklist for collections

Checklist:

Epic Link
Detailed description
~~Concept diagrams~~
Assignee

xhagrg · 2022-06-14T13:42:40Z

We should probably push to prod bucket rather than staging bucket.

slesaad · 2022-06-14T14:58:48Z

Misc TODOs

Remove povmap-grdi-v1_VNL-2020-01-01_2020-12-31 from grdi-vnl-raster
Delete collection IS2SITMOGR4, added IS2SITMOGR4-cog instead
Delete nightlights-hd-3bands
Add nightlights-500m-daily

slesaad · 2022-06-15T16:40:34Z

Some pgstac database quirks ☠️ realised while migrating the datasets that we should be aware of:

When you update a collection after the items have been ingested into the collection, the items disappear. 🛑 DONOT UPDATE A COLLECTION AFTER THE ITEMS ARE INGESTED, else you'll have to run the ingestion again.
We can't use same id 👯 for items even if they are in different collection

abarciauskas-bgse · 2022-06-22T19:55:16Z

The data products are in s3://veda-data-store-staging and not s3://veda-data-store - @xhagrg @slesaad can we migrate the products to be in s3://veda-data-store? 🙏🏽

abarciauskas-bgse · 2022-06-22T20:12:52Z

We are having a longer discussion on slack so will follow up with next steps

slesaad assigned slesaad and xhagrg Jun 13, 2022

anayeaye mentioned this issue Jun 13, 2022

Ingest subset of HLS data for EJ story #146

Closed

7 tasks

slesaad closed this as completed Jul 11, 2022

anayeaye mentioned this issue Jul 15, 2022

Tech Plan for migrating data (req backend deployment to prod) to MCP VEDA ops NASA-IMPACT/veda-backend#87

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data transfer to MCP #145

Data transfer to MCP #145

slesaad commented Jun 13, 2022 •

edited

Loading

xhagrg commented Jun 14, 2022

slesaad commented Jun 14, 2022 •

edited

Loading

slesaad commented Jun 15, 2022

abarciauskas-bgse commented Jun 22, 2022

abarciauskas-bgse commented Jun 22, 2022

Data transfer to MCP #145

Data transfer to MCP #145

Comments

slesaad commented Jun 13, 2022 • edited Loading

Epic

Description

Examples

Acceptance Criteria:

Checklist for collections

Checklist:

xhagrg commented Jun 14, 2022

slesaad commented Jun 14, 2022 • edited Loading

Misc TODOs

slesaad commented Jun 15, 2022

abarciauskas-bgse commented Jun 22, 2022

abarciauskas-bgse commented Jun 22, 2022

slesaad commented Jun 13, 2022 •

edited

Loading

slesaad commented Jun 14, 2022 •

edited

Loading