Ingest subset of roughly 100-200 WV03_MSI_L2A granules into UAT #394

Open
5 tasks
jsrikish opened this issue Oct 8, 2024 · 0 comments

Ingest subset of roughly 100-200 WV03_MSI_L2A granules into UAT to allow verification of metadata and metrics.

ACCOUNT: [csda-cumulus-uat-1686]

To find a date range that has roughly 100-200 granules, make use of AWS Athena by doing the following:

  • In the MCP IMPACT-CSDAP AWS account's console, open the Athena query editor
  • Select the AwsDataCatalog Data Source and the default Database
  • Run the following query to update the partitions: MSCK REPAIR TABLE maxar_transfer
  • Run queries similar to the following (tweaking dt and the path pattern -- see notes below) until a count near the 100-200 range is obtained:
    SELECT * FROM "maxar_transfer$partitions" ORDER BY dt DESC
    SELECT COUNT(*) AS browse_count FROM "maxar_transfer" WHERE dt='2024-05-17-01-00' AND key LIKE 'css/nga/WV03/2A/2009/%P1BS%' AND key LIKE 'css/nga/WV03/2A/2009/%BROWSE.jpg%';
    

This approach uses the count of *BROWSE.jpg files as a proxy for the number of granules in the date range.
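The proxy count can also be reproduced offline once a list of keys has been exported from the query results. The following is a minimal sketch, not part of the project tooling: the function name, the `product_marker` parameter, and the sample keys are all hypothetical and only illustrate the counting logic.

```python
def count_browse_files(keys, prefix, product_marker=None):
    """Count keys under `prefix` that name a BROWSE.jpg file.

    `product_marker` is an optional substring (for example the product
    code used in the Athena path pattern) that a key must also contain.
    """
    count = 0
    for key in keys:
        if not key.startswith(prefix):
            continue
        if "BROWSE.jpg" not in key:
            continue
        if product_marker is not None and product_marker not in key:
            continue
        count += 1
    return count


# Hypothetical keys, invented purely for illustration.
sample_keys = [
    "css/nga/WV03/2A/2009/321/granule_a/a-BROWSE.jpg",
    "css/nga/WV03/2A/2009/321/granule_a/a.tif",
    "css/nga/WV03/2A/2009/322/granule_b/b-BROWSE.jpg",
    "css/nga/WV02/1B/2009/321/granule_c/c-BROWSE.jpg",
]

print(count_browse_files(sample_keys, "css/nga/WV03/2A/2009/"))  # 2
```

Each granule is assumed to contribute exactly one browse image, which is the same assumption the Athena query relies on.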

NOTE: the value of dt in the query above must be set to the most recent date of the S3 inventory (likely the current date, but the day prior to the current date should always work since the most recent 3 days are retained).
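A small helper can produce the previous day's dt value so it does not have to be typed by hand. This is only a sketch: it assumes the partition name is always the calendar date followed by the fixed suffix `-01-00`, as in the example query, and that the inventory date is in UTC. Confirm both against the output of the `maxar_transfer$partitions` query before relying on it. The function name `inventory_dt` is hypothetical.

```python
from datetime import date, datetime, timedelta, timezone


def inventory_dt(today=None):
    """Return the dt partition string for the day before `today`.

    Assumes the partition format 'YYYY-MM-DD-01-00' seen in the example
    query; the '-01-00' suffix is an assumption, not a documented rule.
    """
    if today is None:
        today = datetime.now(timezone.utc).date()
    previous_day = today - timedelta(days=1)
    return previous_day.strftime("%Y-%m-%d") + "-01-00"


print(inventory_dt(date(2024, 5, 18)))  # 2024-05-17-01-00
```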

NOTE: day-of-year values such as 2009/321 and 2009/339 must be converted to calendar dates (yyyy-MM-dd) to determine the ISO 8601 datetime values for the start and end dates of the sample rule. The end date is exclusive in the rule definition, so the end day itself is not ingested. This site provides conversion: https://dqydj.com/day-of-the-year-calculator/

  • 2009/321 is 2009-11-17T00:00:00Z
  • 2009/339 is 2009-12-05T00:00:00Z
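The same conversion can be done locally instead of through the website. The sketch below uses only the Python standard library; the function name `day_of_year_to_iso` is hypothetical, and midnight UTC is assumed for both the start and end values, matching the two examples above.

```python
from datetime import datetime, timedelta, timezone


def day_of_year_to_iso(year, day_of_year):
    """Convert a year and 1-based day of year to an ISO 8601 UTC datetime string."""
    first_day = datetime(year, 1, 1, tzinfo=timezone.utc)
    moment = first_day + timedelta(days=day_of_year - 1)
    if day_of_year < 1 or moment.year != year:
        raise ValueError(f"day {day_of_year} is out of range for {year}")
    return moment.strftime("%Y-%m-%dT%H:%M:%SZ")


print(day_of_year_to_iso(2009, 321))  # 2009-11-17T00:00:00Z
print(day_of_year_to_iso(2009, 339))  # 2009-12-05T00:00:00Z
```

Because the end date is exclusive, pass the day after the last day that should be ingested when computing the end value.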

Acceptance criteria:

  • define a rule named WV03_MSI_L2A___1_UAT.json that spans a period of time that includes roughly 100-200 granules (using instructions above for finding appropriate values for startDate and endDate)
  • load the rule into UAT
DOTENV=.env.uat make bash
cumulus rules add --data app/stacks/cumulus/resources/rules/WV03_MSI_L2A/v1/WV03_MSI_L2A___1_UAT.json
  • trigger the rule to ingest the granules
DOTENV=.env.uat make bash
cumulus rules enable --name WV03_MSI_L2A___1_UAT
cumulus rules run --name WV03_MSI_L2A___1_UAT
  • verify all granules covered by the rule are successfully ingested and published by using UAT Earthdata Search: https://search.uat.earthdata.nasa.gov/search?q=WV03_MSI_L2A (value of q should be collection short name)
  • verify log entries appear in Kibana:
    1. connect to NASA VPN
    2. search Kibana for the recent files
    3. log in with SAML/Launchpad
    4. confirm that the search results show the ingested files (you might need to adjust the timespan in the upper right corner of the page)
    5. message Helen & Aaron to check for the roughly 100-200 granules in UAT
jsrikish self-assigned this Oct 8, 2024