-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Recipe to create pool of obs4MIPs data #3317
Conversation
Hi @bouweandela So I have encountered several issues with the download from ESGF. For datasets MODIS-1-0, AIRS-2-1, AIRS-2-0 See code for testing what I mean: #!/bin/bash
from pyesgf.search import SearchConnection
conn = SearchConnection('https://esgf-node.llnl.gov/esg-search', distrib=True)
facets='project,source_id,variable,time_frequency'
ctx = conn.new_context(project='obs4MIPs', source_id='MODIS-1-0', facets=facets)
print(ctx.hit_count)
ctx = conn.new_context(project='obs4MIPs', source_id='MODIS-1-0', variable="clt", facets=facets)
print(ctx.hit_count)
ctx = conn.new_context(project='obs4MIPs', source_id='MODIS-1-0', variable="clt", time_frequency="mon", facets=facets)
print(ctx.hit_count)
ctx = conn.new_context(project='obs4MIPs', source_id='AIRS-2-1', facets=facets)
print(ctx.hit_count)
ctx = conn.new_context(project='obs4MIPs', source_id='AIRS-2-1', variable="hus", facets=facets)
print(ctx.hit_count)
ctx = conn.new_context(project='obs4MIPs', source_id='AIRS-2-1', variable="hus", time_frequency="mon", facets=facets)
print(ctx.hit_count) |
I can't find |
Finally, question regarding derived variables like |
Note that I am using a dummy diagnostic. Is there another way? |
Note that this will not only download the data, but also create CMORized copies. If this is not needed, I would recommend writing a small script that uses the |
It appears that some obs4MIPs datasets use I'm not entirely sure how to solve this, we could consider removing this line from our code, but that may cause trouble for datasets that provide the same variable in multiple frequencies. |
Maybe it is related to the issue with In [1]: from esmvalcore.esgf import find_files
In [2]: find_files(project='obs4MIPs', dataset='RSS-v7', short_name='*')
Out[2]:
[ESGFFile:obs4MIPs/RSS-v7/v20180305/prw_mon_RSS-v7_BE_gn_198801_201512.nc on hosts ['aims3.llnl.gov'],
ESGFFile:obs4MIPs/RSS-v7/v20180305/sfcWind_mon_RSS-v7_BE_gn_198801_201512.nc on hosts ['aims3.llnl.gov'],
ESGFFile:obs4MIPs/RSS-v7/v20180305/tos_mon_RSS-v7_BE_gn_200206-201012.nc on hosts ['aims3.llnl.gov']] or see here for some results in json format. |
I suspect that mixing different datasets is not supported by the tool. You could try using |
Yes, use |
Thanks for initiating this, @rbeucher. I support the idea of having a recipe to check the completeness of a shared obs4MIPs data pool on clusters. I was wondering if it wouldn't be simpler to replicate what we do for CMORized data in the recipe_check_obs.yml but for obs4MIPs data used in our recipes, e.g. Here is an example: CERES-EBAF:
description: CERES-EBAF check
variables:
rlut:
rlutcs:
additional_datasets:
- {dataset: CERES-EBAF, project: obs4MIPs, mip: Amon, tier: 1,
start_year: 2000, end_year: 2014}
scripts: null When ran with
|
Description
This is an attempt at a recipe leveraging the ESGF download capability to create an obs4MIPs data pool at NCI.
I am testing downloads from ESGF using corrected names.
See discussion here #2974
Before you get started
Checklist
It is the responsibility of the author to make sure the pull request is ready to review. The icons indicate whether the item will be subject to the 🛠 Technical or 🧪 Scientific review.
New or updated recipe/diagnostic
New or updated data reformatting script
To help with the number of pull requests: