-
Notifications
You must be signed in to change notification settings - Fork 319
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integrate Ben's chunking in the new mksurfdat toolchain #643
Comments
I wanted to get a task list down here for the chunked regridding (see this board for more detailed issues). I ordered the tasks by priority.
Ping @rsdunlapiv |
My current understanding:
cmd="$mpirun By "ocgis call" we mean parts of Ben's bash script that does the spatial subsetting and weight generation: I would like an expert's opinion on which parts of Ben's script MUST end up in mkmapdata.sh.
At the Monday meeting we can discuss when to have the tour of mkmapdata.sh to address (1) and (2). Are (1) and (2) in the correct order here? |
From today's discussion: Next step is to just play around with using ocgis, to make sure it will work for our use cases with reasonable memory, time, etc. We can worry about getting a robustly engineered solution as a following step. Can start by just seeing what the cmds are that are done by mkmapdata. |
Keeping @billsacks and @ekluzek in the loop: @negin513 @bekozi and I met today. Negin and Ben worked in Negin’s environment (not using conda) to get Ben's script running. They started troubleshooting from the same error that I got in my attempts: “...requires PIO…” Now Negin has a script that works with Ben’s source and destination datasets. Next Negin and Ben tried running the script with a couple of CTSM’s default source and destination datasets and failed. Ben will troubleshoot and let us know in the next day or two. |
@negin513, @slevisconsulting: I pushed a fix for the SCRIP grids (NCPP/ocgis#497) to master. Thankfully it was a quick one. I also added some new examples:
Let me know how it goes! |
Working in cheyenne:/gpfs/fs1/work/slevis/ocgis_work Good news. I ran two scripts successfully:
I will post more tests soon. |
Comparing the output from (2) in the previous post to the output from running mkmapdata.sh: ncdump -h weights_0520.nc ncdump -h map_0.5x0.5_AVHRR_to_4x5_nomask_aave_da_c190516.nc // global attributes: The lines in these plots are densely packed dots. There is only one S value per n_s. |
@slevisconsulting Is it possible to include the regridding commands that generated these weight files? It would be useful to verify the same flags are being used. Thanks! |
It took me and @bekozi a few iterations, but we have a script that works now: Repeating test (3) above with Repeating test (4) above with |
Good news: |
Results from new tests.
14-17) requesting 40 chunks PASS |
@ekluzek pls confirm whether I understand the dependencies correctly: @billsacks I think it's the other way around: #643 depends on #806 |
No, I would describe #806 as depending on this one, because you want to use the subsetting that's available in OCGIS to accomplish #806. So the first thing to do is to move mkmapdata.sh to use OCGIS, and then add the ability to also use it to do the subsetting. There are going to be changes that will be needed to mkmapdata.sh in order for it to use OCGIS either for subsetting or making maps, so you might as well do that first, and then adding the subsetting functionality to it won't be hard to do. |
So a way to look at it is this. This issue is refactoring mkmapdata.sh to use OCGIS. Doing that will make it easier to add subsetting (using OCGIS) to it. So this is a refactoring that enables adding a new feature easier. |
Meeting with @ekluzek Currently @slevisconsulting has Instead, we will need to add ocgis to ctsm as an external. Something different will need to happen with ESMPy. Not clear what, yet. |
I've added another issue to OCGIS to add more metadata to their mapping files... |
Based on discussion in #645 we actually don't want ocgis as an external: instead, this will be installed on the system via conda. |
@billsacks @ekluzek |
With @mvertens work on making mksurfdata run in parallel, I think this issue becomes a WONTFIX. Once, we've completely validated the parallel mksurfdata we should close this. |
Modify mkmapdata to include calls to OCGIS.
@billsacks wrote:
Initially, this will largely consist of doing some experimentation: making sure this tool works for our current mkmapdata uses (replacing the direct call to the ESMF regridder with a call to the OCGIS tool), and investigating performance, memory requirements, etc., for both our standard use cases and for typical new grids.
The text was updated successfully, but these errors were encountered: