-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Questions about the Extdata.rc #346
Comments
Hi @helpyuan, thanks for asking about this. ExtData.rc is indeed confusing. MAPL ExtData is going to be overhauled in MAPL 3, which we plan to include in GCHP v15.0 next year, and with that update will be a switch to a new yaml-format ExtData configuration file. We hope it will be easier to understand when that change happens. For now we are still using the old way which has several rather hidden features. I will do my best to explain below. Regarding the "D" entry for the climatology column, this means day of week. It is a special case of values that recur in time throughout the year. The file contains 7 values per month, one for each day of the week in the month. It was implemented specifically for GEOS-Chem NEI scale factors. You should only use "D" if you have a day-of-week file. Regarding the year entry for the climatology column, this is a feature of MAPL that we typically do not use but that snuck in with an update. If you replace "Y" with the year then MAPL will always use that year as climatology for all years. It can be redundant depending on how you use it. There is actually an update going into the next GCHP version to change how it is specified for AEIC19 (the example you posted) to be more clear, and to fix the typo in those lines (%d2 should be 01). It is more consistent to use "Y" for clim and then put the year in the refresh template (F2019-%m2-01). I recommend doing this if add any new climatology inputs. Regarding Refresh, I am not sure that example is correct. It appears to have been copied from the GEOS wiki page for ExtData. If you have 3-hourly data I suggest copying what is done for the 3-hourly meteorology fields. For example, see here for reading 3-hourly data that is stored in daily files and the first time in the file is 00:00:00. It gets more complicated if the times are offset from 00:00:00. An example of reading 3-hourly fields whose times start at 01:30:00 rather than 00:00:00 is here. There is an extra entry at the end of the line that specifies the a reference start time (with 1 hr 30 min offset) and the frequency (3hr). For that example each file has only one time, as show in the filename template. Regarding Refresh F2010-%m2-01T00:00:00, that's a good question and worth checking. In my answer above I said this will always use the 2010 data for all years, but your question makes me wonder if that is really what is happening. We should check this. I'm not sure I understand your question #3. If the Clim column specifies Y then it assumes 12-month climatology. Are you asking about the file itself, or how ExtData reads and uses the data? |
Hi @lizziel - one note: I think that using the target year in "clim" is preferable to using "Y", as I think that "Y" implies (requires?) that the data are monthly only. I'm not 100% sure on that but wanted to give you a heads up (for example, if true this would cause a problem for something like the daily AEIC 2019 data). |
Thanks @sdeastham. Yes, my understanding is Y means monthly only. The daily AEIC should not use Y. I checked my 14.2.1 fixes for AEIC and I left the daily as this (it was the monthly that needed fixing, and only for performance so as not to try to refresh daily): Regarding monthly climatology, do you know if it is problematic to hard-code year in the refresh template, e.g. what we do for EDGAR scale factors? |
I think hard-coding the year in the refresh template is fine - I'm not aware of any issues with that! |
Thank you very much for your answer @lizziel, which deepened my understanding of Extdata.rc. I will study the examples you mentioned carefully. "I'm not sure I understand your question #3. If the Clim column specifies Y then it assumes 12-month climatology. Are you asking about the file itself, or how ExtData reads and uses the data?" |
Hi @helpyuan, we currently only use a day-of-week scale factor for NEI99, and the file containing it has different scale factors for every month for an entire year. This is slightly different than your example. I suggest configuring For diurnal scale factors, see the example for EDGAR under title "Diurnal scale factor" in |
Thank you for answering my confusion @lizziel! My question has been basically resolved, and I think this issue can be closed. |
Name and Institution (Required)
Name: liumy
Institution: AMS
Confirm you have reviewed the following documentation
Description of your issue or question
Hi all, I have the following questions about Extdata.rc:
1. About the
Clim
.The description of Clim in the GCHP manual is as follows [https://gchp.readthedocs.io/en/latest/user-guide/config-files/ExtData_rc.html]:
Enter Y if the file is a 12 month climatology, otherwise enter N. If you specify it is a climatology ExtData the data can be on either one file or 12 files if they are templated appropriately with one per month
. My understanding of this is thatClim
is eitherN
orY
. However, in the template ofExtdata.rc.fullchem
, I found these forms, such as,The above figures show
Clim
parameters such asD
and2019
, I want to know whatD
and2019
represent here? Can I useN
orY
instead ofD
and2019
?2. About the
Refresh
.There is an example in the GCHP manual that introduces
Refresh
, as follows:For example, a template in the form %y4-%m2-%d2T12:00:00 will cause the variable to be updated at the start of a new day (i.e. when the clock hits 2007-08-02T00:00:00 it will update the variable but the time it will use for reading and interpolation is 2007-08-02T12:00:00).
What I don't understand is that if set to
%y4-%m2-%d2T12:00:00
, shouldn't the data be updated at 2007-08-02 12:00:00? PerhapsF% y4-%m2-%d2T12:00:00
can update the data at 2007-08-02 12:00:00?.In addition, what I would like to know is that I have 1 emission source data that changes every 3 hours. Do I need to set my
Refresh
to%y4-%m2-%d2T%h2:00:00
?Also, I can see that the following is in
Refresh
format:Does
F2010-%m2-01T00:00:00
mean that this data will only be updated in 2010? If the mode runs until 2011, will this data not be updated?3. Perhaps this is a fundamental question.
I would like to know how to know if the file is
a 12 month climatology
inClim
.The above are some of my questions, and I am grateful for any answers.
Please provide as much detail as possible. Always include the GCHP version number and any relevant configuration and log files.
The text was updated successfully, but these errors were encountered: