
Minimal carbon pool model #134

Merged: 51 commits from feature/dummy_carbon into develop on Feb 7, 2023

Conversation

jacobcook1995 (Collaborator)

Description

I've added a basic carbon pool to the soil module. It only includes 2 pools (the full model will probably include 5 pools), and a function that handles transfers between these two pools. This is all bundled together as part of a SoilCarbon class.

I was originally intending to implement an alternate equation form and a method to switch between the two, but decided that was a bit ambitious for a single pull request. Structurally, there are a couple of things that I'm not completely happy with and would want to clear up before moving on to something like that.

  1. The constant dictionaries defined at the top of the script hold sets of fitting parameters for the various equations; these are taken from fits to data carried out in the literature. I'm not sure if this is a good way to store this kind of information, but it didn't seem sensible to define these fitting parameters as independent constants either. I'm also unsure whether treating them as constants is sensible at all, as I could see us wanting to do sensitivity analysis with them down the line. (A rough sketch of the current structure is below the list.)
  2. I wrote the code with no sanity checking for variables except when the class is initialised. My reasoning is that overly frequent sanity checking would slow down execution, and that we probably don't want the simulation to halt entirely because a negative value has crept into one differential equation in one grid cell. Is this broadly sensible?
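
To illustrate point 1, a minimal sketch of the kind of structure being described. The constant dictionary and binding equation match the snippet discussed later in this review; the pool names and the SoilCarbon internals are placeholders, not the actual implementation:

import numpy as np
from numpy.typing import NDArray

# Fitting parameters from a linear regression in the literature (Mayes et al. (2012))
BINDING_WITH_PH = {"slope": -0.216, "intercept": -0.186}


class SoilCarbon:
    """Bundles the two carbon pools (placeholder names) and their transfers."""

    def __init__(self, pool_1: NDArray[np.float32], pool_2: NDArray[np.float32]) -> None:
        # Sanity checking happens only here, at initialisation (see point 2)
        if np.any(pool_1 < 0) or np.any(pool_2 < 0):
            raise ValueError("Carbon pools must be non-negative")
        self.pool_1 = pool_1
        self.pool_2 = pool_2


def calculate_binding_coefficient(pH: NDArray[np.float32]) -> NDArray[np.float32]:
    """Binding coefficient of labile carbon (m^3 kg^-1)."""
    return 10.0 ** (BINDING_WITH_PH["slope"] * pH + BINDING_WITH_PH["intercept"])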

No hurry on reviewing this, I mainly wanted to get the pull request submitted before I finished for Christmas.

Fixes #99

Type of change

  • New feature (non-breaking change which adds functionality)
  • Optimization (back-end change that speeds up the code)
  • Bug fix (non-breaking change which fixes an issue)

Key checklist

  • Make sure you've run the pre-commit checks: $ pre-commit run -a
  • All tests pass: $ poetry run pytest

Further checks

  • Code is commented, particularly in hard-to-understand areas
  • Tests added that prove fix is effective or that feature works

@alexdewar (Collaborator) left a comment:

This looks really good and it's also a good example of how a PR should be done (i.e. you provided a good explanation, and your code has tests and lots of docstrings). Well done!

I've made some small suggestions throughout. Some slightly "bigger picture" suggestions I had were:

  1. It'd be helpful to say throughout (in docstrings etc.) what units each of your variables is in, where they represent physical values (e.g. I'd assume all your temperatures are in °C but would like to know for sure, and I don't know what the unit of soil moisture is at all).
  2. While it's great that you have tests for each of your functions, you're often only testing that they work for a single set of parameters. This is much better than having nothing at all, but I think you should go one step further and test multiple sets or, ideally, ranges of values (i.e. by using @pytest.mark.parametrize — a sketch follows this list). I was working on something similar myself today: https://github.com/ImperialCollegeLondon/FINESSE/blob/main/tests/test_tc4820.py
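
As an illustration of parametrizing, a minimal sketch. The import path and function signature assume the calculate_binding_coefficient function discussed later in this thread, and the expected values are computed from the Mayes et al. (2012) slope and intercept quoted below, so treat them as placeholders:

import numpy as np
import pytest

from virtual_rainforest.models.soil.carbon import calculate_binding_coefficient


@pytest.mark.parametrize(
    "pH,expected",
    [
        (np.array([3.0]), np.array([0.14656])),
        (np.array([4.5, 7.0]), np.array([0.06950, 0.02004])),
    ],
)
def test_calculate_binding_coefficient(pH, expected):
    # Each parameter set runs as its own test case
    assert np.allclose(calculate_binding_coefficient(pH), expected, rtol=1e-3)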

[10 inline review comments on virtual_rainforest/soil/carbon.py, since resolved]
@alexdewar (Collaborator):

I realise I forgot to answer your questions:

  1. Seems sensible to me!
  2. I would just add the sanity checks. In the grand scheme of things it shouldn't hurt performance much, and it could potentially save you a lot of debugging pain further down the line (a rough sketch of one possible check follows below).
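
For illustration only, one shape such a check could take — a hypothetical helper (not an agreed project interface) that logs a warning rather than halting the run, given the concern above about stopping the whole simulation over one bad grid cell:

import logging

import numpy as np
from numpy.typing import NDArray

LOGGER = logging.getLogger(__name__)


def check_non_negative(values: NDArray[np.float32], name: str) -> None:
    """Warn (rather than raise) if any element of a pool has gone negative."""
    if np.any(values < 0.0):
        LOGGER.warning("Negative values encountered in %s", name)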

@davidorme (Collaborator):

On question 2, I think this overlaps with #135 - I've added a sketch of how this might work across modules to that discussion.

@codecov-commenter commented Dec 16, 2022:

Codecov Report

Merging #134 (8d985d2) into develop (9659a15) will increase coverage by 0.35%.
The diff coverage is 100.00%.

@@             Coverage Diff             @@
##           develop     #134      +/-   ##
===========================================
+ Coverage    94.95%   95.31%   +0.35%     
===========================================
  Files           14       15       +1     
  Lines          714      768      +54     
===========================================
+ Hits           678      732      +54     
  Misses          36       36              
Impacted Files                                 Coverage Δ
virtual_rainforest/core/config.py              98.23% <ø> (ø)
virtual_rainforest/models/plants/__init__.py   100.00% <ø> (ø)
virtual_rainforest/models/soil/__init__.py     100.00% <ø> (ø)
virtual_rainforest/models/soil/model.py        94.73% <ø> (ø)
virtual_rainforest/__init__.py                 100.00% <100.00%> (ø)
virtual_rainforest/models/soil/carbon.py       100.00% <100.00%> (ø)


@davidorme (Collaborator) left a comment:

Looks good - some suggestions on the tests and some possible tweaks on the implementation, but nothing serious. I think the tests are worth updating, so requesting those changes.

[7 inline review comments on tests/test_soil_carbon.py and virtual_rainforest/models/soil/carbon.py, since resolved]
@davidorme (Collaborator) left a comment:

Looks good but a few things to look at.

[4 inline review comments on tests/test_soil_carbon.py and virtual_rainforest/models/soil/carbon.py]
@alexdewar (Collaborator) left a comment:

I've made some very minor comments, but otherwise I think this is good to go. Good job!

[5 inline review comments on tests/test_soil_carbon.py and virtual_rainforest/models/soil/carbon.py, mostly resolved]

One remaining thread is on these lines of virtual_rainforest/models/soil/carbon.py:

    labile carbon (m^3 kg^-1)
    """

    return 10.0 ** (BINDING_WITH_PH["slope"] * pH + BINDING_WITH_PH["intercept"])

Collaborator:

I can't remember if I said this in my original review, but I think BINDING_WITH_PH would be better as a dataclass.

jacobcook1995 (Collaborator, Author):

Ahh right, is that the case for everywhere that I'm using dictionaries of constants, or is this specific to BINDING_WITH_PH?

jacobcook1995 (Collaborator, Author):

And the class would be defined like this

@dataclass
class BindingWithPH:
    """From linear regression (Mayes et al. (2012))."""
    intercept: float = -0.186
    slope: float = -0.216

and accessed like this?

return 10.0 ** (BindingWithPH.slope * pH + BindingWithPH.intercept)

Collaborator:

If it is just the object.thing notation, then there are other options for dot-style dictionaries (dotmap, for example). But @alexdewar, did you mean that BindingWithPH would also provide BindingWithPH.calc_binding?

@dataclass
class BindingWithPH:
    """From linear regression (Mayes et al. (2012))."""
    intercept: float = -0.186
    slope: float = -0.216

    def calc_binding(self, pH: NDArray) -> NDArray:
        return 10.0 ** (self.slope * pH + self.intercept)

Collaborator:

More generally, I think there's an argument for the constants module providing something that can be easily serialised to JSON or YAML so that the constants can become part of the configuration if needed. core.constants might provide a default set, but people might want to play around with the settings.

I've used dataclasses for this kind of role here: https://github.com/davidorme/pyrealm/blob/master/pyrealm/param_classes.py

I will accept that the size of the dataclasses may not be ideal 😬
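
As an illustration of that idea, a minimal sketch of a constants dataclass round-tripping through JSON so that it could live in the configuration (the config values below are made up):

import json
from dataclasses import asdict, dataclass


@dataclass
class BindingWithPH:
    """From linear regression (Mayes et al. (2012))."""

    intercept: float = -0.186
    slope: float = -0.216


# Serialise the defaults so they can be written out as part of a config file
defaults_json = json.dumps(asdict(BindingWithPH()))

# ...and rebuild an instance from (possibly user-edited) configuration values
user_config = json.loads('{"intercept": -0.186, "slope": -0.25}')
binding_params = BindingWithPH(**user_config)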

Collaborator:

@jacobcook1995 Yes, that's essentially what I had in mind, but I'm open to suggestions if something like dotmap (which I hadn't heard of till now) is better.

The reason I suggested a dataclass was so you can get better type linting (i.e. mypy will be able to see what fields your BindingWithPH class has and what types they are). You could also just have them as separate variables.

[1 further inline review comment on virtual_rainforest/models/soil/carbon.py]
@jacobcook1995 (Collaborator, Author):

Made the decision to switch to dataclasses rather than dotmap, as I could work out how to add parameter-specific docstrings more easily with dataclasses. Happy for this to change down the line when we agree on a consistent approach between modules for constants, units, etc.

@davidorme (Collaborator) left a comment:

LGTM. Obvious caveats about how we are going to handle constants, and about possibly having some kind of shared utility for bounds checking and, if so, where it should live.

@davidorme (Collaborator) left a comment:

Wait. I think there's something off about using dataclasses like that.

@davidorme (Collaborator):

So I may not be understanding how dataclasses are used, but I've always seen them used by creating an instance of the dataclass. So given:

@dataclass
class BindingWithPH:
    """From linear regression (Mayes et al. (2012))."""
    intercept: float = -0.186
    slope: float = -0.216

Then a function might work like this:

def calculate_binding_coefficient(pH: NDArray[np.float32], coef: BindingWithPH = BindingWithPH()):
    ...

And then the code can use coef.slope, etc. You could then run that calculation with the default values or something else:

res = calculate_binding_coefficient(pH_array, BindingWithPH(slope=-0.23))

Here, we're using the attributes more like class attributes. The dataclass does expose those attributes but they seem to fundamentally not do what I expect. So for example:

In [40]: @dataclass
    ...: class BindingWithPH:
    ...:     """From linear regression (Mayes et al. (2012))."""
    ...:     intercept: float = -0.186
    ...:     slope: float = -0.216
    ...: 

# create a normal instance

In [41]: inst  = BindingWithPH()
In [42]: inst
Out[42]: BindingWithPH(intercept=-0.186, slope=-0.216)

# and with a different slope
In [43]: inst  = BindingWithPH(slope=3)
In [44]: inst
Out[44]: BindingWithPH(intercept=-0.186, slope=3)

# The value _can_ be accessed directly from the class, not an instance
# as if it is a class attribute

In [45]: BindingWithPH.slope
Out[45]: -0.216

# And it can be changed too

In [46]: BindingWithPH.slope = 3
In [47]: BindingWithPH.slope
Out[47]: 3

# BUT that doesn't interact with the instance __init__, which keeps
# returning the original defaults.

In [48]: inst  = BindingWithPH()
In [49]: inst
Out[49]: BindingWithPH(intercept=-0.186, slope=-0.216)
In [50]: BindingWithPH.__init__
Out[50]: <function __main__.BindingWithPH.__init__(self, intercept: float = -0.186, slope: float = -0.216) -> None>

I'm not sure what the advantage of dataclass over a namedtuple is here - we're not using instances of it.

I'm also wary of how users change these values. I'm not sure, but at the moment I think you could do it by changing the "class attribute" value as above, but you then have absolutely no control over what other functions might be accessing that altered value. If an instance is created, with defaults or altered, then the user knows what version they are passing.

This all seems really familiar for some reason!

@alexdewar
Copy link
Collaborator

Yeah, I guess a namedtuple would be another way to go. Alternatively you could just have two separate constants defined at the module level.

I hear your concerns about the values being mutable, but that is kind of what you get with Python. Module-level constants are mutable too.

@davidorme
Copy link
Collaborator

I think dataclasses are probably the way to go - see the discussion here in #162 - and I agree that mutability is just Python for you. I have used frozen dataclasses over in pyrealm, which is probably just unnecessary paranoia. Well, actually, not paranoia, because someone will do it, but stopping people from changing key constants halfway through an analysis should be a job for common sense, not code.
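
For reference, a minimal sketch of the frozen variant (illustrative only, reusing the BindingWithPH example from this thread):

from dataclasses import dataclass


@dataclass(frozen=True)
class BindingWithPH:
    """From linear regression (Mayes et al. (2012))."""

    intercept: float = -0.186
    slope: float = -0.216


params = BindingWithPH(slope=-0.23)  # overriding a default at construction still works
# params.slope = -0.3  # would raise dataclasses.FrozenInstanceError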

@jacobcook1995 (Collaborator, Author):

So the suggestion here, @davidorme, would be to keep the classes as they are, but move them to a models/soil/constants.py script so that all of the module's constants dataclasses are stored in a single location? And then these would be imported and supplied as default values to the functions that use them?

@davidorme (Collaborator):

Yes - I think so. That's very close to what you have already and seems like a sensible way to organise things anyway. Something else might come of #162 but no alarm bells yet!
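
For illustration, a rough sketch of that organisation, reusing the names from this thread (this is a sketch under those assumptions, not the final implementation):

# virtual_rainforest/models/soil/constants.py
from dataclasses import dataclass


@dataclass
class BindingWithPH:
    """From linear regression (Mayes et al. (2012))."""

    intercept: float = -0.186
    slope: float = -0.216


# virtual_rainforest/models/soil/carbon.py
import numpy as np
from numpy.typing import NDArray

from virtual_rainforest.models.soil.constants import BindingWithPH


def calculate_binding_coefficient(
    pH: NDArray[np.float32], coef: BindingWithPH = BindingWithPH()
) -> NDArray[np.float32]:
    """Binding coefficient of labile carbon (m^3 kg^-1)."""
    return 10.0 ** (coef.slope * pH + coef.intercept)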

@davidorme (Collaborator) left a comment:

LGTM!

@jacobcook1995 jacobcook1995 merged commit 7f445d9 into develop Feb 7, 2023
@jacobcook1995 jacobcook1995 deleted the feature/dummy_carbon branch February 7, 2023 09:12