86 sensitivity analysis sampling #97

Merged: 72 commits into main, Apr 1, 2024

Conversation

@barneydobson (Collaborator) commented Mar 16, 2024

Description

Adjustments to the config file, and a new module to run sensitivity analysis.

Fixes #86

Summary of changes:

  • Enable a model_number to be given to SWMManywhere.
  • Enable setting which parameters to sample and how many samples to draw (via a SALib sample magnitude); see the sketch after this list.
  • Update to a design storm (for now; see Precipitation events #55) - this accounts for the majority of lines removed.
  • Fix small errors revealed by testing.
  • The current shortest-path method appears to tolerate negative cycles (since it is heap based), so the error is downgraded to a warning.
  • Log completed metrics.
  • Improve the alignment of results in the metrics.
  • Add experimenter.py to sample, run, evaluate and save results for a sensitivity analysis defined in a config file.
  • Add submit_icl_example - an example job-array submission script for the ICL HPC.
  • Add some tests.
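
For orientation, a minimal sketch (not the PR's code) of the SALib sampling step that experimenter.py drives; the parameter names and bounds below are hypothetical.

# A minimal sketch, assuming SALib is installed; names and bounds are
# hypothetical, not taken from parameters.py.
from SALib.sample import saltelli

problem = {
    'num_vars': 2,
    'names': ['node_merge_distance', 'outfall_length'],
    'bounds': [[1.0, 50.0], [10.0, 600.0]],
}

# N is the sample "magnitude"; SALib expands it to N * (2 * num_vars + 2)
# model runs when second-order indices are requested.
N = 2 ** (problem['num_vars'] - 1)
X = saltelli.sample(problem, N, calc_second_order=True)
print(X.shape)  # (N * (2 * num_vars + 2), num_vars)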

@barneydobson linked an issue on Mar 16, 2024 that may be closed by this pull request
@barneydobson self-assigned this on Mar 16, 2024
@barneydobson added the sa_paper (Sensitivity analysis paper) label on Mar 16, 2024

@dalonsoa (Collaborator) left a comment

It looks ok, but I think it has a few aspects to polish.

Comment on lines +47 to +53
bound = [parameters[parameter]['minimum'],
         parameters[parameter]['maximum']]

names.append(parameter)
bounds.append(bound)
dists.append(parameters[parameter].get('dist', 'unif'))
groups.append(parameters[parameter]['category'])
Collaborator

Are all of these - parameter, maximum, minimum, etc. - guaranteed to exist in the parameters dictionary? If not, I would use get with a default value, or handle the possibility of it failing with a custom error message.

Collaborator Author

They should do, because of the use of pydantic in parameters.py. Admittedly there are some 'unsampleable' parameters in there, so I have added the following to the config validation:

# Check that the parameter is sample-able
required_attrs = set(['minimum', 'maximum', 'default', 'category'])
correct_attrs = required_attrs.intersection(params[param])
missing_attrs = required_attrs.difference(correct_attrs)
if any(missing_attrs):
    raise ValueError(f"{param} missing {missing_attrs} so cannot be sampled.")
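
For illustration only, a hedged sketch (hypothetical parameter entries, not the real contents of parameters.py) of how this check behaves on a flattened parameter dictionary:

# Hypothetical flattened parameter dictionary; the real entries live in parameters.py.
params = {
    'node_merge_distance': {'minimum': 1.0, 'maximum': 50.0,
                            'default': 10.0, 'category': 'topology'},
    'some_unsampleable_flag': {'default': True, 'category': 'options'},
}

param = 'some_unsampleable_flag'
required_attrs = {'minimum', 'maximum', 'default', 'category'}
missing_attrs = required_attrs - set(params[param])
if missing_attrs:
    # Raises: some_unsampleable_flag missing {'minimum', 'maximum'} ... (set order may vary)
    raise ValueError(f"{param} missing {missing_attrs} so cannot be sampled.")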


if N is None:
    N = 2 ** (problem['num_vars'] - 1)
problem_ = problem.copy()
Collaborator

Why are you creating the problem object and then copying it?

Collaborator Author

If groups is False, then I need to delete the groups key before passing to SALib, but I still want to retain the groups information. I've improved the comments to explain this.
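
For context, a minimal sketch (assumed variable names, not the PR's exact code) of the copy-then-delete pattern being described:

# groups=False means SALib should sample parameters individually, but the
# original problem dict keeps its group labels for later bookkeeping.
groups = False
N = None

problem = {
    'num_vars': 2,
    'names': ['node_merge_distance', 'outfall_length'],
    'bounds': [[1.0, 50.0], [10.0, 600.0]],
    'groups': ['topology', 'topology'],
}

if N is None:
    N = 2 ** (problem['num_vars'] - 1)

problem_ = problem.copy()       # shallow copy; 'problem' retains 'groups'
if not groups:
    del problem_['groups']      # only the copy handed to SALib loses the key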

Comment on lines 94 to 100
for x, y, z in zip(problem['groups'],
                   problem['names'],
                   params):
    X.append({'param': y,
              'value': z,
              'iter': ix,
              'group': x})
Collaborator

Before, you removed groups if sampling via groups was not required, but here you are using them again. I think the workflow of this function needs to be clarified.

@barneydobson (Collaborator Author) Mar 26, 2024

Yes, I see it was confusing - hopefully the new comments clarify it?
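
As an editorial aside, a hedged sketch (hypothetical sample values) of what this loop produces: one tidy record per parameter per sample, using the names and group labels retained in the original problem dict.

import pandas as pd

problem = {
    'names': ['node_merge_distance', 'outfall_length'],
    'groups': ['topology', 'topology'],
}
sample_rows = [[12.3, 250.0], [4.7, 90.0]]   # stand-in for SALib's sampled rows

X = []
for ix, params in enumerate(sample_rows):
    for group, name, value in zip(problem['groups'], problem['names'], params):
        X.append({'param': name, 'value': value, 'iter': ix, 'group': group})

print(pd.DataFrame(X))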

# Iterate over the samples, running the model when the jobid matches the
# processor number
for ix, params_ in gb:
    if ix % nproc != jobid:
Collaborator

I don't get this. You are generating a lot of samples but then only running the model for the ones whose index matches jobid. What's the rationale for that?

Collaborator Author

It's for easy use with a job array: jobid will be different on different processors - happy to change if there's a better way!
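
For readers unfamiliar with job arrays, a minimal sketch of the round-robin idea (environment-variable names shown for PBS Pro and Slurm; treat them as illustrative):

import os

# Each array task runs the same script; the scheduler sets a distinct index,
# e.g. PBS_ARRAY_INDEX on PBS Pro or SLURM_ARRAY_TASK_ID on Slurm.
nproc = 4                                          # number of array tasks
jobid = int(os.environ.get('PBS_ARRAY_INDEX', 0))  # this task's index

sample_indices = range(10)                         # stand-in for the samples
mine = [ix for ix in sample_indices if ix % nproc == jobid]
print(f"task {jobid} runs samples {mine}")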

Collaborator

By job array, do you mean Slurm's?

I also don't quite follow the logic here. You have N samples, and you want to submit a job for each? I am not seeing how the mod operator does this.

Based on my understanding of what you're trying to achieve, IMHO, this is better:

job_iter = tlz.partition_all(nproc, range(len(X)))
for _ in range(jobid + 1):
    job_idx = next(job_iter, None)

if job_idx is None:
    print('No job to do')
    return

config = config_base.copy()
for ix in job_idx:
    params = gb.get_group(ix)
    ...

Each Slurm job does a partition of the samples.

params = parameters.get_full_parameters_flat()
for param in config.get('parameters_to_sample', {}):
    if isinstance(param, dict):
        param = list(param.keys())[0]
Collaborator

What if there's more than one key? Are the rest of them not relevant? How do you know the first one is the one that matters?

@barneydobson (Collaborator Author) Mar 26, 2024

Is it clearer with this?

# If the parameter is a dictionary, the values are bounds, all we are 
# checking here is that the parameter exists, we only need the first 
# entry.
if isinstance(param, dict):
    if len(param) > 1:
        raise ValueError("""If providing new bounds in the config, a dict 
                            of len 1 is required, where the key is the 
                            parameter to change and the values are 
                            (new_lower_bound, new_upper_bound).""")
    param = list(param.keys())[0]
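
For illustration, a hypothetical parameters_to_sample entry (a Python representation of the config; names and bounds are made up) showing the two accepted forms:

parameters_to_sample = [
    'node_merge_distance',               # plain name: use bounds from parameters.py
    {'outfall_length': (10.0, 600.0)},   # one-key dict: (new_lower_bound, new_upper_bound)
]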

from pathlib import Path

import pandas as pd
import toolz as tlz
Collaborator

Instead of toolz, use its C version, cytoolz, since it's the same package (same developers) written in Cython. So this import becomes: import cytoolz.curried as tlz.
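
A quick illustration of the suggested drop-in swap; cytoolz mirrors the toolz API, so partition_all behaves the same:

import cytoolz.curried as tlz   # was: import toolz as tlz

chunks = list(tlz.partition_all(3, range(8)))
print(chunks)   # [(0, 1, 2), (3, 4, 5), (6, 7)]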

@barneydobson merged commit 25b6b05 into main on Apr 1, 2024 (10 checks passed)
@barneydobson deleted the 86-sensitivity-analysis-sampling branch on Apr 1, 2024 at 19:04