Migrate code, write tests, write docstrings #19

barneydobson · 2024-01-18T15:46:48Z

Need to do..

barneydobson · 2024-01-19T09:11:32Z

Currently have done terrible job at creating sensible modules, propose to migrate code into these modules:

downloaders.py - as is
graph_operations.py - every registered graphfcn that takes a graph and returns a graph
geospatial_operations.py - all the complicated polygon and raster stuff goes here
hydraulic_design.py - currently just the guts pipe_by_pipe go in here - tbh could just go in graph_operations.py, but then it becomes a busy module
swmm_text_tools.py - all the SWMM file writing tools go here - not sure how good Jinja2 is but maybe this isn't needed
swmmanywhere.py - main script to read the config file and cycle the graph operations

barneydobson · 2024-01-19T09:20:17Z

While @dalonsoa and @cheginit try and get a big picture, I will work on geospatial_operations.py (because these are functions that are not really related to the overall software architecture) and swmm_text_tools.py (because it's such a mess that there's not much to be done and we'll probably replace it with Jinja2 ( #16 )

cheginit · 2024-01-24T00:50:14Z

For modules and structuring the codes, I recommend making them a bit more categorical. For example, something like the following:

prepare_data: This module contains functionalities for retrieving raw data from various sources
pre_processing: This module processes the raw input data into formats that the BUSN generator algorithm and SWMM case generator require
generate_busn: This module generates BUSNs as nx.DiGraph objects
post_processing: This module generates SWMM cases based on the generated BUSNs and other input data
data_analysis: Sensitivity analysis and other data analysis code goes into this module.
utils: All other misc functionalities go here. If there are many of them we can have more specific utility modules, e.g., utils_geo, utils_graph, and so on.

A thematic structure like this makes it easier to navigate the codebase.

cheginit · 2024-01-24T00:59:38Z

For managing config files, there are two main options: in-house reader or existing config management libraries. If the config file is simple, we can develop a reader module with suitable validator functionalities. If there are many input files, I recommend using existing Python libraries. A popular choice is MogaConf. A more general solution is pydantic that provides powerful schema generation and validators. So, it boils down to the complexity of the config files.

dalonsoa · 2024-01-24T08:54:56Z

I very much recommend using existing Python tooling for configuration based on standard formats (yaml, toml, etc). Otherwise, things can become very complicated very easily. Another option, more lightweight than the ones suggested by @cheginit (which are indeed pretty good), would be schema.

barneydobson · 2024-01-25T15:31:32Z

For modules and structuring the codes, I recommend making them a bit more categorical. For example, something like the following:

prepare_data: This module contains functionalities for retrieving raw data from various sources

pre_processing: This module processes the raw input data into formats that the BUSN generator algorithm and SWMM case generator require

generate_busn: This module generates BUSNs as nx.DiGraph objects

post_processing: This module generates SWMM cases based on the generated BUSNs and other input data

data_analysis: Sensitivity analysis and other data analysis code goes into this module.

utils: All other misc functionalities go here. If there are many of them we can have more specific utility modules, e.g., utils_geo, utils_graph, and so on.

A thematic structure like this makes it easier to navigate the codebase.

OK I'm easy on naming conventions.
As I see it the mapping to what I have now in both this and the old repository is as follows:

downloaders -> prepare_data
pre_processing -> equivalent mainly to the operations in download function (besides the downloads themselves) in the old repo, and I guess a few more where I do any other data tidying on the fly.
swmm_text_tools -> post_processing
experimenter a function in the old repo -> data_analysis - I don't want to touch this yet until we make some other decisions e.g., about config file Configuration file #10 since that will determine some big picture choices about how we would 'do' sensitivity analysis
swmmanywhere (old repo) -> generate_busn - though I don't like the name generate_busn since I have every intention of expanding this to cover foul networks down the line. Perhaps just generate_network ?
geospatial_analysis -> utils_geo
graph_functions -> utils_graph

If we're happy with this I will first make a PR to rename modules.

Implicit within this for me is the idea that there will be a set of registered graph functions in utils_graph whose order is defined in the config file, which is read and iterated over in the generate_network module. Are you both happy with this?

Then, perhaps do we need to finalise discussion of config file (#10 ) before commencing on utils_graph and generate_network (I think it makes sense that these two modules are under the same PR since they are quite tightly linked)?

I would probably work on pre_processing after these to fit the requirements of the polished functions in utils_graph. Then finally the data_analysis once everything else is done.

Also @dalonsoa you mentioned for the graph functions (i.e., a function that takes a graph and gives a graph), I can register it, @register_graphfcn for example. Is there any more I need to know about that at this stage - is it similar to what you did in WSIMOD nodes? Possibly this too is linked in with the config file discussion

dalonsoa · 2024-01-26T09:08:24Z

The purpose is similar, but the implementation is not.

In that case I used __init_subclass__ in the parent class to automatically register subclasses in a registry that you can use somewhere else (and do some validation and raise warnings/errors if needed). In this case, you use the decorator you register a function - any function you want - in a registry that contain functions that can be applied to the graph sequentially. Again, you can include in this decorator function some validation aspects (eg. the decorated function has some particular signature).

cheginit · 2024-01-26T15:06:45Z

Using generate_network is fine by me.

I have a self-imposed limit that when a module becomes larger than 1000 LOC, I tend to break it into more modules. So, if generate_network becomes large, you can break it into two modules, generate_busn, and generate_bssn (below-ground sanitary sewer network), or generate_foul_network 😄

barneydobson · 2024-03-18T11:32:46Z

Closed by #83

barneydobson added documentation Improvements or additions to documentation feature Adding a new functionality, small or large enhancements priority! labels Jan 18, 2024

barneydobson mentioned this issue Jan 19, 2024

Separate functions into modules #15

Closed

barneydobson mentioned this issue Jan 19, 2024

Start of geospatial analysis #21

Merged

5 tasks

barneydobson self-assigned this Jan 19, 2024

barneydobson mentioned this issue Jan 22, 2024

SWMM text tools #25

Merged

4 tasks

barneydobson mentioned this issue Jan 25, 2024

Configuration file #10

Closed

barneydobson mentioned this issue Jan 26, 2024

Graphfcns #31

Merged

13 tasks

barneydobson mentioned this issue Feb 5, 2024

Preprocessing #37

Merged

4 tasks

barneydobson mentioned this issue Mar 13, 2024

Create demo_config.yml #83

Merged

barneydobson closed this as completed Mar 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Migrate code, write tests, write docstrings #19

Migrate code, write tests, write docstrings #19

barneydobson commented Jan 18, 2024

barneydobson commented Jan 19, 2024 •

edited

Loading

barneydobson commented Jan 19, 2024

cheginit commented Jan 24, 2024

cheginit commented Jan 24, 2024

dalonsoa commented Jan 24, 2024

barneydobson commented Jan 25, 2024 •

edited

Loading

dalonsoa commented Jan 26, 2024

cheginit commented Jan 26, 2024

barneydobson commented Mar 18, 2024

Migrate code, write tests, write docstrings #19

Migrate code, write tests, write docstrings #19

Comments

barneydobson commented Jan 18, 2024

barneydobson commented Jan 19, 2024 • edited Loading

barneydobson commented Jan 19, 2024

cheginit commented Jan 24, 2024

cheginit commented Jan 24, 2024

dalonsoa commented Jan 24, 2024

barneydobson commented Jan 25, 2024 • edited Loading

dalonsoa commented Jan 26, 2024

cheginit commented Jan 26, 2024

barneydobson commented Mar 18, 2024

barneydobson commented Jan 19, 2024 •

edited

Loading

barneydobson commented Jan 25, 2024 •

edited

Loading