stan model chunks #484

sbfnk · 2023-10-24T10:09:52Z

Testing whether we could move duplicated stan code into chunks.

I like the way it simplifies the main models and highlights commonalities. It also makes coding more fault tolerant.

On the other hand it will become even harder to trace code having to navigate, at times, to chunks that then call functions contained in other files. This could be mitigated by, e.g., making functions that are only called once chunks (e.g. the GP functions); or by moving each stan function into a separate files (though would complicate ensuring all relevant functions are loaded for all chunks) or all into one file.

In conclusion, I'm on the fence regarding whether this is a good idea. If going ahead with this there are a few bits of code that could potentially still be moved into chunks.

A few processing functions had to be updated in order to make this work. Also the stan code is probably less efficient in places - touchstone will tell us how much.

Closes #381

jamesmbaazam · 2023-10-24T12:03:59Z

On the issue of navigating the code, can we use prefixes to differentiate between the stan code in ./inst/stan/functions/ directory and those in the ./inst/stan/data/ directory? We could prefix the functions with fnc_* and those in data with "data_". I find that some stan files have the same name in both folders and can be confusing when opened side by side.

github-actions · 2023-10-24T12:41:27Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if dc29e32 is merged into main:

:ballot_box_with_check:default: 57.6s -> 1.01m [-8.61%, +18.58%]
:ballot_box_with_check:no_delays: 1.08m -> 1.09m [-16.28%, +18.26%]
:ballot_box_with_check:random_walk: 18s -> 18.8s [-11.13%, +20.41%]
:rocket:stationary: 38s -> 34.2s [-18.58%, -1.55%]
:ballot_box_with_check:uncertain: 1.39m -> 1.43m [-7.51%, +12.21%]
Further explanation regarding interpretation and methodology can be found in the documentation.

sbfnk · 2023-10-24T14:03:14Z

the stan code is probably less efficient in places

That turned out to be of no concern.

calc sec

github-actions · 2023-10-25T17:16:10Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if 4470973 is merged into main:

:ballot_box_with_check:default: 54.2s -> 54.2s [-12.67%, +12.86%]
:ballot_box_with_check:no_delays: 57.8s -> 1.15m [-5.72%, +43.82%]
:ballot_box_with_check:random_walk: 20.2s -> 16.8s [-54.35%, +20.53%]
:ballot_box_with_check:stationary: 33.5s -> 41.8s [-14.36%, +63.99%]
:ballot_box_with_check:uncertain: 1.28m -> 1.33m [-15.64%, +22.85%]
Further explanation regarding interpretation and methodology can be found in the documentation.

seabbs · 2023-11-01T10:57:16Z

I find that some stan files have the same name in both folders and can be confusing when opened side by side.

So this is intentional in order to show which data chunks link with which function chunks etc? In the viewer I look at these from the tree structure of them being in folders then provides the distinction. Interesting to hear this isn't working out for you though. I would ideally like a solution that doesn't duplicate the info in the folder label in the filename though.

seabbs · 2023-11-01T10:59:52Z

In conclusion, I'm on the fence regarding whether this is a good idea. If going ahead with this there are a few bits of code that could potentially still be moved into chunks.

Yeah, I feel the same way for all the reasons you mention. I think I would much prefer doing this manually (i.e having everything in chunks and then providing R code to generate the complete models (this would need to be done at build time/whenever the chunks were modified (likely in CI to be sure)). In this mists of time I did some work on this in a branch of idbrms that never went anywhere - I think that wouldn't be that hard to revive.

The issue there would be how we maintain the code for generating the models of course...

sbfnk · 2023-11-07T13:12:43Z

In conclusion, I'm on the fence regarding whether this is a good idea. If going ahead with this there are a few bits of code that could potentially still be moved into chunks.

Yeah, I feel the same way for all the reasons you mention. I think I would much prefer doing this manually (i.e having everything in chunks and then providing R code to generate the complete models (this would need to be done at build time/whenever the chunks were modified (likely in CI to be sure)). In this mists of time I did some work on this in a branch of idbrms that never went anywhere - I think that wouldn't be that hard to revive.

The issue there would be how we maintain the code for generating the models of course...

Isn't there also an issue that this would require management of between-snippet dependencies, inflating complexity? Unless perhaps we did away with functions.

seabbs · 2023-11-07T16:08:51Z

Isn't there also an issue that this would require management of between-snippet dependencies, inflating complexity? Unless perhaps we did away with functions.

Hmm I'm not sure how much complexity it would really add to not duplicate functions? Or do you mean something more than this?

sbfnk · 2023-11-07T16:15:38Z

Isn't there also an issue that this would require management of between-snippet dependencies, inflating complexity? Unless perhaps we did away with functions.

Hmm I'm not sure how much complexity it would really add to not duplicate functions? Or do you mean something more than this?

Perhaps I'm misunderstanding what you're suggesting. If we compose a model of chunks then chunks will depend on other chunks (defining functions or declaring variables) which themselves might have dependencies.

seabbs · 2023-11-07T16:17:49Z

Yes but can't you define those in a list? We aren't saying we are going to have a system for composing models - just the models we already have?

sbfnk · 2024-01-16T08:55:39Z

I think we concluded that this wasn't a good idea so I'm closing the PR.

sbfnk added 15 commits October 25, 2023 15:38

delay_type_max chunk

4f6d30f

param chunks

28c3de2

gt_rev_pmf chunk

6bd2b5d

generate_infections chunk

5eb8c73

delay_rev_pmf/trunc_rev_cmf chunks

0854430

observation model chunks

48e687c

calculate_secondary chunk

6c88ea8

calc sec

likelihood chunk

5c62cc0

prior chunks

54a60e4

likelihood chunks

de05054

impute_reports chunk

d8689a3

R_to_growth chunk

6037361

update simulations

422eefb

update tests

04cdc21

make extract_parameter_samples work with samples

c606708

sbfnk force-pushed the stan-chunks branch from 5fce5cc to c606708 Compare October 25, 2023 14:38

sbfnk closed this Jan 16, 2024

sbfnk mentioned this pull request Jan 16, 2024

Put duplicated stan code in chunks #381

Closed

sbfnk deleted the stan-chunks branch May 3, 2024 19:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stan model chunks #484

stan model chunks #484

sbfnk commented Oct 24, 2023

jamesmbaazam commented Oct 24, 2023

github-actions bot commented Oct 24, 2023

sbfnk commented Oct 24, 2023

github-actions bot commented Oct 25, 2023

seabbs commented Nov 1, 2023

seabbs commented Nov 1, 2023 •

edited

Loading

sbfnk commented Nov 7, 2023

seabbs commented Nov 7, 2023

sbfnk commented Nov 7, 2023

seabbs commented Nov 7, 2023

sbfnk commented Jan 16, 2024 •

edited

Loading

stan model chunks #484

stan model chunks #484

Conversation

sbfnk commented Oct 24, 2023

jamesmbaazam commented Oct 24, 2023

github-actions bot commented Oct 24, 2023

sbfnk commented Oct 24, 2023

github-actions bot commented Oct 25, 2023

seabbs commented Nov 1, 2023

seabbs commented Nov 1, 2023 • edited Loading

sbfnk commented Nov 7, 2023

seabbs commented Nov 7, 2023

sbfnk commented Nov 7, 2023

seabbs commented Nov 7, 2023

sbfnk commented Jan 16, 2024 • edited Loading

seabbs commented Nov 1, 2023 •

edited

Loading

sbfnk commented Jan 16, 2024 •

edited

Loading