1 write get baseline predictions function #4
Conversation
This is looking good! I added a few comments -- mostly minor, one question of substance about how to handle the two different output types.
#' @param seed integer specifying a seed to set for reproducible results.
#'   Defaults to NULL, in which case no seed is set.
I'd vote to take this argument out, and if the user wants reproducible results they can just set the seed before calling the function.
I found that the results actually weren't reproducible unless I set the seed within the function, which is why I added the argument.
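For illustration, here is a minimal sketch of the pattern under discussion: an optional `seed` argument that defaults to `NULL` and, when supplied, is set inside the function body. The function name, arguments, and baseline logic here are hypothetical, not the package's actual API.

```r
# Hypothetical sketch of a seeded baseline predictor (not the package's real
# function): draws simulated trajectories around the last observed value.
predict_baseline <- function(obs, n_sims = 100, seed = NULL) {
  # Setting the seed inside the function guarantees reproducibility for this
  # call regardless of any RNG use between the caller's set.seed() and here.
  if (!is.null(seed)) set.seed(seed)
  last <- obs[length(obs)]
  # Hypothetical noise model: normal noise scaled by observed step-to-step sd.
  last + rnorm(n_sims, mean = 0, sd = sd(diff(obs)))
}
```

With this pattern, two calls with the same `seed` return identical results, which is the behavior the argument was added to guarantee.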
num_locs <- length(unique(target_ts[["location"]]))
if (num_locs != 1) {
  cli::cli_abort("{.arg target_ts} contains {.val {num_locs}} locations, but only one may be provided.")
}
I believe that this is the only place in this function where the `location` column is used. It seems like this limits the flexibility of the function: the input data must encode the location in a column named `location`, and the validation will not check for other types of disaggregation, e.g. by age group.

As a proposal, what if we got rid of the requirement of a column named `location` and of this check that there is only a single location, and replaced them with a check that there are no duplicated values in the `time_index` column? This is not exactly logically equivalent, but in most cases a data set with more than one location will also contain duplicates in the time index.

Another idea, heading in the other direction, would be to check that there is only one value in any column not named `time_index` or `observation`. That would mean we could still do this check, but it would apply generally and would not rely on this specific column name.

I'm open to other ideas too, or to keeping this as is for now and filing an issue to address it later.
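Both proposed alternatives can be sketched together. The function below is an illustration, not the package's code: it uses base `stop()` in place of `cli::cli_abort()`, and the column names `time_index` and `observation` are assumptions carried over from the discussion above.

```r
# Sketch of the two proposed validation strategies (hypothetical helper name).
validate_target_ts <- function(target_ts) {
  # Proposal 1: no duplicated values in the time_index column. A data set
  # covering more than one location will usually trip this check.
  if (anyDuplicated(target_ts[["time_index"]]) > 0) {
    stop("`target_ts` contains duplicated `time_index` values.")
  }
  # Proposal 2: every column other than time_index and observation must hold
  # a single value, so the check generalizes beyond a column named "location".
  other_cols <- setdiff(names(target_ts), c("time_index", "observation"))
  for (col in other_cols) {
    if (length(unique(target_ts[[col]])) > 1) {
      stop(sprintf("Column `%s` must contain a single unique value.", col))
    }
  }
  invisible(target_ts)
}
```

Note the two checks are complementary: the first catches duplicated time points directly, while the second catches any grouping column with multiple values even when the time index happens not to repeat.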
I wrote the code like this based on the hubverse framework, in which target data in a time series format always has the three columns `time_index`, `location`, and `observation`; the validation function for target time series data makes the same assumption. I don't know whether this framework has changed for the hubverse, or whether we'd prefer to do something different for this package, but I think we should discuss and then decide at the package level, rather than just for this function in particular.
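For concreteness, a minimal example of target time series data in the three-column shape described above. The values are invented; only the column names and types follow the convention under discussion.

```r
# Illustrative target_ts in the assumed hubverse three-column format:
# one row per time point for a single location.
target_ts <- data.frame(
  time_index  = as.Date(c("2023-01-07", "2023-01-14")),
  location    = c("US", "US"),
  observation = c(1200, 1340)
)
```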
One other request: could we add the GitHub Action to run unit tests on pull requests to this repo? To avoid adding more stuff to this PR, it may be good to add that in a separate PR we could merge first.
Co-authored-by: Evan Ray <[email protected]>