functional simulation tests to enable external distribution classes #1105
Though it's not entirely clear from the documentation, I've tested it and this works: https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.rv_discrete.html
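For concreteness, here is a minimal sketch of how a discrete shock distribution could be built with that class. The support points and probabilities are placeholders, not values from any HARK calibration; the documentation nominally describes integer support points, and whether non-integer ones behave correctly for every method may depend on the SciPy version, which appears to be what the comment above refers to.

```python
# Minimal sketch of scipy.stats.rv_discrete; the support points and
# probabilities below are placeholders, not taken from any HARK model.
import numpy as np
from scipy.stats import rv_discrete

xk = np.array([0.9, 1.0, 1.1])    # non-integer support points (e.g. income shocks)
pk = np.array([0.25, 0.5, 0.25])  # probabilities; must sum to 1

shock_dist = rv_discrete(name="income_shock", values=(xk, pk))

draws = shock_dist.rvs(size=10_000, random_state=0)  # simulated draws
print(shock_dist.mean(), draws.mean())               # population vs. sample mean
```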
I've spent a little time in the last couple of days looking into this and learned several things that are relevant for our purposes. A key one, which illustrates why I want to outsource our construction of distributions as much as possible, is the recent announcement of this toolkit, which provides tools for constructing distributions adapted to very fast simulation with TensorFlow Markov Chain Monte Carlo tools. It would be entirely unsurprising if someone developed a corresponding tool for PyTorch, or other generic tools for doing these kinds of simulations. Ideally, we could construct an architecture that would make it reasonably straightforward for people to choose among such tools. Another interesting new development I've just come across is a new toolkit that is laser-focused on describing Bellman problems, which we should take a very close look at as we think about our future direction.
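As a purely illustrative sketch of the kind of architecture described here (none of these class or method names exist in HARK), the idea is a thin wrapper that exposes only what simulation code needs, so the library that constructs the distribution (SciPy, TensorFlow Probability, PyTorch, ...) can be swapped without touching the simulation itself:

```python
# Hypothetical sketch of a backend-agnostic distribution wrapper; these names
# do not exist in HARK and only illustrate isolating simulation code from the
# library that constructs the distribution.
from typing import Optional, Protocol

import numpy as np
from scipy.stats import norm


class Sampler(Protocol):
    """The only surface the simulation code relies on."""

    def draw(self, n: int, seed: Optional[int] = None) -> np.ndarray: ...


class ScipyLognormal:
    """One possible backend; a TensorFlow Probability or PyTorch version
    could satisfy the same Protocol without touching simulation code."""

    def __init__(self, mu: float, sigma: float):
        self._normal = norm(loc=mu, scale=sigma)

    def draw(self, n: int, seed: Optional[int] = None) -> np.ndarray:
        return np.exp(self._normal.rvs(size=n, random_state=seed))


def mean_simulated_shock(sampler: Sampler, n: int = 10_000) -> float:
    # Simulation code only ever sees the Sampler interface.
    return float(sampler.draw(n, seed=0).mean())


print(mean_simulated_shock(ScipyLognormal(mu=0.0, sigma=0.1)))
```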
An example of the kind of test that needs to be changed: And an example of the kind of test it should be changed into:
I have no objection to ADDITIONAL tests like https://github.com/econ-ark/HARK/blob/master/HARK/ConsumptionSaving/tests/test_ConsPortfolioFrameModel.py#L153-L156. And if the reason you think the first test should be eliminated is that it tests the outcome for a specific agent (at index 1), I'm on board. Instead, what we should be testing is things like whether the average level of assets in simulations matches some raw number like 0.184 (or whatever the correct answer is). Any time a change in the HARK code causes our simulations to produce significantly different quantitative answers, we need to be alerted to that: we know that our current code produces the right answers, so any substantial change in those answers means the HARK change introduced a conceptual error somewhere (even if accounting identities remain satisfied).
Yes, the simulation tests I'm referring to are those that test a specific value for a specific agent. We also have tests for the solution objects (e.g., that the consumption function, for some amount of market resources, equals 0.184), which should not break with a change in how distributions are sampled, so these can go untouched. I don't recall whether we have any tests for the approximate equality of a simulated population mean to a fixed number. I would be wary of such tests, because if the only change were to distribution sampling, the tolerance of the approximate equality would need to be calibrated somehow to the confidence interval of the result to be effective.
Any such tests should be low-precision ones. If an aggregate variable changed from 3.14 to 3.15 or 3.13, we would not worry. But if it changed to 6.8 or something, that would be a clear sign of a bug.
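A minimal sketch of what such a low-precision test might look like; the target value 3.14, the tolerance, and the stub simulation are placeholders rather than anything taken from HARK:

```python
# Hypothetical low-precision aggregate test; the target 3.14, the tolerance,
# and the stub simulation are placeholders, not actual HARK numbers.
import numpy as np


def run_simulation(seed: int = 0) -> np.ndarray:
    """Stand-in for an actual HARK simulation; returns fake asset levels."""
    rng = np.random.default_rng(seed)
    return 3.14 + 0.1 * rng.standard_normal(10_000)


def test_mean_assets_is_roughly_stable():
    mean_assets = run_simulation().mean()
    # Loose tolerance: drifting from 3.14 to 3.15 or 3.13 still passes,
    # but jumping to 6.8 clearly fails.
    assert np.isclose(mean_assets, 3.14, rtol=0.05)


test_mean_assets_is_roughly_stable()
```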
As of the meeting today, we decided:
I'm taking responsibility for this issue, but am a little stuck. One thing we could do is remove any tests for particular values that come out of a simulation's results. In many cases, this would leave simulations entirely untested except to show that they execute without error. I wonder whether anything else comes to mind as a good simulation test.
Continuing from this discussion:
#1104 (comment)
I believe that @llorracc is keen on replacing as much of the distribution modeling functionality in HARK as possible with an external library, as soon as possible.
A major obstacle to doing this is that the tests for model simulations currently mostly check for near equality with specific numerical values, as opposed to testing for a functional relationship between a model's state value and prior relevant state values.
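As one hedged illustration of such a functional test (the helper below is hypothetical, not existing HARK code), a simulation could be checked against an accounting identity that must hold period by period, for example that end-of-period assets equal market resources minus consumption, which keeps passing no matter how the shocks were drawn:

```python
# Hypothetical functional-relationship test: instead of pinning a value for
# one agent, check the identity a_t = m_t - c_t across the whole simulated
# history, so the test survives changes in how distributions are sampled.
import numpy as np


def budget_identity_holds(m_hist: np.ndarray, c_hist: np.ndarray,
                          a_hist: np.ndarray, tol: float = 1e-8) -> bool:
    """m_hist, c_hist, a_hist are (T, N) arrays of simulated histories."""
    return bool(np.all(np.abs(a_hist - (m_hist - c_hist)) < tol))


# Fake histories standing in for the arrays a simulation would record.
rng = np.random.default_rng(0)
m = 1.0 + rng.random((5, 100))
c = 0.9 * m
a = m - c
assert budget_identity_holds(m, c, a)
```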
Fixing this is just a matter of labor time.
This would make it much easier to work on #611 and #949.