Generic single-agent MDP classes #491

Closed
sbenthall opened this issue Feb 5, 2020 · 1 comment
Labels
Expertise: Prob and Stats Needs ready familiarity with advanced undergraduate multivariate probability and statistics. Function: Simulation
sbenthall commented Feb 5, 2020

As a direction for 2.0 (see planning document), HARK could make its core logic more generic, separating the economic modeling metaphors from the software for modeling and solving classes of problems.

My understanding is that the following kinds of representations would be useful for solving the range of problems currently in HARK:

  • (A) Mixed Continuous/Discrete MDP
  • (B) Discrete MDP
  • (C) Bellman form of Discrete MDP

In addition to these, there are cases for multi-agent problems with market equilibrium assumptions; I'll leave these out of scope of this issue for now.

One way to approach this is to support the following functionality:

  • Specify a problem of type (A)
  • Convert it via discretization into a problem of type (B)
  • Solve problems of type (B) with generic dynamic programming (DP) and reinforcement learning (RL) algorithms
  • Assist in converting a type (B) problem into a type (C) problem, opening up a different range of algorithms
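
As a rough illustration of the first three steps, here is a minimal sketch, not HARK code. The names `discretize` and `value_iteration`, the toy problem, and the assumption of deterministic dynamics snapped to the nearest grid point are all hypothetical choices made for the example.

```python
import numpy as np

def discretize(transition, reward, state_grid, action_grid):
    """Turn a continuous-state problem (type A) into a tabular one (type B).

    Assumes deterministic dynamics; each next state is snapped to the
    nearest grid point. P[i, j] is the next-state index, R[i, j] the reward.
    """
    n_s, n_a = len(state_grid), len(action_grid)
    P = np.zeros((n_s, n_a), dtype=int)
    R = np.zeros((n_s, n_a))
    for i, s in enumerate(state_grid):
        for j, a in enumerate(action_grid):
            s_next = transition(s, a)
            P[i, j] = np.abs(state_grid - s_next).argmin()  # nearest grid point
            R[i, j] = reward(s, a)
    return P, R

def value_iteration(P, R, beta=0.9, tol=1e-8, max_iter=10_000):
    """Generic tabular value iteration on the discretized problem."""
    V = np.zeros(P.shape[0])
    for _ in range(max_iter):
        V_new = (R + beta * V[P]).max(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    policy = (R + beta * V[P]).argmax(axis=1)
    return V, policy

# Hypothetical toy problem: move along [0, 1], rewarded for being near 0.5.
state_grid = np.linspace(0.0, 1.0, 11)
action_grid = np.array([-0.1, 0.0, 0.1])
transition = lambda s, a: np.clip(s + a, 0.0, 1.0)
reward = lambda s, a: -(s - 0.5) ** 2

P, R = discretize(transition, reward, state_grid, action_grid)
V, policy = value_iteration(P, R)
```

Once the problem is in the tabular form `(P, R)`, the solver no longer needs to know anything about the original continuous problem, which is the separation this issue is after.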

This differs from the current approach in HARK and Dolo, which interpolate between the discrete values rather than solving a fully discretized problem.
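
For contrast, the interpolation-based approach can be sketched as follows. This is an illustrative assumption about the general technique, not the actual HARK or Dolo implementation: the value function is stored on a grid, but next states may fall between grid points and the continuation value is evaluated there by linear interpolation. The function name and the toy problem are hypothetical.

```python
import numpy as np

def interpolated_value_iteration(transition, reward, state_grid, action_grid,
                                 beta=0.9, tol=1e-8, max_iter=10_000):
    """Value iteration where off-grid next states are handled by
    linear interpolation of V instead of snapping to a grid point."""
    # S[i, j] is the i-th grid state, A[i, j] the j-th action.
    S, A = np.meshgrid(state_grid, action_grid, indexing="ij")
    S_next = transition(S, A)  # may lie between grid points
    R = reward(S, A)

    def q_values(V):
        cont = np.interp(S_next.ravel(), state_grid, V).reshape(S_next.shape)
        return R + beta * cont

    V = np.zeros(len(state_grid))
    for _ in range(max_iter):
        V_new = q_values(V).max(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    return V, q_values(V).argmax(axis=1)

# Hypothetical toy problem: steps of 0.05 land halfway between grid points.
state_grid = np.linspace(0.0, 1.0, 11)
action_grid = np.array([-0.05, 0.0, 0.05])
transition = lambda s, a: np.clip(s + a, 0.0, 1.0)
reward = lambda s, a: -(s - 0.5) ** 2

V_interp, policy_interp = interpolated_value_iteration(
    transition, reward, state_grid, action_grid
)
```

Here the state space is never converted into a finite transition table; the grid only serves as the set of nodes for approximating the value function.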

@sbenthall sbenthall added the Expertise: Prob and Stats Needs ready familiarity with advanced undergraduate multivariate probability and statistics. label Feb 6, 2020
@sbenthall sbenthall self-assigned this Feb 6, 2020
@sbenthall

Interpolation is a key distinguishing feature of what's being done in HARK.
