Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

multivariate API and dependency on nalgebra #225

Open
YeungOnion opened this issue Apr 28, 2024 · 3 comments
Open

multivariate API and dependency on nalgebra #225

YeungOnion opened this issue Apr 28, 2024 · 3 comments

Comments

@YeungOnion
Copy link
Contributor

We support multivariate distributions and nalgebra is part of our API. We'll be merging #209 and this will be a breaking change. The new methods clearly rely on nalgebra, but it might be more implicit with other methods. I think we can either make it clear that args and outputs for/from these distributions will need the dependency or find some way to opt out of using the same or any nalgebra.

Options I can think (in order of what I think is increasing complexity)

  • clearly state the dependency in multivariate documentation
  • non-default feature for multivariate, would also generally address not compiling a linear algebra library
  • write API using standard types, this may not be possible without removing the benefits of Make MultivariateNormal generic over dimension #209
  • write API to support different numeric algebra backends, nalgebra is broadly used, but there are others

See #199 #209

@FreezyLemon
Copy link
Contributor

non-default feature for multivariate, would also generally address not compiling a linear algebra library

FYI: Dirichlet currently depends on nalgebra::DVector and Multinomial has nalgebra types in its public API (e.g. Multinomial::mean). So those would need to be addressed too.

One more issue I'd like to raise is one of developer ergonomics. There are a few functions which take nalgebra types as parameters (e.g. new_from_nalgebra), but library users cannot easily use these without both

  • adding nalgebra as a dependency to their own project and
  • figuring out any nalgebra version mismatches on their own, or asking for a version bump in statrs

It might make sense to reexport some nalgebra types from statrs to make this less of a problem.

write API to support different numeric algebra backends, nalgebra is broadly used, but there are others

If it's just for the public API, this could be done with extension traits, hidden behind a feature flag or maybe in a different crate.

@YeungOnion
Copy link
Contributor Author

It might make sense to reexport some nalgebra types from statrs to make this less of a problem.

This doesn't sound like a heavy lift, but I don't know how much we'd need to reexport. Would be good to look into.
A question I have: would that mean the syntax in user code would look like this?

use statrs::distribution::MultivariateNormal;
use statrs::appropriate_module_name::{...}; // whichever may be needed
use nalgebra as na;

let mvn = MultivariateNormal::new_from_nalgebra(statrs::appropriate_module_name::vec!(_), statrs::appropriate_module_name::matrix!(_));

Do you have ideas for what you'd choose for traits we'd create to connect to a general linear algebra library type? If not, I think we start small and see what mileage we get with re-exporting. Hopefully most users (and us) will be using an updated nalgebra.

@FreezyLemon
Copy link
Contributor

This doesn't sound like a heavy lift, but I don't know how much we'd need to reexport. Would be good to look into.

Well, it could be a minimal reexport (vector/matrix types plus maybe convenience macros) or just a re-export of the entire nalgebra library. This depends on the other API decisions too IMHO.

A question I have: would that mean the syntax in user code would look like this?
...

Generally, yes. I don't think there's a "right" way of doing these re-exports, but I could think of

use statrs::distribution::MultivariateNormal;
use statrs::nalgebra::{DMatrix, dvector}; // or whatever type it may be in the end

let mvn = MultivariateNormal::new_from_nalgebra(dvector!(...), DMatrix::from_column_slice(...));

or (if you're already importing types from another nalgebra)

use statrs::distribution::MultivariateNormal;
use statrs::nalgebra as statrs_na;

let mvn = MultivariateNormal::new_from_nalgebra(statrs_na::dvector!(...), statrs_na::DMatrix::from_column_slice(...));

Do you have ideas for what you'd choose for traits we'd create to connect to a general linear algebra library type?

The way I've seen this in the wild is often just a 1:1 extension trait per type. And in this case, I guess one trait per type per linalg library. So roughly:

// mod nalgebra_ext
pub trait MultivariateNormalNalgebraExt {
  pub fn new_from_nalgebra(...) -> MultivariateNormal;
  // ...
}

// ...
// mod ndarray_ext
pub trait MultivariateNormalNdarrayExt {
  pub fn new_from_ndarray(...) -> MultivariateNormal;
  // ...
}

// ...

and then their respective implementations. With some structuring and/or a prelude, this could allow users to

use statrs::ndarray_ext::*;
use statrs::distribution::MultivariateNormal;

MultivariateNormal::new_from_ndarray(...);

This would still mean that there's a dependency on nalgebra/etc. at some point, but I'm not sure I see a better way at the moment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants