From db53f2d00922c2cc0bcade95c75db88b1c9cf6ad Mon Sep 17 00:00:00 2001 From: vfisikop Date: Tue, 12 Mar 2024 17:45:22 +0200 Subject: [PATCH] Add JOSS paper --- joss_paper/paper.md | 72 +++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 72 insertions(+) diff --git a/joss_paper/paper.md b/joss_paper/paper.md index 06b30ef4a..f618100b0 100644 --- a/joss_paper/paper.md +++ b/joss_paper/paper.md @@ -1,10 +1,18 @@ --- +<<<<<<< HEAD title: 'volesti: C++ library for sampling and volume computation on convex bodies' +======= +title: 'volesti: A C++ library for sampling and volume computation on convex bodies' +>>>>>>> 7a9d674 (Add JOSS paper) tags: - C++ - geometry - randomization +<<<<<<< HEAD - Monte-Carlo methods +======= + - Monte Carlo methods +>>>>>>> 7a9d674 (Add JOSS paper) - convexity authors: - name: Apostolos Chalkis @@ -26,7 +34,11 @@ affiliations: index: 1 - name: Quantagonia index: 2 +<<<<<<< HEAD - name: Inria Paris and IMJ-PRG, Sorbonne Universit\`e +======= + - name: Inria Paris and IMJ-PRG, Sorbonne Universit\'e +>>>>>>> 7a9d674 (Add JOSS paper) index: 3 - name: GeomScale index: 4 @@ -44,6 +56,7 @@ bodies are fundamental operations that appear in optimization, finance, engineering, artificial intelligence, and machine learning. We present `volesti`, a C++ library that delivers efficient implementations of state-of-the-art, mainly randomized, algorithms to sample from general logconcave distributions. +<<<<<<< HEAD Based on these routines can estimate the volume of convex bodies in high dimensions, round them and also compute multidimensional integrals over them. The backbone of our library consists of Monte-Carlo algorithms, @@ -52,18 +65,34 @@ high-dimensional statistical tests to certify and verify the output. The focus of `volesti`' is scalability in high dimensions, that, depending on the problem at hand, could be in the order of hundreds or thousands dimension. +======= +Based on these routines, we can estimate the volume of convex bodies in high dimensions, +round them, and compute multidimensional integrals over them. +The backbone of our library consists of Monte Carlo algorithms, +which are randomized algorithms, the output of which can be incorrect with (usually very small) error probability; thus, we also provide several +high-dimensional statistical tests to certify and verify the output. + +The focus of `volesti`' is scalability in high dimensions, +that, depending on the problem at hand, could range from hundreds to thousands of dimensions. +>>>>>>> 7a9d674 (Add JOSS paper) Another novelty is the ability to handle a variety of different inputs for the constrained support of the various distributions. `volesti` supports three different types of polyhedra [@Ziegler:1995], spectrahedra [@Ramana:1999] and general non-linear convex objects. +<<<<<<< HEAD `volesti` relies on `Eigen` library [@eigen] for linear algebra but also support `MKL` optimizations [@mkl]. There are R [@Chalkis:2021] and Python [@Chalkis_dingo:2023] interfaces available; alas not all C++ functionality is available in through these interfaces. +======= +`volesti` relies on `Eigen` library [@eigen] for linear algebra but also supports `MKL` optimizations [@mkl]. +There are R [@Chalkis:2021] and Python [@Chalkis_dingo:2023] interfaces available. +>>>>>>> 7a9d674 (Add JOSS paper) # Statement of need High-dimensional sampling from multivariate distributions with Markov Chain Monte Carlo (MCMC) +<<<<<<< HEAD algorithms is a fundamental problem with many applications in the whole spectrum of science and engineering [@Iyengar:1988; @Somerville:1998; @Genz:2009; @Schellenberger:2009]. In particular, multivariate integration over a convex set @@ -117,5 +146,48 @@ but also to more theoretical and mathematical challenges, like the computation o We would like to thank the contributors to the `volesti` library for their valuable contributions and feedback. +======= +algorithms is a fundamental problem with many applications in science and engineering [@Iyengar:1988; @Somerville:1998; @Genz:2009; @Schellenberger:2009]. +In particular, multivariate integration over a convex set as well as the volume approximation of convex sets have garnered significant attention from theorists and engineers over the last decades. +Nevertheless, these problems are computationally hard for general dimensions [@Dyer:1988]. +MCMC algorithms made remarkable progress and their use allowed us to efficiently tackle the problems of sampling and volume estimation of convex bodies in theory, +by the introduction of (rigorous) theoretical guarantees [@Chen:2018; @Lee:2018; +@Mangoubi:2019]. +Unfortunately, these theoretical guarantees of the MCMC algorithms +do not extend in a straightforward manner to efficient implementations able to attack problems coming from real-life computations. +Therefore, we witnessed the birth of efficient in practice MCMC algorithm +that relax the theoretical guarantees and employ new algorithmic and statistical techniques +to be amenable to efficient implementations. +Remarkably, these algorithms, and the corresponding implementations, +also meet the requirements for high accuracy results +[@Emiris:2014; @Cousins:2015; @Chalkis_volume:2023; @Kook:2022]; +however several existing published methods are available as part of propertiary packages (MATLAB) [@Cousins:2015; @Kook:2022]. + +Our open-source package -- volesti -- offers all of the aforementioned functionality, together with the support of sampling from general log-concave densities @Chalkis_hmc:2023, and uniform sampling from spectrahedra @Chalkis_spectra:2022. + +Our implementation supports: +1. support various sampling techniques based on geometric walks, roughly speaking these are a continuous version of MCMC algorithms, such as Billard walk, Hamiltonian walk and others, +2. give the user the ability to sample from various distributions, like uniform, exponential, Gaussian, and general log-concave densities, +3. allows to consider the distributions constrained in various convex domains, such as hypercubes, zonotopes, general polytopes (defined either as a set of linear inequalities or as a convex hull of a pointset), spectrahedra (feasible sets of semidefinite programs), and, +4. can perform volume computations, integration, and solve problems from real life applications in very high dimensions. + +# Impact + +`volesti` has been used extensively in various research and engineering projects coauthored by the authors of this paper. +In particular, for the problem of sampling the flux space of metabolic networks +we were able to sample from the most complicated human metabolic network accessible today, Recon3D [@cftz-socg021], +used to model financial crises [@ccef-crises-j], +to detect low volatility anomalies in stock markets [@bcft-aistats-23], + to introduce randomized control in asset pricing and portfolio performance evaluation [@bcft-arxiv-24]), but also to sample from (and compute the volume of) spectrahedra [@Chalkis_spectra:2022], the feasible regions of semidefinite programs. + +Even more, `volesti` has been used by other research teams in conducting research in electric power systems [@Venzke:2019], for problems in probabilistic inference [@Spallitta:2024], +to perform resource analysis on programs [@pham-phd-2024]; +but also for more theoretical and mathematical challenges, like the computation of topological invariants [@co-alenex-2021], and persistent homology [@vm-fods-2022]. + +# Acknowledgements + +We would like to thank the contributors to the `volesti` library for their valuable contributions and feedback. +MP was partially supported by a Cornell University Fellowship, a grant from the A.G. Leventis Foundation, a grant from the Gerondelis Foundation, and a LinkedIn Ph.D. Fellowship. +>>>>>>> 7a9d674 (Add JOSS paper) # References