Merge pull request #6 from dfm/joss-edits

JOSS edits to paper
Viva-Lambda · Oct 30, 2021 · 3b4c808
2 parents 7904df0 + 102835d
commit 3b4c808
Showing 1 changed file with 57 additions and 66 deletions.
diff --git a/paper/paper.md b/paper/paper.md
@@ -10,80 +10,72 @@ authors:
   affiliation: 1
   orcid: 0000-0002-1552-8938
 affiliations:
-- name: EPHE, PSL
+- name: École Pratique des Hautes Études, Université PSL, Paris, France
   index: 1
 date: 02 February 2021
 bibliography: paper.bib
 ---
 
 # Summary
 
-Probabilistic Graphical Models (PGMs) are a marriage between Graphs as in
-Graph Theory, and Probability as in statistics and probability theory and are
-widely used in many fields. We noticed that most existing PGM
-libraries implement PGMs in a way that ignores their graphical nature.
+Probabilistic Graphical Models (PGMs) are a marriage between "graphs" from
+graph theory and "probability" from statistics and probability theory. While PGMs are
+widely used in many fields, we noticed that most existing PGM
+libraries are implemented in a way that doesn't take full advantage of their graphical nature.
 `PyGModels`' value proposition is that it faithfully implements the graphical
 nature of PGMs, thereby giving `PyGModels`' instantiated objects both
 graph-theoretical and statistical properties, which allows users to explore
-and test inference algorithms that are rooted in both graph theory or
-statistics. `PyGModels` also implements several algorithms of interest on a
-LWF chain graphs, also known as mixed graphs.
+and test inference algorithms that are rooted in graph theory or
+statistics. `PyGModels` also implements several algorithms of interest on Lauritzen-Wermuth-Frydenberg
+(LWF) chain graphs, also known as mixed graphs.
 
 # Statement of Need
 
-Though the students of computer science or statistics might find a pedagogical
+Though the students of computer science or statistics might find pedagogical
 value going through source code along with a textbook on probabilistic
-graphical models (something like Sucar [see @Sucar_2015] or Cowell [see
-@Cowell_2005] or Koller and Friedman [see @Koller_Friedman_2009]), we believe
-that the value proposition of `PyGModels` speaks mostly to researchers.
-Let us try to demonstrate the need for `PyGModels` by a use case. 
-
-One has a set of categorical random variables in the form of a function
-specified by a probability distribution. One has a set of edges that encode a
-certain independence assumption over her random variables, and one has a set
-of factors, that factorizes a certain probability distribution over her entire
-graph. Given these, `PyGModels` might solve two major issues for the
-researcher or the student alike:
-
-1. Compute posterior probability distribution or most probable explanation
-  given certain evidence.
-
-2. Provide a basis for creating new algorithms of inference.
+graphical models [for example @Cowell_2005; @Koller_Friedman_2009; @Sucar_2015],
+`PyGModels` is mainly targeted at researchers.
+Let us demonstrate the need for `PyGModels` with a use case.
+Given a set of categorical random variables in the form of a function
+specified by a probability distribution, a set of edges that encode a
+certain independence assumption over these random variables, and a set
+of factors that factorizes this probability distribution over the
+graph, `PyGModels` is designed with the following use cases in mind:
+
+1. computation of the posterior probability distribution or most probable explanation
+  conditioned on evidence, and
+
+2. the development of new inference algorithms.
+
+The real forte of `PyGModels` is its support for implementing new algorithms
+due to its lightweight nature and its direct implementation of
+statistical and graph theoretic features in the same base class.
+We mostly follow @Koller_Friedman_2009 for statistical conventions, definitions,
+and inference algorithms. For graph theoretic conventions, we follow
+@Diestel_2017, with algorithms from @Erciyes_2018 and @Even_Guy_Even_2012.
+Throughout the code, exact pages for algorithmic references are cited in the
+docstrings for relevant functions.
 
 If the independence assumptions over the random variables require the graph
 to be an LWF chain graph where the graph can have both directed and undirected
-edges, `PyGModels` can also solve:
-
-3. Decomposing the chain graph into chain components
-
-4. Moralizing the chain graph into a Markov Network.
-
-5. Decomposing the chain graph into Conditional Random Fields.
-
-Though the first issue is not irrelevant, the forte of `PyGModels` is the
-second issue due to its lightweight nature and its direct embodiment of
-statistical (we follow mostly Koller and Friedman [see @Koller_Friedman_2009]
-for statistical conventions and definitions and inference algorithms) and
-graph theoretic (we follow mostly Diestel [see @Diestel_2017] for graph
-theoretic conventions and definitions; most of the graph algorithms come
-from K. Erciyes [see @Erciyes_2018] and S. Even [see @Even_Guy_Even_2012];
-exact pages are cited in doc strings of related functions inside the
-source code) considerations in the same base class. 
+edges, `PyGModels` can also (a) decompose the chain graph into chain components,
+(b) moralize the chain graph into a Markov Network, and (c) decompose the chain
+graph into Conditional Random Fields.
 
-The entire library depends only on python standard library which makes it very
-extendible and easy to integrate and adapt to other projects as well. Through
+The entire library depends only on the Python standard library, which makes it easily
+extensible, and straightforward to integrate or adapt to other projects. Through
 its rigorous adoption of mathematical definitions of involved concepts, it
-becomes feasible to extend arbitrary factors through their point wise product,
+becomes feasible to extend arbitrary factors through their pointwise product,
 or apply common graph analysis algorithms such as finding articulation points
 or bridges, or finding an optimal path defined by a cost function.
 
 # Applications and Similar Works
 
 PGMs are known for their wide range of applications in computer vision,
 information retrieval, disease diagnosis and more recently, in the context of
-our PhD thesis, annotations of ancient documents.
+this author's PhD thesis, annotations of ancient documents.
 
-Other open sourced python libraries about PGMs include the following:
+Other open source Python libraries implementing PGMs include:
 
 - `pyGM` [see @Ihler_2020]
 
@@ -97,40 +89,39 @@ Other open sourced python libraries about PGMs include the following:
 
 - `pomegranate` [see @Schreiber_2018]
 
-The most popular and goto libraries are `pgmpy` and `pomegranate`. Both of
-them have also been used in several publications [see @Ankan_Panda_2015a; also
-@Ankan_Panda_2015b; and @Schreiber_2018]. Their functionalities are covered
+The most popular of these are `pgmpy` and `pomegranate`, both of
+which have been used in several publications [see @Ankan_Panda_2015a;
+@Ankan_Panda_2015b; @Schreiber_2018]. Their functionalities are covered
 with nice test suites as well. Overall both of them are reliable libraries for
 using PGMs in production.
 
 `pyGM` and `pgm` are particularly well organized alternatives to `PyGModels`,
 with `pyGM` being slightly more reliable than `pgm` due to its test suite.
-`pyfac` seems to concerns itself only with inference over factor graphs and
-`pgmPy` seems to be an inactive (last commit dates to 2013) side project
-rather than a dedicated library. We will make a small comparison with `pgmpy`
+`pyfac` is primarily focused on inference over factor graphs and
+`pgmPy`'s development is inactive (last commit dates to 2013).
+We will make a small comparison with `pgmpy`;
 most of our remarks hold for other alternatives as well.
 
 `PyGModels` distinguishes itself from `pgmpy` by its lightweight nature (`PyGModels`
-depends only on python 3.6 standard library). Our test suit cites its source
-for most of the compared values inside doc string of functions for key
-functions like inference over graphs. Factors are specified by a set of
-random variables and a function whose domain is the cartesian product of
+depends only on the Python 3.6 standard library). Our test suite cites its source
+for most of the expected values in the function docstrings.
+Factors are specified by a set of
+random variables and a function whose domain is the Cartesian product of
 codomains of random variables. In all of the libraries above, a factor is
 specified through an array of values. This has no direct implications on the
 output. However, it has implications on the evaluation order of operations.
 Our implementation is lazier and it conforms to the definition provided by
-Koller and Friedman [see @Koller_Friedman_2009 p. 106-107]. The last aspect is
+Koller & Friedman [see @Koller_Friedman_2009 p. 106-107]. The last aspect is
 also the case for other packages, however `PyGModels` differs from them with
 respect to the data structure used in the implementation.
 
-The last aspect we deem important, is our capacity of doing inference on LWF
-chain graphs (its theoretical foundations are best explained by S. Lauritzen
-[see @Lauritzen_1996], the same author also provided its causal interpretation
-in a long article clearing out misconceptions [see
-@Lauritzen_Richardson_2002]; inference strategies over chain graphs are best
-exposed by R. Cowell [see @Cowell_2005]; and more recently by R. Dechter [see
-@Dechter_2019]), also known as mixed models or partially directed acyclic
-graphs [see @Koller_Friedman_2009 p. 37]. Our library shows that once we have
+Another key feature is `PyGModels`' support of inference on LWF
+chain graphs. The theoretical foundations of these graphs are best explained by
+@Lauritzen_1996, and their causal interpretation and common misconceptions are
+discussed by @Lauritzen_Richardson_2002. Inference strategies over chain graphs are best
+exposed by @Cowell_2005, and more recently by @Dechter_2019. These are also known
+as mixed models or partially directed acyclic graphs [see @Koller_Friedman_2009 p. 37].
+With `PyGModels`, once we have
 the necessary set of factors, we can simply do inference over chain graphs
 just as we do over other PGMs like Bayesian Networks and Markov Random Fields.
 We implement several algorithms of interest for chain graphs such as