Merge pull request #115 from NiklasMelton/create-joss-paper

Create joss paper
NiklasMelton · Oct 22, 2024 · c2a4ea5 · c2a4ea5
2 parents 4478451 + 6b68b17
commit c2a4ea5
Show file tree

Hide file tree

Showing 5 changed files with 463 additions and 25 deletions.
diff --git a/.github/workflows/draft-pdf.yml b/.github/workflows/draft-pdf.yml
@@ -0,0 +1,24 @@
+name: Draft PDF
+on: [push]
+
+jobs:
+  paper:
+    runs-on: ubuntu-latest
+    name: Paper Draft
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+      - name: Build draft PDF
+        uses: openjournals/openjournals-draft-action@master
+        with:
+          journal: joss
+          # This should be the path to the paper within your repo.
+          paper-path: paper.md
+      - name: Upload
+        uses: actions/upload-artifact@v4
+        with:
+          name: paper
+          # This is the output path where Pandoc will write the compiled
+          # PDF. Note, this should be the same directory as the input
+          # paper.md
+          path: paper.pdf
diff --git a/artlib/biclustering/BARTMAP.py b/artlib/biclustering/BARTMAP.py
@@ -1,16 +1,13 @@
-"""BARTMAP.
-
-Xu, R., & Wunsch II, D. C. (2011).
-BARTMAP: A viable structure for biclustering.
-Neural Networks, 24, 709–716. doi:10.1016/j.neunet.2011.03.020.
-
-Xu, R., Wunsch II, D. C., & Kim, S. (2012).
-Methods and systems for biclustering algorithm.
-U.S. Patent 9,043,326 Filed January 28, 2012,
-claiming priority to Provisional U.S. Patent Application,
-January 28, 2011, issued May 26, 2015.
-
-"""
+"""BARTMAP :cite:`xu2011bartmap`, :cite:`xu2012biclustering`."""
+# Xu, R., & Wunsch II, D. C. (2011).
+# BARTMAP: A viable structure for biclustering.
+# Neural Networks, 24, 709–716. doi:10.1016/j.neunet.2011.03.020.
+#
+# Xu, R., Wunsch II, D. C., & Kim, S. (2012).
+# Methods and systems for biclustering algorithm.
+# U.S. Patent 9,043,326 Filed January 28, 2012,
+# claiming priority to Provisional U.S. Patent Application,
+# January 28, 2011, issued May 26, 2015.
 
 import numpy as np
 from typing import Optional
@@ -25,16 +22,18 @@ class BARTMAP(BaseEstimator, BiclusterMixin):
     """BARTMAP for Biclustering.
 
     This class implements BARTMAP as first published in:
-    Xu, R., & Wunsch II, D. C. (2011).
-    BARTMAP: A viable structure for biclustering.
-    Neural Networks, 24, 709–716. doi:10.1016/j.neunet.2011.03.020.
-
-    BARTMAP accepts two instantiated ART modules `module_a` and `module_b` which
-    cluster the rows (samples) and columns (features) respectively. The features
-    are clustered independently, but the samples are clustered by considering
-    samples already within a row cluster as well as the candidate sample and
-    enforcing a minimum correlation within the subset of features belonging to
-    at least one of the feature clusters.
+    :cite:`xu2011bartmap`.
+
+    .. # Xu, R., & Wunsch II, D. C. (2011).
+    .. # BARTMAP: A viable structure for biclustering.
+    .. # Neural Networks, 24, 709–716. doi:10.1016/j.neunet.2011.03.020.
+
+    BARTMAP accepts two instantiated :class:`~artlib.common.BaseART.BaseART` modules
+    `module_a` and `module_b` which cluster the rows (samples) and columns (features)
+    respectively. The features are clustered independently, but the samples are
+    clustered by considering samples already within a row cluster as well as the
+    candidate sample and enforcing a minimum correlation within the subset of
+    features belonging to at least one of the feature clusters.
 
     """
 

diff --git a/docs/source/references.rst b/docs/source/references.rst
@@ -2,5 +2,4 @@ References
 ==================================
 
 .. bibliography:: ../../references.bib
-   :style: unsrt
-   :all:
+   :style: unsrt
diff --git a/paper.md b/paper.md
@@ -0,0 +1,150 @@
+---
+title: 'Adaptive Resonance Lib: A Python package for Adaptive Resonance Theory (ART) models'
+tags:
+  - Python
+  - clustering
+  - classification
+  - regression
+  - reinforcement learning
+  - machine learning
+authors:
+  - name: Niklas M. Melton
+    corresponding: true
+    orcid: 0000-0001-9625-7086
+    affiliation: 1
+  - name: Dustin Tanksley
+    orcid: 0000-0002-1677-0304
+    affiliation: 1
+  - name: Donald C. Wunsch II
+    orcid: 0000-0002-9726-9051
+    affiliation: 1
+affiliations:
+ - name: Missouri University of Science and Technology, USA
+   index: 1
+   ror: 00scwqd12
+date: 18 October 2024
+bibliography: references.bib
+---
+
+# Summary
+
+The Adaptive Resonance Library (**artlib**) is a Python library that implements a wide
+range of Adaptive Resonance Theory (ART) algorithms. **artlib** currently supports eight
+elementary ART models and 11 compound ART models, including Fuzzy ART
+[@carpenter1991fuzzy], Hypersphere ART [@anagnostopoulos2000hypersphere], Ellipsoid ART
+[@anagnostopoulos2001a; @anagnostopoulos2001b], Gaussian ART
+[@williamson1996gaussian], Bayesian ART [@vigdor2007bayesian], Quadratic Neuron
+ART [@su2001application; @su2005new], ARTMAP [@carpenter1991artmap], Simplified
+ARTMAP [@gotarredona1998adaptive], SMART [@bartfai1994hierarchical], TopoART
+[@tscherepanow2010topoart], Dual Vigilance ART [@da2019dual], CVIART [@da2022icvi],
+BARTMAP [@xu2011bartmap; @xu2012biclustering], Fusion ART [@tan2007intelligence],
+FALCON [@tan2004falcon], and TD-FALCON [@tan2008integrating]. These models can be
+applied to tasks such as unsupervised clustering, supervised classification, regression,
+and reinforcement learning [@da2019survey]. This library provides an extensible and
+modular framework where users can integrate custom models or extend current
+implementations, allowing for experimentation with existing and novel machine learning
+techniques.
+
+In addition to the diverse set of ART models, **artlib** offers implementations of
+visualization methods for various cluster geometries, along with pre-processing
+techniques such as Visual Assessment of Tendency (VAT) [@bezdek2002vat], data
+normalization, and complement coding.
+
+# Statement of Need
+
+The Adaptive Resonance Library (**artlib**) is essential for researchers, developers,
+and educators interested in adaptive neural networks, specifically ART algorithms.
+While deep learning dominates machine learning, ART models offer unique advantages
+in incremental and real-time learning environments due to their ability to learn new
+data without forgetting previously learned information.
+
+Currently, no comprehensive Python library implements a variety of ART models in an
+open-source, modular, and extensible manner. **artlib** fills this gap by offering a
+range of ART implementations that integrate seamlessly with machine learning workflows,
+including scikit-learn's `Pipeline` and `GridSearchCV` [@scikit-learn]. The library is
+designed for ease of use and high performance, leveraging Python's scientific stack
+(NumPy [@harris2020array], SciPy [@2020SciPy-NMeth], and scikit-learn [@scikit-learn])
+for fast numerical computation.
+
+The modular design of **artlib** enables users to create novel compound ART models,
+such as Dual Vigilance Fusion ART [@da2019dual; @tan2007intelligence] or
+Quadratic Neuron SMART [@su2001application; @su2005new; @bartfai1994hierarchical].
+This flexibility offers powerful experimental and time-saving benefits, allowing
+researchers and practitioners to evaluate models on diverse datasets efficiently.
+
+Additionally, the library serves as a valuable educational tool, providing
+well-documented code and familiar APIs to support hands-on experimentation with ART
+models. It is ideal for academic courses or personal projects in artificial
+intelligence and machine learning, making **artlib** a versatile resource.
+
+**artlib** is actively maintained and designed for future extension, allowing users
+to create new ART models, adjust parameters for specific applications, and explore ART's
+potential for novel research problems. Its integration with popular Python libraries
+ensures its adaptability to current machine learning challenges.
+
+# Comparison to Existing Implementations
+
+While there are several open-source repositories that provide
+python implementations of specific ART models [@birkjohann2023artpython;
+@aiopenlab2023art; @dilekman2022artificial; @artpy2022; @dixit2020adaptive;
+@inyoot2021art; @valixandra2021adaptive; @wan2022art2; @ray2023artpy], they lack
+modularity and are limited in scope, often implementing just one or two models. For
+instance, MATLAB-based ART toolboxes [@mathworks_art1s; @mathworks_fuzzyart_fuzzyartmap;
+@mathworks_topoart; @mathworks_art_fuzzyart_artmap] provide implementations of
+Fuzzy ART, TopoART, ART1, and ARTMAP models, but they lack the flexibility and
+modularity required for broader experimentation. The most significant existing ART
+implementation exists in julia and provides just five models
+[@Petrenko_AdaptiveResonance_jl_A_Julia_2022] but, like the previously listed
+MATLAB-based toolboxes, it is not easily accessible to Python-based work flows.
+
+These existing implementations of ART models may provide standalone versions of
+individual models, but they are often not designed to integrate seamlessly with modern
+Python libraries such as scikit-learn, NumPy, and SciPy. As a result, researchers and
+developers working in Python-based environments face challenges when trying to
+incorporate ART models into their machine learning pipelines.
+
+**artlib** addresses these challenges by offering a comprehensive and modular
+collection of ART models, including both elementary and compound ART architectures.
+It is designed for interoperability with popular Python tools, enabling users to easily
+integrate ART models into machine learning workflows, optimize models using
+scikit-learn's `GridSearchCV`, and preprocess data using standard libraries. This
+flexibility and integration make **artlib** a powerful resource for both research and
+practical applications.
+
+# Adaptive Resonance Theory (ART)
+
+ART is a class of neural networks known for solving the stability-plasticity dilemma,
+making it particularly effective for classification, clustering, and incremental
+learning tasks [@grossberg1976a; @grossberg1976a; @Grossberg1980HowDA;
+@grossberg2013adaptive; @da2019survey]. ART models are designed to dynamically learn
+and adapt to new patterns without catastrophic forgetting, making them ideal for
+real-time systems requiring continuous learning.
+
+Over the years, dozens of ART variations have been published [@da2019survey],
+extending the applicability of ART to nearly all learning regimes, including
+reinforcement learning [@tan2004falcon; @tan2008integrating], hierarchical and
+topological clustering [@tscherepanow2010topoart; @bartfai1994hierarchical], and
+biclustering [@xu2011bartmap; @xu2012biclustering]. These numerous models provide an
+ART-based solution for most machine learning use cases. However, the rapid development
+of bespoke models and the difficulty in understanding the core principles of ART
+have resulted in a lack of open-source and approachable implementations of most
+ART variants.
+
+The ability of ART to preserve previously learned patterns while learning new data in
+real-time has made it a powerful tool in domains such as robotics, medical diagnosis,
+and adaptive control systems. **artlib** aims to extend the application of these models
+in modern machine learning pipelines, offering a unique and approachable toolkit for
+leveraging ART's strengths.
+
+# Acknowledgements
+
+This research was supported by the National Science Foundation (NSF) under Award
+Number 2420248. The project titled EAGER: Solving Representation Learning and
+Catastrophic Forgetting with Adaptive Resonance Theory provided essential funding for
+the completion of this work.
+
+We would also like to thank Gail Carpenter and Stephen Grossberg for their
+feedback regarding this project and their immense contribution to machine learning by
+pioneering Adaptive Resonance Theory.
+
+# References