Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use Mamba in install instructions #96

Merged
merged 2 commits into from
Nov 7, 2022
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
67 changes: 49 additions & 18 deletions docs/source/getting_started/install.rst
Original file line number Diff line number Diff line change
@@ -1,31 +1,42 @@
Installation
============

We recommend using `Anaconda <https://www.anaconda.com/download/>`_
to handle installation. Anaconda will manage dependencies and
versioning, which simplifies the process of installation.
We recommend `Mamba <https://mamba.readthedocs.io/en/latest/installation.html>`_
for installation handling. `Conda <https://www.anaconda.com/download/>`_ can be
used as an alternative, but Conda can take a long time to resolve dependencies,
thus rendering installation via Conda
significantly slower than installation via Mamba. Mamba/Conda will
both manage dependencies and versioning, which simplifies the
process of installation.

Install Snekmer
---------------
If you already have Conda but wish to use Mamba for installation,
you can install Mamba by running the following:

Use conda to install an environment from the YML file with Snekmer and
all of its dependencies. (Note: Users may either download the
.. code-block:: bash

conda install -c conda-forge mamba

Install Snekmer via Mamba/Conda
-------------------------------

The simplest method for installation is via the included YML file, which will create
a new environment containing Snekmer and all of its dependencies. Users may either
directly download the
`YML file <https://github.com/PNNL-CompBio/Snekmer/blob/main/environment.yml>`_
directly from the repository, or clone the repository beforehand
using the ``git clone`` command.)
directly, or clone/fork the repository to obtain a local copy of the repository and all
included files.

.. code-block:: bash

conda env create -f environment.yml

mamba env create -f environment.yml

Note that if you want to use the optional Blazing Signature Filter (BSF) to
speed up clustering you must follow the BSF installation instructions below
and then you can use the alternate conda environment.

.. code-block:: bash

conda env create -f environment_BSF.yml
mamba env create -f environment_BSF.yml

After the install completes activate the conda environment

Expand All @@ -35,16 +46,20 @@ After the install completes activate the conda environment

The package should now be ready to use!

Note that the instructions above can be replicated, subsituting ``mamba``
for ``conda``, for users who wish to use Conda to manage installation.

Troubleshooting Notes
`````````````````````

If you are a Windows user and running into conflicts/errors when
creating the conda environment, you may need to install the minimal
version of Snakemake:
The full version of Snakemake is
`incompatible with Windows <https://snakemake.readthedocs.io/en/stable/getting_started/installation.html#full-installation>`_.
Thus, you will need to install the environment specifications that
include only the minimal version of Snakemake:

.. code-block:: bash

conda create -n snekmer -c conda-forge -c bioconda -c numba python>=3.9 biopython matplotlib numpy>=1.22.3 numba>=0.56 scipy pandas seaborn snakemake-minimal==7.0 scikit-learn
mamba env create -f environment_Windows.yml

.. Install Snekmer
.. ---------------
Expand All @@ -67,6 +82,22 @@ version of Snakemake:
.. # option 2: direct install (no repository download required)
.. pip install git+https://github.com/PNNL-CompBio/Snekmer

Install Snekmer via pip
-----------------------

**Warning:** Installation of Snekmer using ``pip`` is not recommended due to the complexity
of dependencies associated with Snakemake. Mamba/Conda will handle these automatically,
whereas ``pip`` will not.

The ``pip`` implementation has not been fully tested, but users may attempt installation
using the included specifications:

.. code-block:: bash

pip install -r requirements.txt
pip install -e git+https://github.com/PNNL-CompBio/Snekmer#egg=snekmer


(optional) Install GCC for BSF
------------------------------

Expand Down Expand Up @@ -117,7 +148,7 @@ Windows or Linux/Unix
`````````````````````

Please refer to the
`BSF documentation <https://github.com/PNNL-Compbio/bsf-jaccard-py#install-gcc-49-or-newers>`_
`BSF documentation <https://github.com/PNNL-CompBio/bsf-jaccard-py#install-gcc-49-or-newers>`_
for Linux/Unix or Windows instructions for installing GCC.

BSF Install for Snekmer Use
Expand All @@ -126,4 +157,4 @@ In the snekmer conda environment use the command

.. code-block:: bash

pip install git+https://github.com/PNNL-Compbio/bsf-jaccard-py#egg=bsf
pip install git+https://github.com/PNNL-CompBio/bsf-jaccard-py#egg=bsf
54 changes: 27 additions & 27 deletions docs/source/getting_started/usage.rst
Original file line number Diff line number Diff line change
Expand Up @@ -62,14 +62,14 @@ The following output directories and files will always be created:
├── input/
│ ├── A.fasta
│ └── B.fasta
── output/
├── kmerize/
│ ├── A.kmers # kmer labels for A
│ └── B.kmers # kmer labels for B
├── vector/
│ ├── A.npz # sequences, sequence IDs, and kmer vectors for A
│ └── B.npz # sequences, sequence IDs, and kmer vectors for B
── ...
── output/
├── kmerize/
│ ├── A.kmers # kmer labels for A
│ └── B.kmers # kmer labels for B
├── vector/
│ ├── A.npz # sequences, sequence IDs, and kmer vectors for A
│ └── B.npz # sequences, sequence IDs, and kmer vectors for B
── ...

Mode-Specific Output Files
--------------------------
Expand Down Expand Up @@ -103,25 +103,25 @@ and directories in addition to the files described previously.
.. code-block:: console

.
── output/
├── ...
├── scoring/
│ ├── A.matrix # Similarity matrix for A seqs
│ ├── B.matrix # Similarity matrix for B seqs
│ ├── A.scorer # Object to apply A scoring model
│ ├── B.scorer # Object to apply B scoring model
│ └── weights/
│ ├── A.csv.gz # Kmer score weights in A kmer space
│ └── B.csv.gz # Kmer score weights in B kmer space
── model/
├── A.model # (A/not A) classification model
├── B.model # (B/not B) classification model
├── results/ # Cross-validation results tables
│ ├── A.csv
│ └── B.csv
└── figures/ # Cross-validation results figures
├── A/
└── B/
── output/
├── ...
├── scoring/
│ ├── A.matrix # Similarity matrix for A seqs
│ ├── B.matrix # Similarity matrix for B seqs
│ ├── A.scorer # Object to apply A scoring model
│ ├── B.scorer # Object to apply B scoring model
│ └── weights/
│ ├── A.csv.gz # Kmer score weights in A kmer space
│ └── B.csv.gz # Kmer score weights in B kmer space
── model/
├── A.model # (A/not A) classification model
├── B.model # (B/not B) classification model
├── results/ # Cross-validation results tables
│ ├── A.csv
│ └── B.csv
└── figures/ # Cross-validation results figures
├── A/
└── B/

Snekmer Search Output Files
:::::::::::::::::::::::::::
Expand Down
1 change: 1 addition & 0 deletions environment.yml
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,7 @@ dependencies:
- seaborn
- scikit-learn
- snakemake == 7.0
- tabulate == 0.8.10
- umap-learn
- hdbscan
- pip
Expand Down
1 change: 1 addition & 0 deletions environment_BSF.yml
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,7 @@ dependencies:
- seaborn
- scikit-learn
- snakemake == 7.0
- tabulate == 0.8.10
- umap-learn
- hdbscan
- pip
Expand Down
1 change: 1 addition & 0 deletions environment_Windows.yml
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,7 @@ dependencies:
- seaborn
- scikit-learn
- snakemake-minimal == 7.0
- tabulate == 0.8.10
- umap-learn
- hdbscan
- pip
Expand Down
25 changes: 13 additions & 12 deletions requirements.txt
Original file line number Diff line number Diff line change
@@ -1,12 +1,13 @@
# biopython
# matplotlib
# numpy >= 1.22.3
# numba >= 0.56
# pandas
# seaborn
# scipy
# scikit-learn
# snakemake == 7.0
# scikit-learn
# umap-learn
# hdbscan
biopython
matplotlib
numpy >= 1.22.3
numba >= 0.56
pandas
seaborn
scipy
scikit-learn
snakemake == 7.0
scikit-learn
tabulate == 0.8.10
umap-learn
hdbscan