Skip to content

Multiple uncounted motif inference by nested sampling of independent component sequence source models

License

Notifications You must be signed in to change notification settings

mmattocks/BioMotifInference.jl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BioMotifInference

Build Status codecov Project Status: Inactive – The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.

BioMotifInference.jl implements nested sampling of ensembles of Independent Component Analysis (ICA) models. These are composed of Position Weight Matrix (PWM) DNA motif signals emitting on a background Hidden Markov Model of genomic noise (BHMM). It is intended for Distributed.jl cluster use, although smaller problems can be solved on one machine. BHMMs can be trained using BioBackgroundModels.jl. BioMotifInference.jl is similar to nMICA, implemented by Down and Hubbard in Java in 2004 (Down, T. A., and Tim J.P. Hubbard. “NestedMICA: Sensitive Inference of over-Represented Motifs in Nucleic Acid Sequence.” Nucleic Acids Research 33, no. 5 (March 8, 2005): 1445–53. https://doi.org/10.1093/nar/gki282. ). It can be, but is not necessarily, in detailed balance; the user specifies the sampling routine from a library of permute functions, and can write new ones, and so is responsible for ensuring that the sampler behaves appropriately for the application.

Installation

Add the github url using the Julia package manager:

julia>]add https://github.com/mmattocks/BioMotifInference.jl

Example use

A synthetic example, using observations generated from a background model of D. rerio intergenic sequences, with both a periodic CTC motif and an irregular, but recurring TATAWAW box spiked in, can be found in tests/spike recovery.jl.

include(dirname(dirname(pathof(BioMotifInference)))*"/test/spike_recovery.jl")

Output:

Documentation

More extensive documentation is available here.

About

Multiple uncounted motif inference by nested sampling of independent component sequence source models

Resources

License

Stars

Watchers

Forks

Packages

No packages published