This package provides a codon optimization tool for tissue-specific gene design. It follows a probabilistic approach with two steps:
- Translate tissue-specific codon preferences into a pool of optimal sequences
- Select the desired sequence based on parameters of relevance.
-
(optional) Create a virtual environment to install the tool
-
Install CUSTOM and its requirements using pip:
pip install custom_optimizer
As a basic example, here is the code to optimize an eGFP protein to kidney:
# Import package
from custom import TissueOptimizer
# Start the optimizer
opt = TissueOptimizer("Kidney", n_pool=100)
# Optimize the eGFP sequence
egfp = "MVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK"
opt.optimize(egfp)
# Select the top 10 sequences
best_egfp_kidney = opt.select_best(by={"MFE":"min","MFEini":"max","CAI":"max","CPB":"max","ENC":"min"},homopolymers=7, top=10)
This project has been developed at Center for Genomic Regulation.
Hernandez-Alias, X., Benisty, H., Radusky, L.G., Serrano, L. & Schaefer, M. H. (2023). Using protein-per-mRNA differences among human tissues in codon optimization. Genome Biology, 24(1):34. (https://doi.org/10.1186/s13059-023-02868-2)
CUSTOM is under a common GNU GENERAL PUBLIC LICENSE. Plese, check LICENSE for further information.