Skip to content

elkebir-group/TRIBAL_Dandelion_Preprocess

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Prepare single-cell BCR/TCR data from the 10x Genomics 5’ solution for use in TRIBAL analysis.

TRIBAL (TRee Inference of B cell clonAl Lineages) is a method designed to infer B cell lineage trees from single-cell RNA sequencing data. Access TRIBAL here.

Contributors to the repository: Derek Reiman, Leah Weber, Xinyu Gu

create env

conda env create -f tribal.yml 
conda activate tribal

change R path

To avoid conflict with your local R installation you might have, you would want to run the following command to set R PATH after activated the environment.

export PATH=$HOME/miniconda3/envs/TRIBAL/bin:$PATH

Parameters

To change parameters used in the file, modify them in config.yaml

	1.	species: Target species for the analysis (only “mouse” or "human" are supported for now).
	2.	data_dir: Path to the main data directory.
	3.	blastdb_dir: Path to the BLAST database directory.
	4.	igblastdb_dir: Path to the IgBLAST database directory.
	5.	germline_dir: Path to the germline sequences directory.
	6.	dandelion_dir: Directory for Dandelion-related files.
	7.	germline_vdj: Path template for V(D)J germline sequences.
	8.	v_germline_file: Path to the V germline FASTA file.
	9.	blastdb_bcr_c: Path to the BCR constant region database file.
	10.	contig_file: Path to the filtered contig file for Dandelion.
	11.	filtered_feature_h5: Name of the HDF5 file with feature data.
	12.	output_columns: List of attributes to include in the output.
	13.	mouse_isotypes: List of mouse antibody isotypes.
	14.	human_isotypes: List of human antibody isotypes.
	15.	tribal_input_table: Name of the TRIBAL input table.
	16.	tribal_root_sequences: Name of the TRIBAL root sequences file.

Run the script

python preprocess.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published