JBrowse Pangenome Visualization Guide

This guide describes how to visualize pangenome graphs in JBrowse and is aimed at bioinformaticians working in genomic research. The workflow combines several tools and plugins to support interactive exploration of pangenomes.

Prerequisites

Ensure the following tools and resources are available:

JBrowse: JBrowse components
JBrowse MAF Plugin: MAFViewer
WGA Tools: wgatools
MAFChunk: mafchunk
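maf2bed: Install via Cargo with `cargo install maf2bed`
Docker, bgzip, and tabix (bgzip and tabix ship with htslib): used by the commands in the workflow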

Workflow

1. Building the Graph with `pggb`

Generate the graph using pggb within a Docker container:

docker run -it -v "$PWD":/data ghcr.io/pangenome/pggb:latest pggb -i /data/<SEQUENCES.fa> -M -o /data/OUTPUTDIR -n 2 -t 15 -p 90 -s 5000

Replace <SEQUENCES.fa> with your input FASTA file. Sequence names should follow the ASSEMBLYNAME.CHROMOSOME pattern (e.g., hg38.chr1, chm13.chr12).
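
If the FASTA headers do not yet carry the assembly name, they can be rewritten before running pggb. The following is a minimal sketch, not part of pggb: `prefix_fasta_headers` is a hypothetical helper name, and it assumes the part of each header after `>` is the bare chromosome name.

```shell
# prefix_fasta_headers: hypothetical helper, not part of pggb.
# Reads FASTA on stdin and turns ">chr1" into ">ASSEMBLY.chr1".
prefix_fasta_headers() {
  awk -v prefix="$1" '
    # Header lines start with ">"; insert the assembly name after it.
    /^>/ { print ">" prefix "." substr($0, 2); next }
    # Sequence lines pass through unchanged.
         { print }
  '
}

# Example usage (file names are placeholders):
#   prefix_fasta_headers hg38 < hg38.fa > hg38.renamed.fa
```

Run it once per assembly, then concatenate the renamed files into the single FASTA passed to pggb.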

2. Generating MAF Files with `wgatools`

Convert PAF files to MAF format for visualization:

wgatools pafpseudo -f <FASTA_FILE> -g <TARGET> -o <OUTPUT.maf> <INPUT.paf>

<FASTA_FILE>: File containing sequences from the pggb graph build.
<TARGET>: Reference assembly name (leave empty for multiple .maf outputs).
<OUTPUT.maf>: Resulting MAF file.
<INPUT.paf>: PAF file from pggb graph construction.

3. Chunking MAF Files

Split the MAF file into manageable chunks for visualization:

mafchunk <OUTPUT.maf> 100000 > <CHUNKED.maf>

A chunk size of 100000 is recommended for interactive visualization; adjust it to the chromosome size.

4. Creating Pseudo-BED for MAFViewer

Generate a pseudo-BED file from the chunked MAF:

cat <CHUNKED.maf> | maf2bed <TARGET> | bgzip > out.bed.gz

5. Indexing the Pseudo-BED

Index the generated BED file:

tabix -p bed out.bed.gz
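
tabix expects its input to be sorted by sequence name and start position. Whether the maf2bed output is already in that order for a given input is not stated here, so a quick check before indexing can save a confusing error. This is a sketch only: `bed_is_sorted` is a hypothetical helper, and it assumes a tab-separated file with the sequence name in column 1 and the start in column 2.

```shell
# bed_is_sorted: hypothetical helper, not part of tabix or maf2bed.
# Reads tab-separated BED on stdin. Exit status 0 means every sequence forms
# one contiguous block and start positions never decrease within a block.
bed_is_sorted() {
  awk -F '\t' '
    $1 != chrom {
      if ($1 in seen) { bad = 1; exit }   # sequence reappears after another one
      seen[$1] = 1; chrom = $1; last = -1
    }
    ($2 + 0) < last { bad = 1; exit }     # start position went backwards
    { last = $2 + 0 }
    END { exit (bad + 0) }
  '
}

# Example usage:
#   gzip -dc out.bed.gz | bed_is_sorted && tabix -p bed out.bed.gz
```

If the check fails, sorting the BED with `sort -k1,1 -k2,2n` before piping it into bgzip is the usual remedy.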

Launching JBrowse

Start JBrowse (either desktop or web version):

For the desktop version, initiate the webpack dev server and electron window:
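```
cd products/jbrowse-desktop
# Start the webpack dev server (keeps running in this terminal)
yarn start
# In a second terminal, from the same directory, open the Electron window
yarn electron
```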
Install the MAF Plugin from the Plugin Store within JBrowse if not already done.
Add a MAF Track using the out.bed.gz and out.bed.gz.tbi files, along with the assembly names from your <FASTA_FILE>.
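
To collect the assembly names for the MAF track, one option is to read them from the FASTA headers. The sketch below uses a hypothetical helper, `fasta_assembly_names`, and assumes the ASSEMBLYNAME.CHROMOSOME naming from step 1 with no "." inside the assembly name itself.

```shell
# fasta_assembly_names: hypothetical helper, not part of JBrowse.
# Reads FASTA on stdin and prints each distinct assembly name once.
fasta_assembly_names() {
  awk '
    /^>/ {
      name = substr($0, 2)          # drop the leading ">"
      sub(/[. \t].*/, "", name)     # keep the text before the first "." or space
      print name
    }
  ' | sort -u
}

# Example usage (file name is a placeholder):
#   fasta_assembly_names < sequences.fa
```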

You can now add additional tracks for your reference assembly, facilitating a comprehensive pangenome analysis.

Result Preview

The visualization process culminates in an interactive pangenome view, akin to this example:

Synteny View Visualization

Visualizing synteny becomes straightforward with JBrowse's capability to directly use the .paf files produced by pggb. Follow these steps for an insightful synteny analysis:

1. Adding Assemblies to JBrowse

Navigate to File -> Open Assembly in JBrowse and add the assemblies involved in the alignment.

2. Creating a Dotplot

Open a dotplot view comparing two assemblies, create a new synteny track, and load the .paf file from pggb. The resulting dotplot shows synteny between the assemblies:

Troubleshooting Dotplots

An empty dotplot usually indicates a mismatch in assembly naming conventions (e.g., using hg38.chr21 versus simply chr21). Ensure consistency in naming within the FASTA files.
The order of assembly comparison might affect the output. For optimal results, arrange the dotplot with the assembly listed first in the .paf file as the primary assembly.
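
Both checks can be made by looking at the names stored in the .paf file. In the PAF format, column 1 holds the query sequence name and column 6 holds the target sequence name. The helper below, `paf_names`, is a hypothetical name used only for this sketch.

```shell
# paf_names: hypothetical helper, not part of JBrowse or pggb.
# Reads PAF on stdin and prints the distinct names in the given column.
# Column 1 = query sequence names, column 6 = target sequence names.
paf_names() {
  cut -f "$1" | sort -u
}

# Example usage (file name is a placeholder):
#   paf_names 1 < alignments.paf   # query names, e.g. hg38.chr21
#   paf_names 6 < alignments.paf   # target names
```

Compare the printed names with the sequence names of the assemblies loaded in JBrowse; a prefix present on one side only (hg38.chr21 versus chr21) would explain an empty dotplot.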

3. Generating Linear Synteny Views

Following the dotplot visualization, you can select specific regions to create linear synteny views. Additionally, you may add tracks to enhance the comparison between the assemblies. Such linear views can reveal structural variations like insertions:

This step-by-step approach simplifies the visualization of genomic synteny, enabling a deeper understanding of structural variations and alignments.

Future Directions and Enhancements

The current workflow, while effective for visualizing pangenome graphs, has its limitations. Tools like Waragraph excel in interactive genome structure visualization but may fall short in highlighting genomic functions. This section outlines potential enhancements and considerations for future development.

Linear View Enhancements

While Waragraph and odgi layout offer compelling one-dimensional (1D) visualizations, they often struggle to incorporate functional genomic information, which is typically tied to individual tracks. MAFViewer addresses some of these concerns, but further improvements could include:

Color-coded Copy Numbers: Enhancing visual differentiation through color-coding based on copy number variations.
Reference Switching: Streamlining the process to switch between different reference genomes.
Comprehensive Feature Addition: Facilitating the addition of genomic features across all assemblies within a pangenome, requiring a unified approach like merging GFF3 files in JBrowse.
PanGene Annotations: Integrating PanGene annotations for GFA visualization could provide insightful views on gene presence and variation within the pangenome. A potential improvement could involve developing a GFA Adapter for JBrowse to allow for direct interaction with GFA or derived formats, enabling visualization of gene paths or functional features:

Two-Dimensional (2D) Visualization

Exploring 2D genome visualizations as offered by Waragraph presents an innovative way to examine genome structures:

However, integrating these visualizations into JBrowse, while maintaining a coherent relationship with existing data and features, poses a significant challenge and an exciting area for future exploration.
