Skip to content

Commit

Permalink
docs: v0.6 of ggoncoplot JOSS paper
Browse files Browse the repository at this point in the history
  • Loading branch information
selkamand committed Jul 29, 2024
1 parent 2149b21 commit df86e8a
Show file tree
Hide file tree
Showing 4 changed files with 43 additions and 35 deletions.
64 changes: 36 additions & 28 deletions paper/jats/paper.jats
Original file line number Diff line number Diff line change
Expand Up @@ -95,20 +95,22 @@ a Creative Commons Attribution 4.0 International License (CC BY
<body>
<sec id="summary">
<title>Summary</title>
<p>The ggoncoplot R package generates interactive oncoplots (also
called oncoprints) to visualize mutational patterns across patient
cancer cohorts
<p>The ggoncoplot R package generates interactive oncoplots to
visualize mutational patterns across patient cancer cohorts
(<xref alt="[fig:oncoplot]" rid="figU003Aoncoplot">[fig:oncoplot]</xref>).
Oncoplots reveal patterns of gene co-mutation and include marginal
plots that indicate co-occurrence of gene mutations and tumour
features. It is useful to relate gene mutation patterns seen in an
oncoplot to patterns seen in other plot types, including gene
expression t-SNE plots or methylation UMAPs. There are, however, no
existing oncoplot-generating R packages that support dynamic data
linkage between different plots. To addresses this gap and enable
rapid exploration of a variety of data types we constructed the
ggoncoplot package for the production of oncoplots that are easily
integrated with custom visualisations and that support synchronised
Oncoplots, also called oncoprints, reveal patterns of gene co-mutation
and include marginal plots that indicate co-occurrence of gene
mutations with tumour and clinical features. It is useful to relate
gene mutation patterns seen in an oncoplot to patterns in other plot
types, including gene expression t-SNE plots or methylation UMAPs. The
simplest and most intuitive approach to examining such relations is to
link plots dynamically such that samples selected in an oncoplot can
be highlighted in other plots. There are, however, no existing
oncoplot-generating R packages that support dynamic data linkage
between different plots. To address this gap and enable rapid
exploration of a variety of data types we constructed the ggoncoplot
package for the production of oncoplots that are easily integrated
with custom visualisations and that support synchronised
data-selections across plots
(<xref alt="[fig:multimodal_selection]" rid="figU003Amultimodal_selection">[fig:multimodal_selection]</xref>).
ggoncoplot is available on GitHub at
Expand Down Expand Up @@ -150,44 +152,50 @@ a Creative Commons Attribution 4.0 International License (CC BY
</sec>
<sec id="statement-of-need">
<title>Statement of Need</title>
<p>Oncoplots are highly effectively for visualising mutation data in
<p>Oncoplots are highly effective for visualising mutation data in
cancer cohorts but are challenging to generate with the major R
plotting systems (base, lattice, or ggplot2) due to their algorithmic
and graphical complexity. Simplifying the process would make oncoplots
more accessible to researchers. Packages like ComplexHeatmap
and graphical complexity. Simplifying the process of generating
oncoplots would make them more accessible to researchers. Existing
packages including ComplexHeatmap
(<xref alt="Gu, 2022" rid="ref-GuU003A2022" ref-type="bibr">Gu,
2022</xref>), maftools
(<xref alt="Mayakonda et al., 2018" rid="ref-MayakondaU003A2018" ref-type="bibr">Mayakonda
et al., 2018</xref>), and genVisR
(<xref alt="Skidmore et al., 2016" rid="ref-SkidmoreU003A2016" ref-type="bibr">Skidmore
et al., 2016</xref>) all make static oncoplots easier to create, but
there is still a significant unmet need for an easy method of creating
oncoplots with the following features:</p>
there is still a significant unmet need for user-friendly method of
creating oncoplots with the following features:</p>
<list list-type="bullet">
<list-item>
<p><bold>Interactive plots</bold>: Customizable tooltips,
cross-selection of samples across different plots, and
auto-copying of sample identifiers on click.</p>
auto-copying of sample identifiers on click. This enables
exploration of trends in multiomic datasets as shown in
<xref alt="[fig:multimodal_selection]" rid="figU003Amultimodal_selection">[fig:multimodal_selection]</xref>.</p>
</list-item>
<list-item>
<p><bold>Support for tidy datasets</bold>: Compatibility with
tidy, tabular mutation-level formats (MAF files or relational
databases), typical of cancer cohort datasets.</p>
databases), typical of cancer cohort datasets. This greatly
improves the range of datasets that can be quickly and easily
visualised in an oncoplot.</p>
</list-item>
<list-item>
<p><bold>Auto colouring</bold>: Automatic selection of colour
palettes for datasets where consequence annotations are aligned
with standard variant effect dictionaries (PAVE, SO, or MAF).</p>
<p><bold>Auto-colouring</bold>: Automatic selection of colour
palettes for datasets where the consequence annotations are
aligned with standard variant effect dictionaries (PAVE, SO, or
MAF).</p>
</list-item>
<list-item>
<p><bold>Versatility</bold>: The ability to visualize entities
other than gene mutations, including noncoding features (e.g.,
enhancers) and non-genomic entities (e.g., microbial presence in
microbiome datasets).</p>
other than gene mutations, such as noncoding features (e.g.,
promoter or enhancer mutations) and non-genomic entities (e.g.,
microbial presence in microbiome datasets).</p>
</list-item>
</list>
<p>We developed ggoncoplot as the first R package that addresses all
these challenges simultaneously
<p>We developed ggoncoplot as the first R package to address all these
challenges together
(<xref alt="[fig:comparison]" rid="figU003Acomparison">[fig:comparison]</xref>).
Examples of all key features are available in the
<ext-link ext-link-type="uri" xlink:href="https://selkamand.github.io/ggoncoplot/articles/manual.html">ggoncoplot
Expand Down
Binary file modified paper/paper.docx
Binary file not shown.
14 changes: 7 additions & 7 deletions paper/paper.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ affiliations:

# Summary

The ggoncoplot R package generates interactive oncoplots (also called oncoprints) to visualize mutational patterns across patient cancer cohorts (\autoref{fig:oncoplot}). Oncoplots reveal patterns of gene co-mutation and include marginal plots that indicate co-occurrence of gene mutations and tumour features. It is useful to relate gene mutation patterns seen in an oncoplot to patterns seen in other plot types, including gene expression t-SNE plots or methylation UMAPs. There are, however, no existing oncoplot-generating R packages that support dynamic data linkage between different plots. To addresses this gap and enable rapid exploration of a variety of data types we constructed the ggoncoplot package for the production of oncoplots that are easily integrated with custom visualisations and that support synchronised data-selections across plots (\autoref{fig:multimodal_selection}). ggoncoplot is available on GitHub at <https://github.com/selkamand/ggoncoplot>.
The ggoncoplot R package generates interactive oncoplots to visualize mutational patterns across patient cancer cohorts (\autoref{fig:oncoplot}). Oncoplots, also called oncoprints, reveal patterns of gene co-mutation and include marginal plots that indicate co-occurrence of gene mutations with tumour and clinical features. It is useful to relate gene mutation patterns seen in an oncoplot to patterns in other plot types, including gene expression t-SNE plots or methylation UMAPs. The simplest and most intuitive approach to examining such relations is to link plots dynamically such that samples selected in an oncoplot can be highlighted in other plots. There are, however, no existing oncoplot-generating R packages that support dynamic data linkage between different plots. To address this gap and enable rapid exploration of a variety of data types we constructed the ggoncoplot package for the production of oncoplots that are easily integrated with custom visualisations and that support synchronised data-selections across plots (\autoref{fig:multimodal_selection}). ggoncoplot is available on GitHub at <https://github.com/selkamand/ggoncoplot>.

![ggoncoplot output visualising mutational trends in the TCGA breast carcinoma cohort. Individual patient samples are plotted on the x-axis, hierarchically sorted so that samples with the most frequent gene mutations appear on the leftmost side. The plot indicates that PIK3CA is the most frequently mutated gene, followed by TP53. Marginal plots indicate the total number of mutations per sample (top), and the number of samples showing mutations in each gene, coloured by mutation type (right). A range of clinical features, including progesterone and estrogen receptor status are shown on the marginal plot at the bottom. A detailed description of the ggoncoplot sorting algorithm is available [here](https://selkamand.github.io/ggoncoplot/articles/sorting_algorithm.html). \label{fig:oncoplot}](oncoplot.pdf)

Expand All @@ -41,18 +41,18 @@ The ggoncoplot R package generates interactive oncoplots (also called oncoprints

# Statement of Need

Oncoplots are highly effectively for visualising mutation data in cancer cohorts but are challenging to generate with the major R plotting systems (base, lattice, or ggplot2) due to their algorithmic and graphical complexity. Simplifying the process would make oncoplots more accessible to researchers. Packages like ComplexHeatmap [@Gu:2022], maftools [@Mayakonda:2018], and genVisR [@Skidmore:2016] all make static oncoplots easier to create, but there is still a significant unmet need for an easy method of creating oncoplots with the following features:
Oncoplots are highly effective for visualising mutation data in cancer cohorts but are challenging to generate with the major R plotting systems (base, lattice, or ggplot2) due to their algorithmic and graphical complexity. Simplifying the process of generating oncoplots would make them more accessible to researchers. Existing packages including ComplexHeatmap [@Gu:2022], maftools [@Mayakonda:2018], and genVisR [@Skidmore:2016] all make static oncoplots easier to create, but there is still a significant unmet need for user-friendly method of creating oncoplots with the following features:

- **Interactive plots**: Customizable tooltips, cross-selection of samples across different plots, and auto-copying of sample identifiers on click.
- **Interactive plots**: Customizable tooltips, cross-selection of samples across different plots, and auto-copying of sample identifiers on click. This enables exploration of trends in multiomic datasets as shown in \autoref{fig:multimodal_selection}.

- **Support for tidy datasets**: Compatibility with tidy, tabular mutation-level formats (MAF files or relational databases), typical of cancer cohort datasets.
- **Support for tidy datasets**: Compatibility with tidy, tabular mutation-level formats (MAF files or relational databases), typical of cancer cohort datasets. This greatly improves the range of datasets that can be quickly and easily visualised in an oncoplot.

- **Auto colouring**: Automatic selection of colour palettes for datasets where consequence annotations are aligned with standard variant effect dictionaries (PAVE, SO, or MAF).
- **Auto-colouring**: Automatic selection of colour palettes for datasets where the consequence annotations are aligned with standard variant effect dictionaries (PAVE, SO, or MAF).

- **Versatility**: The ability to visualize entities other than gene mutations, including noncoding features (e.g., enhancers) and non-genomic entities (e.g., microbial presence in microbiome datasets).
- **Versatility**: The ability to visualize entities other than gene mutations, such as noncoding features (e.g., promoter or enhancer mutations) and non-genomic entities (e.g., microbial presence in microbiome datasets).


We developed ggoncoplot as the first R package that addresses all these challenges simultaneously (\autoref{fig:comparison}). Examples of all key features are available in the [ggoncoplot manual](https://selkamand.github.io/ggoncoplot/articles/manual.html).
We developed ggoncoplot as the first R package to address all these challenges together (\autoref{fig:comparison}). Examples of all key features are available in the [ggoncoplot manual](https://selkamand.github.io/ggoncoplot/articles/manual.html).


![Comparison of R packages for creating oncoplots. ^1^Requires the shiny and interactiveComplexHeatmap packages. ^2^Exclusively colours tiles based on mutation impact which must be described using valid MAF variant classification terms. ^3^Requires the user to first summarise mutations at the gene level and format as a sample by gene matrix with mutations separated by semicolons (wide format). ^4^For MAF inputs the most severe consequence is chosen, however for non-MAF datasets users must manually define the mutation impact hierarchy. ^5^Non-unique mutation types are treated as one observation, however if different mutation types affect one gene, the individual mutations can be plotted with different shapes or sizes in a user-configured manner. \label{fig:comparison}](ggoncoplot_comparison.pdf)
Expand Down
Binary file modified paper/paper.pdf
Binary file not shown.

0 comments on commit df86e8a

Please sign in to comment.