Skip to content

Commit

Permalink
feat: updates to RNA-Seq information
Browse files Browse the repository at this point in the history
  • Loading branch information
julianguyn committed Nov 27, 2024
1 parent 9c81b17 commit 8a699c8
Showing 1 changed file with 28 additions and 22 deletions.
50 changes: 28 additions & 22 deletions docs/disciplines/Bioinformatics/Data_Types/rnaseq/introduction.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,65 +4,71 @@

## **What is RNA Sequencing (RNA-seq)?**

**RNA sequencing (RNA-seq)** is a method scientists use to study RNA molecules inside cells.
**RNA sequencing (RNA-seq)** is a high-throughput sequencing technology that allows scientists to map and quantify the transcriptome - essentially studying RNA molecules within cells. The transcriptome, or the RNA space, is comprised of messenger RNA (mRNA) and other non-coding RNAs (ncRNAs).

**Central Dogma of Molecular Biology: DNA → RNA → Protein**

RNA acts as the "messenger" that carries instructions from DNA (the cell’s blueprint) to make proteins, which do the actual work in the body. By sequencing RNA, we get a snapshot of the genes that are active at a specific moment, allowing us to understand how cells function and respond to different conditions.
While all RNAs are transcribed from RNA, only the mRNAs are further translated into protein, hence they are oftend referred to as the "messengers" that carries instructions from DNA to create proteins, or the functional units within the cells. ncRNAs have several roles relating to the regulation of gene expression and other cellular activities. By sequencing RNA, we get a snapshot of the transcriptome, including active genes, allowing us to understand how cells function and respond to different conditions.

---

## **Purpose of RNA-seq in Bioinformatics/Transcriptomics**

**Bioinformatics** is the use of computers to study biology, and **transcriptomics** focuses on studying RNA. RNA-seq is used to understand:
**Bioinformatics** is the use of computational approaches to study biology, and **transcriptomics** focuses on studying RNA. Some purposes of RNA-seq in bioinformatic analysis include:

**1. Which genes are "switched on"?**
**1. Gene expression and activity**

Identifies the genes actively transcribed into RNA in a given condition or cell type.
Identifies the genes actively transcribed into mRNA in a given condition or cell type, giving an indication of which genes are "switched on". By quantifying the abundance of mRNA molecules, we can measure the activity levels of specific genes. RNA-Seq also allowed for examining post-transcriptional modifications such as alternative splicing, where RNA segments are rearranged or removed, influencing the diversity of proteins produced.

**2. How active is each gene?**
**2. ncRNA expression and activity**

Quantifies the abundance of RNA molecules to measure the activity levels of specific genes.
Beyond mRNAs, several ncRNAs are of interest for cancer research. RNA-Seq also profiles the expression levels of these transcripts such that similar analyses can be performed as with mRNA.

**3. What changes occur in gene activity?**
**3. Differential RNA Expression**

Detects differences in gene expression between conditions, such as healthy vs. diseased states (e.g., cancer) or before and after treatment.
Differential expression analysis can identify differences in RNA expression between different conditions (e.g. healthy vs diseased states, before and after treatment, etc).

**4. How do RNA changes impact health?**
**4. Biomarker Analysis**

Links changes in RNA levels or structures to downstream effects on proteins, signaling pathways, and overall cellular function.
RNA transcript expression can be associted with the response to different drugs, hence are often used as input features to identify biomarkers for cancer treatment.

**5. How is RNA processed?**
**5. Subtype Identification**

Patterns of RNA transcript expression can be used to identify distinct clusters or subgroups within a given population. This approach is often used to identify cancer subtypes.

**6. Functional Enrichment**

Using databases such as Gene Ontology (GO), KEGG, Reactome, mSigDB, etc, sets of RNA transcripts (e.g. from differential expression analysis) can be linked to downstream effects on biological processes, signaling pathways, and overallc cellular function.

Examines post-transcriptional modifications such as alternative splicing, where RNA segments are rearranged or removed, influencing the diversity of proteins produced.

---

## **How is RNA-seq Data Obtained?**

Here’s a simplified process:

1. **Extract RNA**: Scientists first collect RNA from cells or tissues.
2. **Make cDNA**: RNA is fragile, so it is converted into complementary DNA (cDNA), which is more stable.
3. **Cut into Pieces**: The cDNA is broken into smaller fragments.
4. **Sequence the Pieces**: Machines called sequencers read the fragments and identify the order of their building blocks (A, T, G, C).
5. **Assemble the Puzzle**: Bioinformatics tools put the fragments back together and measure how much RNA was in the original sample.
1. **RNA extraction**: Scientists first isolate and collect RNA from cells or tissue samples.
2. **Synthesize cDNA**: Due to the fragile nature of RNA, these transcripts are converted into complementary DNA (cDNA), which is more stable.
3. **Fragmentation**: The cDNA is broken into smaller fragments that enable them to be sequenced.
4. **Sequencing of Fragments**: Machines called sequencers read the fragments and identify the order of RNA nucleotides (A, U, G, C).
5. **Alignment**: Sequenced fragments are aligned to a reference genome or transcriptome to get position-based mapping information, such as mapping mRNA transcripts to their respective genes.
6. **Quantification**: With aligned fragments, transcript expression can be quantified.

---

## **How is RNA-seq Different from DNA Sequencing?**

| **Feature** | **DNA Sequencing** | **RNA Sequencing** |
|--------------------------|---------------------------------------------|------------------------------------------|
| **What is studied?** | DNA, the cell’s blueprint. | RNA, the messengers from DNA. |
| **Purpose** | Understand the structure of genes, mutations, and inheritance. | Study gene activity and which genes are "on." |
| **What is studied?** | DNA, the cell’s blueprint. | RNA, transcribed from DNA. |
| **Purpose** | Understand the structure of genes, mutations, and inheritance. | Study gene and transcript activity |
| **Stable or Changing?** | DNA is mostly stable and the same in all cells. | RNA levels vary depending on the cell’s activity. |
| **Building Blocks** | A, T, G, C (Adenine, Thymine, Guanine, Cytosine). | A, U, G, C (Uracil replaces Thymine in RNA). |
| **Output** | The sequence of an organism's entire genome or specific parts. | A snapshot of all active RNA molecules. |
| **Output** | The sequence of an organism's entire genome (WGS) or specific parts (WES). | A snapshot of all active RNA molecules. |

---


## **Summary**

RNA-seq fits into the Central Dogma by focusing on RNA, the "middle step" between DNA and proteins. It provides a dynamic snapshot of gene activity, offering crucial insights into how cells work, respond to the environment, and contribute to diseases. By analyzing RNA, scientists can better understand the processes that sustain life and develop treatments for various conditions.
RNA-seq fits into the Central Dogma by focusing on RNA, the "middle step" between DNA and proteins. It provides a dynamic snapshot of gene and transcript activity, offering crucial insights into how cells work, respond to the environment, and contribute to diseases. By analyzing RNA, scientists can better understand the processes that sustain life and develop treatments for various conditions.

0 comments on commit 8a699c8

Please sign in to comment.