Skip to content
This repository has been archived by the owner on Sep 14, 2021. It is now read-only.

Examples of how to get started with genomics data in BigQuery in many languages.

License

Notifications You must be signed in to change notification settings

googlegenomics/getting-started-bigquery

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

56 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

getting-started-bigquery

The repository contains examples of using BigQuery with genomics data. The code within each language-specific folder demonstrates the same set of queries upon the Platinum Genomes dataset. For more detail about this data see Google Genomics Public Data.

Using the BigQuery browser tool

  1. Go to the BigQuery Browser Tool.
  2. Click on "Compose Query".
  3. Copy and paste the following query into the dialog box and click on "Run Query":
#standardSQL
-- Count the number of records (variant and reference segments) we have in
-- the dataset and the total number of calls nested within those records.
--
-- The source data for table genomics-public-data:platinum_genomes.variants
-- was gVCF so a record can be a particular variant or a non-variant segment.
-- https://sites.google.com/site/gvcftools/home/about-gvcf
--
SELECT
  reference_name,
  COUNT(reference_name) AS num_records,
  SUM(ARRAY_LENGTH(call)) AS num_calls
FROM
  `genomics-public-data.platinum_genomes.variants` v
GROUP BY
  reference_name
ORDER BY
  reference_name

View the results!

Query Results

What next?


Alternate ways to work with BigQuery

Instead of using the browser tool to send queries to BigQuery, you can use code in many languages to call the BigQuery API.

  • Try the "getting started" samples in one or more languages by navigating to the subdirectory in this repository for the desired language:

  • All languages will require a Project ID from a project that has the BigQuery API enabled.

  • For more information on accessing BigQuery from other languages, see: Create A Simple Application With the API

About

Examples of how to get started with genomics data in BigQuery in many languages.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published