Incident VS 365 clinvar classification fix #7769

RoriCremer · 2022-04-08T18:49:15Z

In the Python file that transforms the annotation JSONs into the JSONs for BigQuery ingest, there were two typos that prevented "pathogenic" and "likely pathogenic" from being used as values.

mcovarr

I commented on the parts I felt I understood but there's a lot I still don't understand. 🙂

mcovarr · 2022-04-08T19:17:05Z

scripts/variantstore/wdl/GvsCreateVATFromAnnotations.wdl

+          export GOOGLE_APPLICATION_CREDENTIALS=local.service_account.json
+          gcloud auth activate-service-account --key-file=local.service_account.json
+
+          gsutil cp ~{inputFileofFileNames} ~{updated_input_files}


Looks like this line is meant to be outside the if? If has_service_account_file is not true, this command block will do nothing, and the output expression for input_jsons might not be happy about that. 🙂

line 77 should save me in this scenario, but soon, no SA!
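
A minimal sketch of the restructuring suggested in this thread, written as plain bash rather than a WDL command block. The function name stage_input and its positional arguments are hypothetical stand-ins for the WDL interpolations; the gsutil and gcloud invocations mirror the ones that appear in the diffs. Only the credential setup stays inside the conditional, while the copy runs either way so the expected output file is always produced.

```shell
#!/usr/bin/env bash
set -eu

# Hypothetical helper: stage_input SRC DEST [SERVICE_ACCOUNT_JSON]
# SRC and DEST stand in for ~{inputFileofFileNames} and ~{updated_input_files}.
stage_input() {
  local src="$1" dest="$2" sa_json="${3:-}"

  # Only the credential setup depends on a service account being supplied.
  if [ -n "$sa_json" ]; then
    gsutil cp "$sa_json" local.service_account.json
    export GOOGLE_APPLICATION_CREDENTIALS=local.service_account.json
    gcloud auth activate-service-account --key-file=local.service_account.json
  fi

  # The copy runs unconditionally, so the downstream output always exists.
  gsutil cp "$src" "$dest"
}
```

Called with two arguments, the function only copies; passing a third argument (the service account JSON path) adds the authentication steps before the copy.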

mcovarr · 2022-04-08T19:17:37Z

scripts/variantstore/wdl/GvsCreateVATFromAnnotations.wdl

+            export GOOGLE_APPLICATION_CREDENTIALS=local.service_account.json
+            gcloud auth activate-service-account --key-file=local.service_account.json
+
+            gsutil cp ~{annotation_json} ~{updated_annotation_json}


similar "should be outside if" situation here

mcovarr · 2022-04-08T19:18:58Z

scripts/variantstore/wdl/GvsCreateVATFromAnnotations.wdl

+        String dataset_name
+        File? vat_schema_json_file = "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/schemas/vat_schema.json"
+        File? variant_transcript_schema_json_file = "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/schemas/vt_schema.json"
+        File? genes_schema_json_file = "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/schemas/genes_schema.json"


is this a stable location for these files?

nope---we need to have a conversation about where all the Nirvana stuff and VAT schemas should live

mcovarr · 2022-04-08T19:24:17Z

scripts/variantstore/wdl/GvsCreateVATFromAnnotations.wdl

+    # ------------------------------------------------
+    # Runtime settings:
+    runtime {
+        docker: "us.gcr.io/broad-dsde-methods/variantstore:rc_vat_update_2022_05_06"


Just a heads up this date is in the future 🙂

well if I keep getting distracted by other work....

mcovarr · 2022-04-08T19:36:06Z

scripts/variantstore/wdl/GvsCreateVATFromAnnotations.wdl

+       echo "Loading data into a pre-vat table ~{dataset_name}.~{variant_transcript_table}"
+       echo ~{vt_path}
+       echo ~{genes_path}
+       bq --location=US load --project_id=~{project_id} --source_format=NEWLINE_DELIMITED_JSON ~{dataset_name}.~{variant_transcript_table} ~{vt_path}


Is it okay to proceed with loading data into the variant transcript table if it already existed? The seemingly similar logic for the vat table on line 248 rms the table if it already existed.

yes---the logic was originally created such that we could keep adding data to the two temp tables--the Variant-Transcript table and the Genes table--and then a fresh VAT would be created based on that.

Now that the usage has changed a bit, reworking the logic and failsafes here would be helpful.
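
One way such a failsafe could look, sketched as plain bash under stated assumptions. The function name load_fresh and its arguments are hypothetical, and the sketch assumes that bq show exits non-zero when the table does not exist. The bq rm, bq mk, and bq load flags are copied from the diffs in this PR. It applies the same remove-then-recreate treatment the VAT table gets, which is only appropriate if appending to the temp tables is no longer wanted.

```shell
#!/usr/bin/env bash
set -eu

# Hypothetical helper: load_fresh PROJECT DATASET.TABLE SCHEMA_JSON GCS_PATH
# Recreates the table from its schema, then loads newline-delimited JSON.
load_fresh() {
  local project="$1" table="$2" schema="$3" path="$4"

  # Assumption: `bq show` exits non-zero when the table is missing.
  if bq show --project_id="$project" "$table" > /dev/null 2>&1; then
    echo "Table $table already exists; removing it before reloading"
    bq rm -t -f --project_id="$project" "$table"
  fi

  # Create an empty table with the expected schema, then load into it.
  bq --location=US mk --project_id="$project" "$table" "$schema"
  bq --location=US load --project_id="$project" \
    --source_format=NEWLINE_DELIMITED_JSON "$table" "$path"
}
```

If the original append behavior is still needed for some runs, the removal step could be gated behind an explicit workflow input instead of running every time.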

mcovarr · 2022-04-08T19:36:22Z

scripts/variantstore/wdl/GvsCreateVATFromAnnotations.wdl

+       fi
+
+       echo "Loading data into a pre-vat table ~{dataset_name}.~{genes_table}"
+       bq --location=US load  --project_id=~{project_id} --source_format=NEWLINE_DELIMITED_JSON  ~{dataset_name}.~{genes_table} ~{genes_path}


Codecov Report

❗ No coverage uploaded for pull request base (ah_var_store@2381a09).
The diff coverage is n/a.

@@               Coverage Diff                @@
##             ah_var_store     #7769   +/-   ##
================================================
  Coverage                ?   86.308%           
  Complexity              ?     35194           
================================================
  Files                   ?      2170           
  Lines                   ?    164837           
  Branches                ?     17775           
================================================
  Hits                    ?    142267           
  Misses                  ?     16248           
  Partials                ?      6322

mcovarr · 2022-04-26T11:46:01Z

scripts/variantstore/variant_annotations_table/GvsValidateVAT.wdl

-        gcloud config set project ~{query_project_id}
+          gsutil cp ~{service_account_json_path} local.service_account.json
+          gcloud auth activate-service-account --key-file=local.service_account.json
+          gcloud config set project ~{query_project_id}


preexisting issue but it seems like we mix 2 and 4 space indents in the command blocks of the same WDLs

gbggrant · 2022-04-26T20:04:11Z

scripts/variantstore/wdl/GvsCreateVATFromAnnotations.wdl

+        memory: "8 GB"
+        preemptible: 5
+        cpu: "1"
+        disks: "local-disk 250 SSD"


Are you sure you need SSD here?

No---I'd ideally like to parameterize a bunch of these values

RoriCremer changed the base branch from master to ah_var_store April 8, 2022 18:49

RoriCremer force-pushed the rc-vs-365-clinvar-class branch 2 times, most recently from c409998 to 0cf009a Compare April 8, 2022 19:07

mcovarr reviewed Apr 8, 2022


RoriCremer added 10 commits April 25, 2022 22:09

fix typos

a3785d0

create new WDL for second half of workflow only

2a94c07

clean up deleted file

a7bb12e

catch any unexpected clinvar classification values

61107a4

clean up optional files while here

c705099

add test and clean up vat validation

92df776

update dockstore

9a0c4fe

better VAT creation gating

58506ef

update docker along with python scripts

9de31b6

update default VAT pipeline docker too

f6f0d48

RoriCremer force-pushed the rc-vs-365-clinvar-class branch from ff38131 to f6f0d48 Compare April 26, 2022 02:16

mcovarr approved these changes Apr 26, 2022


gbggrant approved these changes Apr 26, 2022


RoriCremer merged commit 75b5115 into ah_var_store Apr 27, 2022

RoriCremer deleted the rc-vs-365-clinvar-class branch April 27, 2022 21:13

This was referenced Mar 17, 2023

lb merge gvs branch #8248

Closed

testing something, please ignore #8251

Closed

+                      # note: tab delimiter and compression creates tsv.gz files
+                      bq query --nouse_legacy_sql --project_id=~{project_id} \
+                      'EXPORT DATA OPTIONS(
+                      uri="~{export_path}",

+                       bq --location=US mk --project_id=~{project_id} ~{dataset_name}.~{vat_table} ~{nirvana_schema}
+                     else
+                       bq rm -t -f --project_id=~{project_id} ~{dataset_name}.~{vat_table}
+                       bq --location=US mk --project_id=~{project_id} ~{dataset_name}.~{vat_table} ~{nirvana_schema}


