-
Notifications
You must be signed in to change notification settings - Fork 590
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add withdrawn and is_control columns [VS-70] [VS-213] #7736
Conversation
echo "SELECT i.sample_name FROM \`${INFO_SCHEMA_TABLE}\` p JOIN items i ON (p.partition_id = CAST(i.sample_id AS STRING)) WHERE p.total_logical_bytes > 0 AND (table_name like 'ref_ranges_%' OR table_name like 'vet_%' OR table_name like 'pet_%')" >> query.sql | ||
echo "UNION DISTINCT " >> query.sql | ||
echo "SELECT i.sample_name FROM items i WHERE i.is_loaded = True " >> query.sql | ||
echo "SELECT i.sample_name FROM items i WHERE i.is_loaded = True AND i.withdrawn IS NULL " >> query.sql |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
since we are doing this---is the benefit to clustering the samples table by the withdrawn col too negligible?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't believe this adds clustering, it is checking for samples that should have data loaded in ref_ranges
or vet
tables.
From https://docs.google.com/document/d/1YxYddVhQ-ZHEjRY9_XRTLCufDAUtq-MDZYpoO1ex0wA
withdrawn
field (type: TIMESTAMP, nullable) tosample_info
tableis_control
field (type: BOOLEAN, required) tosample_info
tablesamples_are_controls
boolean parameter to GvsAssignIds (false by default) which will populate that field for ingestwithdrawn IS NULL
control_samples
boolean parameter to GvsPrepareRangesCallset.wdl (false by default)__SAMPLES table
ifwithdrawn IS NULL
control_samples
boolean parameter to GvsExtractCallset.wdl (false by default)Closes