-
Notifications
You must be signed in to change notification settings - Fork 0
v7 consensus_wgs_plus_cnvkit_wxs_x_and_y.tsv.gz status== NA
issue
#168
Comments
This has been fixed in V8 histologies file - additionally, the v8 release of |
Thanks for the update @runjin326 However, I believe we need to remove all status==NA in the cnv files. For WGS the GMKF germliner_sex_estimate fixes majority of the issue of WGS xy calls but in general we remove the NAs here status == NA in WGS denote the regions where CNVs were not called in WGS samples. For WXS since we only run the annotation we are status==NAs in XY chromosomes is from missing the germline_sex_estimate values. The status column is used while uploading to pedbcio and NA is not a accepted value there. cc @jharenza for input if I missed anything. |
@kgaonkar6, thanks for clarifying this and I just reopened this! So you meant maybe we should add another line of code to remove all WXS samples with status ==NA? |
@kgaonkar6 - were these left in for v7? I believe @migbro just set them to 0 for pedcbio to be aligned with GISTIC no copy change calls. I think we want to keep these in because we would want these to simulate a full, not partial seg file. Additionally, for chromothripsis analysis, we may need these regions available for recoding to infer CN oscillation events. So, if the only reason to remove is for pedcbio, I would say don't, and @migbro can handle that during his transform. |
Hmm so the status==NA in XY chromosomes in WXS samples is because the germline_sex_estimate is not available for WXS and thus a status cannot be assigned using the logic here, not because there is no copy change in the region I believe: Example here we see the copy change compared to ploidy but since germline_sex_estimate is NA the status is NA:
|
Ahh, I see - in this case, I think we probably ought to use the |
Close with PR89 for now - the remaining issue will be addressed in ticket 177 |
What data file(s) does this issue pertain to?
consensus_wgs_plus_cnvkit_wxs_x_and_y.tsv.gz
What release are you using?
v7
Put your question or report your issue here.
There are NAs in consensus_wgs_plus_cnvkit_wxs_x_and_y.tsv.gz because germline_sex_estimate is NA and is used in annotating the status for samples here
WXS samples will not have germline_sex_estimate so we will have to add a filter specfic to WXS which will not have
germline_sex_estimate
The WGS samples from GMKF didn't have germline_sex_estimate in v7 thus status == NA. We will have it updated in v8.
The text was updated successfully, but these errors were encountered: