Patch/seq and processing #354
Hmm, yeah, it's kind of an issue with the model of data organization that DCA imposes. I could create custom manifests using the fileFormat info; e.g. delete all the rows that are not fq in a level 1 manifest, and all the rows that are not bam/cram in a level 2 manifest. This would add more complexity w/r/t the synapse_storage_manifest.csv, which I'd have to keep removing. I could also reorganize the data into top-level folders that reflect the data level. This would add more complexity when it comes time to index the next data deposit in the bucket, as we'd probably get duplicate filehandles created by the indexing workflow for the files that have already been indexed. 🤔

Anyway, the PR looks good to me! I caught a few OLS3 links and replaced them with the actual PURLs, since OLS3 is being deprecated at the end of the month.
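For reference, the "custom manifests by fileFormat" idea in this comment could be sketched roughly like this. This is only an illustration, not what DCA or the indexing workflow does: the `Filename` column, the exact `fileFormat` values (`fastq`, `fq`, `bam`, `cram`), and the helper names `read_manifest`, `filter_manifest`, and `write_manifest` are all assumptions made for the example.

```python
import csv
import io

# Assumed mapping from data level to accepted fileFormat values.
# The real controlled vocabulary may use different spellings.
LEVEL_FORMATS = {
    "level1": {"fastq", "fq"},
    "level2": {"bam", "cram"},
}


def read_manifest(text):
    """Parse manifest CSV text into a list of row dicts."""
    return list(csv.DictReader(io.StringIO(text)))


def filter_manifest(rows, level, format_column="fileFormat"):
    """Keep only rows whose file format belongs to the given data level."""
    allowed = LEVEL_FORMATS[level]
    return [
        row for row in rows
        if row.get(format_column, "").strip().lower() in allowed
    ]


def write_manifest(rows, fieldnames):
    """Serialize the filtered rows back to CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Under these assumptions, a row for a previously generated manifest CSV would also drop out of both filtered manifests, since its format is in neither set. It does not address the duplicate-filehandle concern with reorganizing folders.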
Approved because LGTM, but if you could commit my suggested changes first that would be great! :)
Co-authored-by: Robert Allaway <[email protected]>
Additions mostly focused on processed data, though also fixed a broken ref.
So @allaway some data we're handling right now for KS upload is closer to "level 2" = ProcessedAlignedReads.
The "default" for most assays is the level 1 template, which you're using, and things like genomic reference are not in there. But I suppose we do still want to collect level 1 attributes like library prep somewhere as well.
Our version:
nf-metadata-dictionary/NF.jsonld
Lines 6022 to 6094 in a84b5e3
HTAN's closest version (DNA-seq level 2):