New "marker" module #24
Ok, the locus-specific stuff has been migrated, and most of the tests are passing. Still need to troubleshoot some bugs. But IMO the
I pulled your most recent commit and have a ton of failed tests now? Are you getting that too? I was going through code before I realized you had made the commit and found an error in the FGA function (I fixed it manually). After that I pulled the most recent commit and had a bunch fail.
Yes, I'm seeing a bunch of tests fail now too. My latest troubleshooting has focused on
Yeah, looks like the forward and UAS sequences are getting switched around. Loci that are failing need to be RCed.
The recent spike in failing tests for But yes, many of the remaining failures still look like revcom issues.
The output is also switched... you have You added a function that changed the uas.sequence to the RC and forward_sequence to the UAS output sequence.. so those are switched.
Regarding uas sequence vs forward sequence: I used to have But I think I've detected another possible issue just now: no reverse complementation should be done when
Regarding the forward vs reverse annotation, having it in the order I have now results in fewer errors, so...yeah, still need to figure out why things are mixed up.
Here's an example where I could use your help @rnmitchell. Check out the new
Ok, I was able to figure out the issue with the D1 locus. Only issues with the flanking annotations remain!!!
Ok great!! I was actually just going to work on it, so I'm glad you got it working. I've been having issues with git and it's been a huge pain to stash away changes I've made to pull your newest commits.
Ok awesome!! So I have one D3 sequence where the annotation is not matching to what was called previously (and what is "known"). Sequence: My guess is that D3 should be annotated using the sequence block approach and not a prioritized list of variants, but I need to look further into it. Just wanted to give you a heads up!
Is the D3 sequence above a full forward strand sequence?
It is a UAS region forward sequence (the sequences I've been checking with are from the UAS).
Ok, I added a new regression test for D3. Working on the subtleties.
Ok, new update fixes this case. Want to run it against your big data set again?
Updated the PR description. Ready for review and, if it passes the bigly private dataset, merge.
yes! it worked! woo hoo!
It wasn't letting me review it for some reason, but it looks good! So I'm going to merge. Thanks for your help!!
This update implements a new "marker" module that delegates all handling of UAS and non-UAS sequences to dedicated object classes. Generic behavior is defined in a base STRMarker class, and then locus-specific special handling of the sequence and flanks is delegated to subclasses such as STRMarker_FGA and STRMarker_CSF1PO. This doesn't decrease the lines of code per se, but the new organization makes it much easier to reason about the base functionality and the locus-specific special rules. Subsequent improvements to these bits of code will hopefully be much easier now.
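
For readers unfamiliar with the pattern, here is a minimal sketch of the base-class-plus-subclass organization described above. It is not the module's actual code: the `reverse_strand` flag, the `annotate` method, the `marker_for` factory, the `STRMarker_Hypothetical` subclass, and the rule for when reverse complementation happens are all assumptions made for illustration. Only the `STRMarker` name and the idea of locus-specific subclasses come from this PR.

```python
# Hypothetical sketch: attribute, method, and helper names are illustrative
# and are not taken from the actual marker module.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence."""
    return sequence.translate(COMPLEMENT)[::-1]


class STRMarker:
    """Base class holding behavior shared by all loci."""

    # Assumption: a subclass sets this when its UAS sequence is reported
    # on the reverse strand and must be flipped to get the forward strand.
    reverse_strand = False

    def __init__(self, locus, sequence, uas=False):
        self.locus = locus
        self.sequence = sequence
        self.uas = uas

    @property
    def forward_sequence(self):
        # Assumption: only UAS input for reverse-strand loci is flipped.
        if self.uas and self.reverse_strand:
            return reverse_complement(self.sequence)
        return self.sequence

    def annotate(self):
        """Generic annotation; subclasses may override with special rules."""
        return f"{self.locus}: {self.forward_sequence}"


class STRMarker_Hypothetical(STRMarker):
    """Stand-in for a locus-specific subclass such as STRMarker_FGA."""

    reverse_strand = True

    def annotate(self):
        # Placeholder for a locus-specific rule layered on the generic one.
        return f"[custom rule] {super().annotate()}"


def marker_for(locus, sequence, uas=False):
    """Pick the subclass registered for a locus, else the generic base."""
    registry = {"HYPO": STRMarker_Hypothetical}
    cls = registry.get(locus, STRMarker)
    return cls(locus, sequence, uas=uas)
```

With this layout, calling code only needs `marker_for(locus, sequence, uas=True).annotate()`; the generic path and the special cases live in separate classes, which is the readability benefit the description is pointing at.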