forked from ave-dcd/dcd_mapping
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
refactor: major edits to VRS mapping and supporting methods (#7)
- Loading branch information
1 parent
d39681e
commit 8c728f5
Showing
13 changed files
with
6,571 additions
and
134 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,24 @@ | ||
# Remaining TODO | ||
|
||
General: | ||
* notes scattered in comments and docstrings (sorry!) | ||
* Docker-ification. More or less needs to start from scratch (there should be existing images for SeqRepo and UTA, not sure how up-to-date they are). | ||
* Add extra stuff that appears in mapping JSON objects. | ||
* Currently using VRS 2.0a-based libraries. For lifting back to VRS 1.3, some basic post-processing should be fine (annoying but shouldn't be too trivial) | ||
* Without access to a production DynamoDB instance, Gene Normalizer will be quickest and easiest to set up via a PostgreSQL data backend. That, however, requires an extra dependency group (noted in README). We might want to make a `pg` dependency group here or just include it in core dependencies. | ||
* On that note, I've only done minimal testing of how possible it would be to drop the gene normalizer dependency entirely, but it'd be nice to get there. | ||
* Some of the singleton/factory stuff might be cleaner as `global`s | ||
|
||
Alignment: | ||
* Pretty sure this is mostly done. Haven't tested exhaustively, though. | ||
* Need to sufficiently mock/patch things in tests | ||
|
||
Transcript selection: | ||
* IndexError in calculating offset on lots of new (2023) scoresets. | ||
* Tests will need some extensive mocking (or cassettes?) for reliance on UTA and other external dependencies | ||
|
||
VRS mapping: | ||
* In general, this stuff is still pretty rough. Tests aren't passing yet. | ||
* Finish double-checking the SeqRepo storage workaround | ||
* A fair amount of small questions about conditions written to handle specific scoresets/edge cases | ||
* More testing. Can be ready for CI by manually patching the SequenceStore class (or using SeqRepoRESTDataProxy and cassettes). |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.