-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Develop 0.3.6 #19
Merged
Merged
Develop 0.3.6 #19
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…ith less than 10 reads back to the previous labels.
…i` because it is no longer being used.
…en an error occurs. In such cases, "Very Early Error" is indicated in the Title. Therefore, we have made it so that it returns False in those situations.
Due to alignment errors, there can be instances where a true match is mistakenly replaced with "insertion following a deletion". For example, although it should be "=C,=T", it gets replaced by "-C,+C|=T". In such cases, a process is performed to revert it back to "=C,=T".
…d in mutation_loci as mutations by replacing them with "@". For example, if there are no mutations in mutation_loci, "=G,=C,-C" and "~G,=G,=C" become "@,@,@" and "@,@,@" respectively, making them the same and ensuring they do not affect clustering.
…uence as a coverage
…ally returns True if the 'sample' shows more than 5% variation compared to the 'control'.
…d in mutation_loci as mutations by replacing them with "@". For example, if there are no mutations in mutation_loci, "=G,=C,-C" and "~G,=G,=C" become "@,@,@" and "@,@,@" respectively, making them the same and ensuring they do not affect clustering.
…rence between sample and control is more than 20%, it is unconditionally considered a mutation.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
v0.3.6 (2024-01-10)
📝 Documentation
🚀 Update
Preprocess
Updated
input_validator.py
: The UCSC Blat server sometimes returns a 200 HTTP status code even when an error occurs. In such cases, "Very Early Error" is indicated in the title. Therefore, we have made it so that it returns False in those situations. Commit DetailSimplified
homopolymer_handler.py
for error detection using cosine similarity. Commit DetailUpdated
mutation_extractor.py
to use cosine similarity to filter dissimilar loci. Commit DetailUpdated the
mutation_extractor.identify_dissimilar_loci
so that it unconditionally returns True if the 'sample' shows more than 5% variation compared to the 'control'. Commit DetailAdded
preprocess.midsv_caller.convert_consecutive_indels_to_match
: Due to alignment errors, instances where a true match is mistakenly replaced with "insertion following a deletion" are corrected. For example, "=C,=T" mistakenly replaced by "-C,+C|=T" is reverted back to "=C,=T". Commit DetailClassification
allele_merger.merge_minor_alleles
to reclassify alleles with fewer than 10 reads to suppress excessive subdivision of alleles. Commit DetailClustering
Added the function
merge_minor_cluster
to revert labels clustered with fewer than 10 reads back to the previous labels to suppress excessive subdivision of alleles. Commit DetailUpdated
generate_mutation_kmers
to consider indices not registered in mutation_loci as mutations by replacing them with "@". For example, "=G,=C,-C" and "=G,=G,=C" become "@,@,@" in both cases, making them the same and ensuring they do not affect clustering. Commit DetailConsensus
LocalOutlierFactor
to filter abnormal control reads. Commit Detail