Would like contig name remapping #22
Labels: enhancement, good first issue, help wanted
https://github.com/dpryan79/ChromosomeMappings provides an outstanding repo of contig name maps across builds.

In the simplest case, UCSC/Gencode naming calls the first human chromosome `chr1`, while Ensembl calls it `1`. It is not enough merely to slice off (or add) `chr`, however, because `chrM == MT`, and there are numerous unlocalized and unplaced contigs. In addition, UCSC and Gencode are only identical with respect to the basic 23 chromosomes; they have different names for alt/unloc/unplaced contigs.

Crossmap takes the naive approach of renaming based on the `chr` prefix, which is of course a huge help to users who face the very real problem of mismatching contig names, but an incomplete solution.

Here, I propose two possible remappings:
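For illustration only (a rough sketch, not a worked-out version of either proposal), the gap between prefix-based renaming and a table-driven lookup could look like the following. It assumes a two-column, tab-separated mapping in the style of the ChromosomeMappings files, where a row may have an empty second column when a contig has no counterpart. The names `SAMPLE_MAP`, `load_mapping`, `strip_prefix`, and `remap` are hypothetical, and the three sample rows are illustrative rather than a copy of any real mapping file.

```python
import csv
import io

# Illustrative excerpt in an assumed "source<TAB>target" layout.
SAMPLE_MAP = "chr1\t1\nchrM\tMT\nchrUn_GL000220v1\tGL000220.1\n"


def load_mapping(handle):
    """Read a two-column, tab-separated name map into a dict."""
    mapping = {}
    for row in csv.reader(handle, delimiter="\t"):
        # Skip blank rows and contigs that have no name in the target scheme.
        if len(row) < 2 or not row[1]:
            continue
        mapping[row[0]] = row[1]
    return mapping


def strip_prefix(name):
    """Naive renaming: drop a leading 'chr' and nothing else."""
    return name[3:] if name.startswith("chr") else name


def remap(name, mapping):
    """Table-driven renaming; returns None for contigs absent from the map."""
    return mapping.get(name)


mapping = load_mapping(io.StringIO(SAMPLE_MAP))
for contig in ("chr1", "chrM", "chrUn_GL000220v1"):
    print(contig, strip_prefix(contig), remap(contig, mapping))
```

The two approaches agree on `chr1` but diverge on `chrM` and on the unplaced contig, which is the case that prefix stripping cannot cover. Returning `None` for unknown contigs is one possible design choice; a real implementation might instead drop the record or raise an error.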