kseq parser #6

harishankarv · 2020-03-14T20:07:50Z

Clean up kseqparser and document the open issues below as comments in the code itself:
What happens when an individual sequence is longer than 4096 characters?
When should encoding happen: during parsing, or afterwards?
The encoding needs to handle the N character.
The sliding-window logic needs to handle the N character.

harishankarv · 2020-03-19T02:03:14Z

kseq.h expects to read a whole file (offset = beginning of file), but we are giving it offsets into the middle of the file. When a thread starts parsing from the middle of the file, kseq.h skips characters until it encounters the next record (the next ">" character). We need to handle this.
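To make the mid-file case concrete, here is a minimal sketch of the "skip to the next record" step for FASTA input. The helper `next_record_start` is hypothetical and is not part of kseq.h; it also assumes the data is already in memory and that a header is a ">" at the start of a line, which is stricter than the description above.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical helper (not part of kseq.h): return the offset of the first
// FASTA header ('>' at the start of a line) at or after `offset`.
// Returns data.size() when no further record exists.
std::size_t next_record_start(const std::string& data, std::size_t offset) {
    assert(offset <= data.size());
    if (offset == data.size()) return data.size();
    // The offset itself may already sit on a header.
    if (data[offset] == '>' && (offset == 0 || data[offset - 1] == '\n')) {
        return offset;
    }
    // Otherwise look for the next newline that is followed by '>'.
    const std::size_t pos = data.find("\n>", offset);
    return pos == std::string::npos ? data.size() : pos + 1;
}
```

One possible ownership rule built on this: a thread assigned the byte range [begin, end) parses only the records whose headers start in [next_record_start(begin), next_record_start(end)), so each record is handled by exactly one thread.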

utsavjainb · 2020-03-23T18:11:53Z

Clean up kseqparser and document the open issues below as comments in the code itself.

What happens when an individual sequence is longer than 4096 characters?
kseq_read is able to read long sequences into its buffer.
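As an illustration of why a fixed I/O buffer size need not cap the sequence length, the sketch below pulls input through a 4096-byte chunk buffer while appending to a growable string. It is not kseq.h code: `read_first_sequence` and its parameters are hypothetical, and it handles only the first FASTA record of a stream.

```cpp
#include <cassert>
#include <cstddef>
#include <istream>
#include <sstream>  // std::istringstream, used in the usage example
#include <string>

// Hypothetical sketch: read the sequence of the first FASTA record from `in`.
// Input arrives in fixed-size chunks; the sequence itself grows as needed.
std::string read_first_sequence(std::istream& in, std::size_t chunk_size = 4096) {
    assert(chunk_size > 0);
    std::string chunk(chunk_size, '\0');
    std::string seq;
    bool in_header = false;  // true while skipping the rest of a '>' line
    int records_seen = 0;
    while (in.read(&chunk[0], static_cast<std::streamsize>(chunk.size())) ||
           in.gcount() > 0) {
        const std::size_t n = static_cast<std::size_t>(in.gcount());
        for (std::size_t i = 0; i < n; ++i) {
            const char c = chunk[i];
            if (in_header) {
                if (c == '\n') in_header = false;
                continue;
            }
            if (c == '>') {
                if (++records_seen > 1) return seq;  // second record begins
                in_header = true;
                continue;
            }
            if (records_seen == 1 && c != '\n' && c != '\r') seq.push_back(c);
        }
    }
    return seq;
}
```

For example, feeding it a `std::istringstream` holding one 10000-base record followed by a second record returns a 10000-character string, even though no single read exceeds 4096 bytes.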

When should encoding happen: during parsing, or afterwards?
As of now, each individual sequence is read into the kseq buffer and then fully encoded into a DnaBitset object (3-bit encoding).
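A rough sketch of what a 3-bit encoding could look like follows. This is not the DnaBitset implementation: the only code taken from this thread is N = 100, while the codes for A, C, G and T, the 21-bases-per-word layout, and the names `encode_base`, `pack_3bit` and `code_at` are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

constexpr std::size_t kBasesPerWord = 21;  // 21 * 3 = 63 bits used per word
constexpr std::uint8_t kCodeN = 0b100;     // N code as stated in the thread

// Assumed mapping for A/C/G/T; anything else is treated as N.
std::uint8_t encode_base(char c) {
    switch (c) {
        case 'A': case 'a': return 0b000;
        case 'C': case 'c': return 0b001;
        case 'G': case 'g': return 0b010;
        case 'T': case 't': return 0b011;
        default:            return kCodeN;
    }
}

// Pack a whole sequence into 64-bit words, lowest bits first.
std::vector<std::uint64_t> pack_3bit(const std::string& seq) {
    std::vector<std::uint64_t> words((seq.size() + kBasesPerWord - 1) / kBasesPerWord, 0);
    for (std::size_t i = 0; i < seq.size(); ++i) {
        const std::uint64_t code = encode_base(seq[i]);
        words[i / kBasesPerWord] |= code << (3 * (i % kBasesPerWord));
    }
    return words;
}

// Read back the 3-bit code of base i.
std::uint8_t code_at(const std::vector<std::uint64_t>& words, std::size_t i) {
    assert(i / kBasesPerWord < words.size());
    return static_cast<std::uint8_t>((words[i / kBasesPerWord] >> (3 * (i % kBasesPerWord))) & 0b111);
}
```

Encoding the full sequence after `kseq_read` returns, as described above, keeps parsing and encoding separate; encoding while parsing would avoid a second pass over the buffer at the cost of coupling the two steps.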

The encoding needs to handle the N character.

Sliding-window logic: handling of the N character.
The parser enqueues k-mers of length k; if an N character (encoded as 100) is found, the pointer is shifted.
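One way to read "the pointer is shifted" is that the window restarts just past the N, so no emitted k-mer contains an N. The sketch below shows that interpretation on plain characters rather than on the encoded bitset; `kmers_skipping_n` is a hypothetical name, and it collects k-mers into a vector where the real code enqueues them.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical sketch: return every k-mer of `seq` that contains no N.
std::vector<std::string> kmers_skipping_n(const std::string& seq, std::size_t k) {
    assert(k > 0);
    std::vector<std::string> out;
    std::size_t run_start = 0;  // first index of the current N-free run
    for (std::size_t i = 0; i < seq.size(); ++i) {
        if (seq[i] == 'N' || seq[i] == 'n') {
            run_start = i + 1;  // shift the window start past the N
            continue;
        }
        // Emit the k-mer ending at i once the run holds at least k bases.
        if (i + 1 - run_start >= k) {
            out.push_back(seq.substr(i + 1 - k, k));
        }
    }
    return out;
}
```

With this rule, a single N suppresses the k windows that would overlap it, and enumeration resumes as soon as k clean bases have been seen again.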

utsavjainb · 2020-03-23T20:11:56Z

kseq.h expects to read a whole file (offset = beginning of file), but we are giving it offsets into the middle of the file. When a thread starts parsing from the middle of the file, kseq.h skips characters until it encounters the next record (the next ">" character). We need to handle this.

kseq_read handles this.

harishankarv assigned utsavjainb Mar 19, 2020

arkivm unassigned utsavjainb Oct 5, 2020
