Major rewrite to increase speed, eliminate redundant steps, and parallelize barcode extraction. Steps 1-4 were completely rewritten to replace search-based barcode extraction with position-based barcode extraction. On our cluster, these modifications resulted in a 63X increase in speed when run on a single core and a 67X increase when run on 4 cores. This faster version is accessible using --version fast.
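To illustrate the difference between the two extraction strategies, here is a minimal Python sketch. It is not the code used in this release: the read layout (an 8 bp barcode at the start of the read followed by a "GTCA" anchor), the constants, and the function names extract_by_search, extract_by_position, and extract_parallel are all hypothetical and chosen only for illustration.

```python
from concurrent.futures import ProcessPoolExecutor

# Hypothetical read layout (an assumption, not taken from the tool):
# [8 bp barcode][anchor "GTCA"][insert sequence ...]
ANCHOR = "GTCA"
BARCODE_START = 0
BARCODE_LEN = 8


def extract_by_search(read, anchor=ANCHOR, barcode_len=BARCODE_LEN):
    """Search-based: scan the read for the anchor, then take the bases before it."""
    idx = read.find(anchor, barcode_len)  # scanning cost grows with read length
    if idx == -1:
        return None  # anchor not found
    return read[idx - barcode_len:idx]


def extract_by_position(read, start=BARCODE_START, length=BARCODE_LEN):
    """Position-based: slice a fixed window, with no scanning of the read."""
    if len(read) < start + length:
        return None  # read too short to contain the barcode window
    return read[start:start + length]


def extract_parallel(reads, workers=4, chunksize=10_000):
    """Spread position-based extraction over several worker processes."""
    with ProcessPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(extract_by_position, reads, chunksize=chunksize))
```

Position-based extraction only works when the barcode sits at a known, fixed offset in every read, which is the assumption this sketch makes. When calling extract_parallel from a script, place the call under an `if __name__ == "__main__":` guard so that worker processes can be started safely on platforms that spawn new interpreters.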