-
Notifications
You must be signed in to change notification settings - Fork 506
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
STARsolo: support for multiple (3) barcode locations #838
Comments
Hi Gert, sorry for the belayed reply. Please let me know if you have any issues with these paraemeters. If they do not work, I could look at a few thousand reads to tweak them. Cheers |
Hi Alex, We have tested this and it does indeed work. We used the settings --soloCBposition 0_0_0_9 0_20_0_29 0_40_0_49 however. Florian |
Would it be possible to write the corrected cell barcode to the SAM attributes too? We use the following settings:
|
Hi Florian, you are right, the positions are 0-based, had to check my code to make sure. :( Thanks! |
Hello Alex, Our library does not have a UMI, so we had to input a dummy UMI setting to make STARsolo work. I believe we used --soloUMIposition 0_0_0_1, where these two bases are not really random. I was getting core dumps if I did not enter a UMI position. Florian |
Hi Gert, the CB tags do not work presently for the "complex" CBU_UMI barcodes... Cheers |
Hi Florian on the 2nd thought, I do not think replacing UMI with a constant 2b sequence is going to work, as all of the will "collapse" into 1 read, so you will have no more than 1 read per cell. I will need to implement an option to count all reads without collapsing UMIs. Cheers |
Hello Alex, We have also previously considered pre-processing the barcode read and to use simple CB/UMI. Our current design uses 96x96x96 barcode possibilities (some 880k unique barcodes) and a simple CB/UMI approach would work well. However, we might consider scaling up to 384x384x384 in the future. If I understand correctly, working with a barcode whitelist that spans 56m possibilities could be computationally challenging, which is why CB/UMI complex is so attractive. We are currently not concerned about the collapsing of reads in the expression matrix since we are mostly interested in the .bam file, but the option to run STARsolo without UMI might be helpful to demultiplex single cell sequencing libraries not stemming from scRNA-seq experiments in the future. Florian |
Hi Florian, actually, the 56m barcode list should not create serious problems, so I would try creating the 30b cell barcode, as it's generally easier to handle. Cheers |
@alexdobin In the past a 30 bp barcode definetely would not work with STAR as the longest CB supported by STAR is/was 16 bp (due to the use of a 32 bit integer). So barcodes could collapse (in the past all nucleotides except the last 16 would be Does the code use a 64 bit integer already for the CB? See an old pull request: #588 |
I can confirm that a 30 bp cell barcode works with |
Hi Gert, I pulled in your request in 2.7.1a, it should work now. Cheers |
Hi Alex,
We have a custom inhouse design which requires 3 separate barcode locations (10bp each) with a different whilelist for each barcode separated by 2 adapters.
Read 2:
with:
BC1
,BC2
,BC3
= 10bp, each with a different white listUMI
= 8 bpAny chance to get support for something like this in STARsolo?
The text was updated successfully, but these errors were encountered: