Can AMULET identify doublets with relatively low read counts from cells with a wide range of read counts

Hi! I am applying AMULET on snATAC-seq data generated from my lab. For a dataset like this, after filtering with TSSE >= 10 and log10(UQ + 1) >= 3 (UQ means the number of uniquely mapped fragments), the cells still have a wide range of UQ.
<img width="677" alt="屏幕快照 2022-04-04 14 23 37" src="https://user-images.githubusercontent.com/63088388/161634558-78154195-d64e-4224-925b-2f02c6d60aad.png">
Since the number of overlaps (genomic regions with >2 overlapping reads) for a cell positively correlates with the UQ of the cell, cells have high UQ tend to have a larger number of overlaps. Because the majority of cells fall in the range of 3 < log10(UQ + 1) < 4, I believe the majority of doublets are also there (at least with the same order of magnitude). However, with a doublet formed by two singlets each with UQ = 3000 and a singlet with UQ = 30000, AMULET would very probably locate more overlaps in the singlet than in the doublet. The cells whose UQ > 10000 tend to have more overlaps and are thus more likely to be identified as doublets.
The AMULET paper indicates that 25K median valid reads per nucleus is optimal, but is there a way to maximize the probability of finding doublets in the range of 3 < log10(UQ + 1) < 4 in this dataset given that there are cells in range 4 < log10(UQ + 1) < 5?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Can AMULET identify doublets with relatively low read counts from cells with a wide range of read counts #17

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Can AMULET identify doublets with relatively low read counts from cells with a wide range of read counts #17

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions