Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to avoid to getting repeat region ? #78

Open
panxiaoguang opened this issue Jan 25, 2022 · 1 comment
Open

how to avoid to getting repeat region ? #78

panxiaoguang opened this issue Jan 25, 2022 · 1 comment

Comments

@panxiaoguang
Copy link

Hi Inigo,

I know that you use get_mate_intervals to build realign intervals, and you will use all the soft-clipped/hard-clipped reads and discordant reads in this function. But we know that if some eccDNA reads were from repeat regions (actually it's common), the aligner such as bwa will have multi-align reads including supplement reads and discordant reads mostly similar to eccDNA reads. Is it right?

So how can we avoid getting repeat regions in eccDNA? Will it work if I use --only_discordant?

Best wishes.
Pan xiaoguang

@iprada
Copy link
Owner

iprada commented Feb 1, 2022

Hi,

Read alignments to repeats and multi mapped regions should be handled by the probabilistic model. Reads with low probabilities are filtered out by default.

best,

Inigo

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants