Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve the bed split method #47

Closed
mfoll opened this issue Oct 8, 2015 · 3 comments
Closed

Improve the bed split method #47

mfoll opened this issue Oct 8, 2015 · 3 comments
Assignees
Milestone

Comments

@mfoll
Copy link
Member

mfoll commented Oct 8, 2015

My R script is not splitting regions in the bed file but is rather grouping them to launch in parallel. This is good for exomes as the bed file contains many line (~ one per exon), but the user should also have the option to allow cutting lines of the bed to make calculation faster when they contain only a few large lines. This is related to #39 where for full genomes there is no bed so we will have to cut the chromosomes in chunks.

@mfoll mfoll self-assigned this Oct 8, 2015
@mfoll mfoll added this to the v1.0 milestone Oct 8, 2015
@mfoll
Copy link
Member Author

mfoll commented Oct 8, 2015

Of course splitting a region is not always good if you split right in the middle of an indel, so it should remain the user choice (like with an option allow_bed_cut or something like this).

@mfoll
Copy link
Member Author

mfoll commented Oct 8, 2015

Actually I'm not sure my last comment is true because:

  • we don't consider MNP
  • samtools keep the full reads overlapping a position and then indels will never be cut

@mfoll mfoll assigned tdelhomme and unassigned mfoll Oct 8, 2015
@mfoll
Copy link
Member Author

mfoll commented Dec 11, 2015

In this paper they seem to have the right way of doing it with "bins of equal depth":
http://biorxiv.org/content/biorxiv/early/2014/12/05/012179.full.pdf

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants