Question about output file #10

Open
aakashsur opened this issue Mar 7, 2019 · 1 comment

Comments

@aakashsur

So after step 6, is there a way to go from the output file:

HWI-ST279:283:D1ACDACXX:8:1101:1345:16298       chr11   1955000 chr4    1155000 0.04448364631169624
HWI-ST279:283:D1ACDACXX:8:1101:1345:16298       chr11   45000   chr11   1955000 0.5951475510998886
HWI-ST279:283:D1ACDACXX:8:1101:1345:16298       chr7    35000   chr11   1955000 0.36036880258841525
HWI-ST279:283:D1ACDACXX:8:1101:1345:54161       chr8    455000  chr4    945000  0.2559876018501595
HWI-ST279:283:D1ACDACXX:8:1101:1345:54161       chr8    455000  chr4    965000  0.7440123981498405

To a count format where the multi-mapping reads have been assigned to the highest probability location?

@yezhengSTAT
Owner

Glad that you have finished all the steps in mHi-C. Yes, after mHi-C processing you will need to filter the output based on your own needs. We recommend filtering on the probability column (the 6th column) with a threshold of >0.5. Because the probabilities of the candidate alignments of one multi-read sum to 1, at most one of them can exceed 0.5, so each multi-read ends up with at most one selected alignment position. You can also use >0.6 or >0.9 for more stringent filtering. Then merge the result with your uni-reads bin-pair count file to get the interaction counts for each bin-pair. You will need to write your own code, but it can be done with a few shell commands. I can add a few recommended commands to the manual later.
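As a rough illustration of the filter-then-merge idea described above, here is a minimal Python sketch. It is not part of mHi-C. The function names `count_multi_reads` and `merge_counts` are hypothetical, the input is assumed to be whitespace-separated with the six columns shown in the question (read ID, chromosome 1, bin 1, chromosome 2, bin 2, probability), and the uni-read counts are assumed to be already loaded into a dictionary keyed by bin-pair, since the format of the uni-reads count file is not given in this thread.

```python
from collections import Counter


def count_multi_reads(lines, threshold=0.5):
    """Count bin-pairs over alignments whose probability exceeds `threshold`.

    Assumes each line has six whitespace-separated fields:
    read_id, chr1, bin1, chr2, bin2, probability.
    Bin-pairs are kept in the order they appear; no reordering is done.
    """
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) != 6:
            continue  # skip blank or malformed lines
        _read_id, chr1, bin1, chr2, bin2, prob = fields
        if float(prob) > threshold:
            counts[(chr1, int(bin1), chr2, int(bin2))] += 1
    return counts


def merge_counts(uni_counts, multi_counts):
    """Add multi-read counts to uni-read counts, bin-pair by bin-pair."""
    merged = Counter(uni_counts)
    merged.update(multi_counts)  # Counter.update adds counts together
    return merged


# The five lines from the question above.
example = """\
HWI-ST279:283:D1ACDACXX:8:1101:1345:16298 chr11 1955000 chr4 1155000 0.04448364631169624
HWI-ST279:283:D1ACDACXX:8:1101:1345:16298 chr11 45000 chr11 1955000 0.5951475510998886
HWI-ST279:283:D1ACDACXX:8:1101:1345:16298 chr7 35000 chr11 1955000 0.36036880258841525
HWI-ST279:283:D1ACDACXX:8:1101:1345:54161 chr8 455000 chr4 945000 0.2559876018501595
HWI-ST279:283:D1ACDACXX:8:1101:1345:54161 chr8 455000 chr4 965000 0.7440123981498405
"""

multi = count_multi_reads(example.splitlines())

# Hypothetical uni-read counts, made up purely for illustration.
uni = {("chr8", 455000, "chr4", 965000): 3}
merged = merge_counts(uni, multi)

for (chr1, bin1, chr2, bin2), n in sorted(merged.items()):
    print(chr1, bin1, chr2, bin2, n, sep="\t")
```

Raising `threshold` to 0.6 or 0.9 gives the more stringent variants. If your uni-reads file orders the two bins of a pair differently from the multi-read output, the keys would need to be put into a consistent order before merging.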
