-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merging replicates #19
Comments
Hi Pavla, Thanks, |
Hi, Thank you |
Hello Pavla, Thanks, |
Hello Ye, Thank you, |
Hello Pavla,
Hope it helps. P.S. if you observe fewer interactions after adding multi-mapping reads, maybe you should check if the uni-mapping reads have changed or visualize it to see if it is because the background signal changes (background signal would be higher with more multi-mapping reads). Thanks, |
Hi Ye,
So for using your suggested code (to concatenate the replicates), I am going to use the file "HiChIP_2.validPairs.binPairCount.uni" in place of $rep2.binpair? After this merging step, How can I restart from step 4.3 Uni-binPair contact matrix normalization/preparation for step 5 prior building? At least that is how I understand it......Or could I start directly from step 5? Thanks, |
Hi Ye, Thank you, |
Hello Pavla, I guess if you can prepare the input files for step5, you should be good to move on. Thanks, |
Hi Ye,
and use this merged file in place of $contactFile in step 4.3 of the s4_bin.sh
and continue to step 5. Thank you! |
Yes, it looks reasonable to me! Best, |
Hi Ye, Thanks |
I see your confusion. Yes, it could be quite messy to merge all kinds of intermediate files and modify the code to start in the middle of step 4. I think maybe there is another way out. The initial need is to remove the duplicates but still keep the read pairs from the same location but different replicates. Maybe you can consider adding a suffix such as "_rep1", "_rep2" in column 4 (strand column) and column 9. Then concatenate .validPairs files from different replicas. The duplicates are removed based on column2, 3, 4, 7, 8, 9. With the additional suffix, we should be able to keep the overlapped read pairs across replicates. Going forward, columns 4 and 9 won't be used in forming the binPair files, therefore should be fine to leave the suffix there. In other words, after concatenating the .validPairs with an additional replicate suffix, we should be able to run step4_bin.sh without adjustment. Let me know if you run into further problems! Hope this time, it works! Thanks, |
OK, Ye. Let`s try, but I have to make sure I understand well:
Is this what I should do? Thank you for helping me to solve this :), |
Yes, exactly. |
Hi,
I plan using your tool to analyze HiChIP data (followed by the FithHiChIP tool).
At what step and how is it possible to merge replicates (which have been checked for reproducibility by HiCRep)? I would do it after removing duplicates, but am not sure if I can simply merge .ValidPair files.
Thanks, Pavla
The text was updated successfully, but these errors were encountered: