Skip to content

Commit

Permalink
add fileinfo to readme
Browse files Browse the repository at this point in the history
  • Loading branch information
Smetanin Alexander committed Apr 23, 2020
1 parent e94def1 commit c28087a
Show file tree
Hide file tree
Showing 2 changed files with 17 additions and 4 deletions.
19 changes: 16 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,8 +18,8 @@ cat <vcf_file> | bcftools view -c 1 -Ou | bcftools +fill-tags -Ou -- -S configs/

In case vcf file is (b)gzipped use samtools tabix.

3. Then use main script.
Currently supported modes: bayes, fb.
### Script usage:
Currently supported modes: bayes, fb, softmax.
Note: fb is around 20 times slower.

```bash
Expand All @@ -41,4 +41,17 @@ for vcf file with around 120k SNPs.
|--|--|--|
|fb | 20 | ? |
|bayes| 1 | ? |
|softmax| 0.5 | ? |
|softmax| 0.5 | ? |



### Files explanation
As a result of the pipeline we get 3 files:
1. `<group>_<mode>_<window-len>_predictions.csv`
Csv file with a list of most probable population in each window.

2. `<group>_<mode>_<window-len>_snp_prob.tsv`
Tsv (tab-separated) file with a list of all SNPs and probabilities that it came from each population.

3. `<group>_<mode>_<window-len>_stats.csv`
Csv file with statistics that shows the fraction of windows assigned to each population.
2 changes: 1 addition & 1 deletion src/process_individuals.py
Original file line number Diff line number Diff line change
Expand Up @@ -168,7 +168,7 @@ def main(file, win_len, mode, n_threads=4):
base_path = Path(file).parent

input_file = str(file_path)
snp_prob_file = f'{base_path}/{group}_{mode}_{win_len}_snp_prob.txt'
snp_prob_file = f'{base_path}/{group}_{mode}_{win_len}_snp_prob.tsv'
prediction_file = f'{base_path}/{group}_{mode}_{win_len}_predictions.csv'
stats_file = f'{base_path}/{group}_{mode}_{win_len}_stats.csv'

Expand Down

0 comments on commit c28087a

Please sign in to comment.