Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PubMLST URL change breaks pubmlstget #304

Closed
rpetit3 opened this issue Aug 17, 2020 · 2 comments
Closed

PubMLST URL change breaks pubmlstget #304

rpetit3 opened this issue Aug 17, 2020 · 2 comments

Comments

@rpetit3
Copy link
Contributor

rpetit3 commented Aug 17, 2020

ariba pubmlstget "Staphylococcus aureus" datasets/species-specific/staphylococcus-aureus/mlst/default/ariba'

WARNING: I tried to get the version of spades with: "/home/rpetit/miniconda3/envs/bactopia/bin/spades.py --version" and the output didn't match this regular expression: "SPAdes\s+v([0-9\.]+)"
Traceback (most recent call last):
  File "/home/rpetit/miniconda3/envs/bactopia/bin/ariba", line 312, in <module>
    args.func(args)
  File "/home/rpetit/miniconda3/envs/bactopia/lib/python3.6/site-packages/ariba/tasks/pubmlstget.py", line 11, in run
    preparer.run()
  File "/home/rpetit/miniconda3/envs/bactopia/lib/python3.6/site-packages/ariba/pubmlst_ref_preparer.py", line 85, in run
    self._load_fasta_files_and_write_clusters_file(self.mlst_download_dir)
  File "/home/rpetit/miniconda3/envs/bactopia/lib/python3.6/site-packages/ariba/pubmlst_ref_preparer.py", line 49, in _load_fasta_files_and_write_clusters_file
    raise Error('Cannot find file "' + infile + '" for gene ' + gene_name)

verbose output

Downloading "http://pubmlst.org/data/dbases.xml" and saving as "/home/rpetit/test-grounds/tmp.get_pubmlst_xmlxwc5g9c5/out.xml" ...
Downloading "https://rest.pubmlst.org/db/pubmlst_saureus_seqdef/schemes/1/profiles_csv" and saving as "datasets/species-specific/staphylococcus-aureus/mlst/default/ariba/pubmlst_download/profile.txt" ...
Downloading "https://rest.pubmlst.org/db/pubmlst_saureus_seqdef/loci/arcC/alleles_fasta" and saving as "datasets/species-specific/staphylococcus-aureus/mlst/default/ariba/pubmlst_download/alleles_fasta.tmp" ...
Downloading "https://rest.pubmlst.org/db/pubmlst_saureus_seqdef/loci/aroE/alleles_fasta" and saving as "datasets/species-specific/staphylococcus-aureus/mlst/default/ariba/pubmlst_download/alleles_fasta.tmp" ...
Downloading "https://rest.pubmlst.org/db/pubmlst_saureus_seqdef/loci/glpF/alleles_fasta" and saving as "datasets/species-specific/staphylococcus-aureus/mlst/default/ariba/pubmlst_download/alleles_fasta.tmp" ...
Downloading "https://rest.pubmlst.org/db/pubmlst_saureus_seqdef/loci/gmk/alleles_fasta" and saving as "datasets/species-specific/staphylococcus-aureus/mlst/default/ariba/pubmlst_download/alleles_fasta.tmp" ...
Downloading "https://rest.pubmlst.org/db/pubmlst_saureus_seqdef/loci/pta/alleles_fasta" and saving as "datasets/species-specific/staphylococcus-aureus/mlst/default/ariba/pubmlst_download/alleles_fasta.tmp" ...
Downloading "https://rest.pubmlst.org/db/pubmlst_saureus_seqdef/loci/tpi/alleles_fasta" and saving as "datasets/species-specific/staphylococcus-aureus/mlst/default/ariba/pubmlst_download/alleles_fasta.tmp" ...
Downloading "https://rest.pubmlst.org/db/pubmlst_saureus_seqdef/loci/yqiL/alleles_fasta" and saving as "datasets/species-specific/staphylococcus-aureus/mlst/default/ariba/pubmlst_download/alleles_fasta.tmp"

Looks like the change in API endpoint is causing all the files are being named alleles_fasta (alleles_fasta) instead of the expected loci_name.tfa (arcC.tfa).

Getting the same error for multiple species

@rpetit3
Copy link
Contributor Author

rpetit3 commented Aug 20, 2020

This is fixed on the Bioconda side. You will need to update Ariba in your Conda environment.

conda update -c conda-forge -c bioconda ariba

You should see something like:

The following packages will be UPDATED:

  ariba                               2.14.5-py36hf0b53f7_1 --> 2.14.5-py36hf0b53f7_2

After update:

ariba pubmlstget "Staphylococcus aureus" test-pubs
WARNING: I tried to get the version of spades with: "/local/home/rpetit/miniconda3/envs/bactopia/bin/spades.py --version" and the output didn't match this regular expression: "SPAdes\s+v([0-9\.]+)"
WARNING: Median sequence length is 456 but arcC.548 has length 522 which is too long or short. Removing.
WARNING: Median sequence length is 456 but arcC.567 has length 516 which is too long or short. Removing.
ariba db directory prepared. You can use it like this:
ariba run test-pubs/ref_db reads_1.fq reads_2.fq output_directory

@puethe
Copy link
Contributor

puethe commented Sep 7, 2020

Solved with #305

@puethe puethe closed this as completed Sep 7, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants