Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NCBI AST parser #1

Open
jhawkey opened this issue Jan 14, 2025 · 4 comments
Open

NCBI AST parser #1

jhawkey opened this issue Jan 14, 2025 · 4 comments
Assignees

Comments

@jhawkey
Copy link
Contributor

jhawkey commented Jan 14, 2025

make a parser for AST data downloaded from NCBI

@katholt
Copy link
Contributor

katholt commented Jan 17, 2025

created basic function:
https://github.com/interpretAMR/AMRgen/blob/main/R/import_pheno.R

to import NCBI AST files, parse antibiotic/species/mic/disk data into relevant classes, and optionally interpret them with as.sir (on by default)

desirable features to add:

  1. if no testing standard is specified, EUCAST will be used but it would be good to allow more options for this interpretation step in future - added parameter 'default_guideline' however currently assumes only 'EUCAST' or 'CLSI' are allowed, and all values are interpreted using the human breakpoint (if interpret=T) and/or ECOFF (if interpret=T), would be good to expand options
  2. currently the input is a file path to a NCBI AST file, would be good to allow this to be a dataframe already read in by the user - DONE
  3. would be good to support downloading latest NCBI AST file for a given query organism (e.g. from https://www.ncbi.nlm.nih.gov/pathogens/ast#Pseudomonas%20aeruginosa)

@katholt
Copy link
Contributor

katholt commented Jan 24, 2025

forgot to include the measurement sign in the new column 'mic', of class 'mic'
this is now done properly (prior to the optional interpretation step)

@katholt
Copy link
Contributor

katholt commented Jan 25, 2025

added interpretation with ecoff rather than breakpoint, reported in new column 'ecoff' as NWT/WT

@katholt
Copy link
Contributor

katholt commented Jan 25, 2025

remaining to-do list:

  1. expand options for interpreting breakpoints (e.g. allow animal breakpoints to be used)
  2. add support for downloading latest NCBI AST file for a given query organism (e.g. from https://www.ncbi.nlm.nih.gov/pathogens/ast#Pseudomonas%20aeruginosa)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants