Searchs independent terms against different databases and retrieves gene sequences from:
- Ruby runtime environment
- Windows
- Mac OSX and Linux
- Bundle Gem -
gem install bundle
- Bioruby gem
- Run
bundle install --path vendor/bundle
to install dependencies (currently only Bioruby) - Create a
keys.txt
file (either by copying keys.txt.example or creating a blank one)
- Add query terms to keys.txt (separated by new lines)
- Create a
config.yml
file (either by copying keys.txt.example or creating a blank one)
- Open the file and change options (if need be)
- Run
bundle exec ruby script.rb
to search and download all the associated genes
- If you don't install gems locally then just run
ruby script.rb
YML syntax is used to configure GeneExtractor. It is an hierarchical file that uses indentation to define children attribute or lists.
- email: user's valid email address necessary to use NCBI Rest API
- output:
- dir: parent folder to place results from GeneExtractor
- data_prefix: add an additional fodler level with date and time when GeneExtractor was executed
- kegg: folder name for kegg results
- ncbi: folder name for ncbi results
- search:
- ncbi: list of fields that should be searched in NCBI (each field)
email: [email protected]
output:
dir: queries
date_prefix: true
kegg: kegg
ncbi: ncbi
search:
ncbi:
- Protein name
- Gene name
- Title
This tool was created as a part of FCT grant SFRH/BD/97415/2013 and European Commission research project BacHBerry (FP7- 613793)