In this repository are all the scripts for PREGO gathering data module. Scripts are organized based on the resources, each resource has its' own collection of scripts. Included are functions for API calls, FTP access urls and association extraction.
Currently, 24/12/2021, there are five supported resousces, BioProject, JGI IMG, MG-RAST, MGnify and Struo pipeline.
Each resource has scripts with the prefix *get*
, *extract*
and *functions*
.
*get*
contains the API calls and transformations of data and
*extract*
scripts perform the Named Entity Recognition using the EXTRACT tagger,
find the associations and then calculate the score for each association.