Star2xml v1.0.0 [Project v0.0.0]

@M-casado M-casado released this 28 Apr 15:33
· 605 commits to main since this release
21ea2d3

[1.0.0] - 2021-04-26

Added

  • ega-metadata-schema/README.md - Main README of the project with all its related information.
  • Star2xml/README.md - Secondary README of the project, containing all the information related to the star2xml tool (usage, prerequisites, scripts...).
  • LICENSE - License applied to the ega-metadata-schema project.
  • Descriptive XMLs - Folder containing (at least) one descriptive XML (with instructions on what information corresponds to which nodes of the XML) for each metadata object.
  • True XMLs - Folder containing (at least) one "true" XML (with fabricated values and little to no comments, to show what an XML ready to be submitted would look like) for each metadata object.
  • EGA_Array_based_Format_V4.3.xlsx - Spreadsheet used as template to store and submit Array Format (AF) metadata.
  • EGA_metadata_submission_template_v1.xlsx - Spreadsheet used as template to store metadata related to Sequence data. This spreadsheet can be used as input to the star2xml tool.
  • star2xml.py - Main wrapper (Python script) to call the project. Can either generate XMLs based on a given tabular input, or both create them and call the secondary wrapper validateXML.py to validate them.
  • validateXML.py - Secondary wrapper (Python script) of the project. Will validate the given XMLs against the given XML schemas (.xsd files).
  • requirements.txt - Text file containing all required packages (with their tested versions) to run the project.
  • input_configuration.yaml - Configuration file (YAML) containing the required fields (column names) that must appear in the input file.
  • xml_schema.yaml - Configuration file (YAML) containing general information about the project (e.g. ENA's GitHub) and the structure of each metadata object's XML, upon which XML_creator.py will iterate to construct XMLs.
  • Input_reader.py - Python script to transform the given input file into a dataframe.
  • XML_creator.py - Python script to construct an XML based on the given input dataframe and the XML structure from xml_schemas.yaml.
  • ftp_downloader.py - Python script with the required tools to download ENA's XML schemas (.xsd files) from their FTP server.
  • git_downloader.py - Python script with the required tools to download ENA's XML schemas (.xsd files) from their GitHub repository.
  • scripts_for_validation.py - Python script with the required tools to validate an XML against a given XML schema (.xsd file).
  • utils.py - Python script with diverse functions used by other scripts (e.g. report_error_messages()).
  • Several images within miscellaneous/ to be referenced by star2xml's README.
  • Two mock example XMLs (test_run.xml and test_sample.xml) to check what the output of the tool could be.
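
As a rough illustration of the idea behind XML_creator.py (iterating over a configured XML structure to build one node per input row), here is a minimal sketch using only Python's standard library. It is not the project's actual code: the `SAMPLE_STRUCTURE` dictionary, the `build_xml()` function, the column names and the element names are all hypothetical stand-ins, and a plain list of dictionaries replaces the dataframe.

```python
import xml.etree.ElementTree as ET

# Hypothetical structure, standing in for what a YAML config might describe:
# element and attribute names mapped to the input columns that fill them.
SAMPLE_STRUCTURE = {
    "root": "SAMPLE_SET",
    "object": "SAMPLE",
    "attributes": {"alias": "sample_alias"},
    "children": {"TITLE": "title", "DESCRIPTION": "description"},
}


def build_xml(rows, structure):
    """Build one object node per input row, following the given structure."""
    root = ET.Element(structure["root"])
    for row in rows:
        node = ET.SubElement(root, structure["object"])
        # Attributes are treated as required in this sketch (KeyError if absent).
        for attr, column in structure["attributes"].items():
            node.set(attr, str(row[column]))
        # Child elements are optional: empty or missing values are skipped.
        for tag, column in structure["children"].items():
            value = row.get(column)
            if value in (None, ""):
                continue
            ET.SubElement(node, tag).text = str(value)
    return root


if __name__ == "__main__":
    # Assumed input rows; in practice these would come from the tabular input.
    rows = [
        {"sample_alias": "s1", "title": "First sample", "description": ""},
        {"sample_alias": "s2", "title": "Second sample", "description": "Mock"},
    ]
    print(ET.tostring(build_xml(rows, SAMPLE_STRUCTURE), encoding="unicode"))
```

Keeping the structure in data rather than in code means a new metadata object would only need a new structure entry, which is presumably why the project stores it in a configuration file.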