
module__org.bibliome.alvisnlp.modules.rdf.RDFProjector

Robert Bossy edited this page Jul 27, 2017 · 1 revision

#org.bibliome.alvisnlp.modules.rdf.RDFProjector

Synopsis

Projects SKOS concept labels and OWL class labels on sections.

This module is experimental.

Description

org.bibliome.alvisnlp.modules.rdf.RDFProjector reads source SKOS terminologies or OWL ontologies and searches for class and concept labels in sections.

The parameters allowJoined, allUpperCaseInsensitive, caseInsensitive, ignoreDiacritics, joinDash, matchStartCaseInsensitive, skipConsecutiveWhitespaces, skipWhitespace and wordStartCaseInsensitive control the matching between the section and the entry keys.
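The effect of some of these options can be pictured as a normalization applied to both the section text and the entry key before comparison. The following is only an illustrative sketch, not the module's implementation: the function names `normalize_key` and `keys_match` are hypothetical, and the symmetric normalization is a simplification (it does not distinguish which side the whitespace is missing from).

```python
import unicodedata

def normalize_key(text, case_insensitive=False, ignore_diacritics=False,
                  allow_joined=False, join_dash=False):
    """Reduce a string to a comparison key under the selected options."""
    if ignore_diacritics:
        # Decompose accented characters, then drop the combining marks.
        decomposed = unicodedata.normalize("NFD", text)
        text = "".join(c for c in decomposed if not unicodedata.combining(c))
    if case_insensitive:
        text = text.lower()
    if allow_joined:
        # In this sketch, join_dash only has an effect when allow_joined is set.
        removable = " \t\n-" if join_dash else " \t\n"
        text = "".join(c for c in text if c not in removable)
    return text

def keys_match(contents, entry, **options):
    """True if contents and entry are equal after normalization."""
    return normalize_key(contents, **options) == normalize_key(entry, **options)
```

With this sketch, `keys_match("aminoacid", "amino acid", allow_joined=True)` holds, while the same call without `allow_joined` does not.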

The subject parameter specifies which text of the section should be matched. There are two options:

  • the entries are matched on the contents of the section; subject can also control whether match boundaries must coincide with word delimiters;
  • the entries are matched on the feature values of the annotations of a given layer, joined by a whitespace; in this way, entries can be searched against word lemmas, for instance.
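The two options can be sketched as follows. This is a simplified illustration under stated assumptions, not the module's code: annotations are modelled as plain dictionaries with `start`, `end` and feature keys, and the names `find_in_contents` and `find_in_feature` are hypothetical.

```python
import re

def find_in_contents(contents, entry, word_boundaries=True):
    """Return (start, end) spans of entry in the raw section contents."""
    pattern = re.escape(entry)
    if word_boundaries:
        # Require the match to start and end at word delimiters.
        pattern = r"\b" + pattern + r"\b"
    return [m.span() for m in re.finditer(pattern, contents)]

def find_in_feature(annotations, feature, entry):
    """Match entry against feature values joined by single spaces.

    Returns section offsets covering the matched run of annotations.
    """
    values = [a[feature] for a in annotations]
    target = entry.split(" ")
    n = len(target)
    spans = []
    for i in range(len(values) - n + 1):
        if values[i:i + n] == target:
            spans.append((annotations[i]["start"], annotations[i + n - 1]["end"]))
    return spans
```

Matching on a lemma feature lets the entry `amino acid` cover the surface form `amino acids`, which the contents-based search with word boundaries would skip.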

org.bibliome.alvisnlp.modules.rdf.RDFProjector creates an annotation for each matched entry and adds these annotations to the layer named targetLayerName. The created annotations will have the feature uriFeatureName containing the URI of the matched class or concept. org.bibliome.alvisnlp.modules.rdf.RDFProjector may also map property object values into features specified by labelFeatures.
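As a rough picture of the result, the sketch below builds one annotation for a matched entry. The data model (dictionaries for entries and annotations), the function name `make_annotation`, and the choice of keeping only the first value of each property are assumptions made for illustration; the module's actual behaviour may differ.

```python
def make_annotation(start, end, entry, uri_feature, label_features, constants=None):
    """Build an annotation for a matched entry.

    entry: {"uri": str, "properties": {property name: [values]}} (assumed shape)
    label_features: mapping from feature names to property names
    """
    features = dict(constants or {})
    # The URI of the matched class or concept goes into the URI feature.
    features[uri_feature] = entry["uri"]
    for feature_name, prop in label_features.items():
        values = entry["properties"].get(prop)
        if values:
            # Assumption: keep the first value only.
            features[feature_name] = values[0]
    return {"start": start, "end": end, "features": features}
```

Properties that the entry does not carry simply produce no feature in this sketch.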

Parameters

Optional

Type: SourceStream

Path to the source SKOS/OWL files.

Optional

Type: String

Name of the layer that contains the match annotations.

Optional

Type: String

Feature where to store the entry URI.

Optional

Type: Mapping

Constant features to add to each annotation created by this module.

Optional

Type: OutputFile

Serialization is not supported.

Optional

Type: InputFile

Serialization is not supported.

Default value: false

Type: Boolean

Whether the match allows case substitution on all characters in words that are all upper case.

Default value: false

Type: Boolean

Whether the match allows arbitrary suppression of whitespace characters in the subject. For instance, the contents aminoacid matches the entry amino acid.

Default value: false

Type: Boolean

Whether the match allows case substitutions on all characters.

Default value: true

Type: Expression

Only process documents that satisfy this filter.

Default value: false

Type: Boolean

Whether the match allows diacritic substitutions on all characters. For instance, the contents acide amine matches the entry acide aminé.

Default value: false

Type: Boolean

Whether to treat dash characters (-) as whitespace characters if allowJoined is true. For instance, the contents aminoacid matches the entry amino-acid.

Default value: {rdfs-label=rdfs:label, skos-prefLabel=skos:prefLabel}

Type: Mapping

Mapping from feature names to property URIs. This parameter indicates the properties of the entry to record in features.

Default value: rdfs:label,skos:prefLabel,skos:altLabel,skos:hiddenLabel,skos:notation,oboInOwl:hasExactSynonym,oboInOwl:hasRelatedSynonym,oboInOwl:hasSynonym

Type: String[]

RDF properties whose object values represent entry keys.

Default value: false

Type: Boolean

Whether the match allows case substitution on the first character of the entry key.

Default value: all

Type: MultipleEntryBehaviour

Specifies the behaviour of org.bibliome.alvisnlp.modules.rdf.RDFProjector if dictFile contains several entries with the same key.

Default value: {}

Type: Mapping

Specifies URI prefixes to be used in resourceTypeURIs, labelURIs, and labelFeatures.
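Prefixed names such as rdfs:label or skos:Concept stand for full URIs. The sketch below shows one way such an expansion could work; the function name `expand` and the `WELL_KNOWN` table are hypothetical, and whether the module predefines these prefixes is an assumption (the namespace URIs themselves are the standard ones).

```python
# Standard namespace URIs for the prefixes used in the default values.
WELL_KNOWN = {
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "owl": "http://www.w3.org/2002/07/owl#",
    "oboInOwl": "http://www.geneontology.org/formats/oboInOwl#",
}

def expand(name, prefixes):
    """Expand a prefixed name to a full URI; leave other strings unchanged."""
    prefix, sep, local = name.partition(":")
    if sep and prefix in prefixes:
        return prefixes[prefix] + local
    # Unknown prefix or already a full URI (e.g. "http://...").
    return name

# User-supplied prefixes are merged over the well-known ones.
user_prefixes = {"ex": "http://example.org/"}
all_prefixes = {**WELL_KNOWN, **user_prefixes}
```

Here `expand("ex:Term", all_prefixes)` yields `http://example.org/Term`, and a string that is already a full URI passes through untouched.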

Default value: owl:Class,skos:Concept

Type: String[]

Type of RDF resources that represent an entry.

Default value: true

Type: Expression

Process only sections that satisfy this filter.

Default value: false

Type: Boolean

Whether the match allows insertion of consecutive whitespace characters in the subject. For instance, the contents amino acid written with several consecutive spaces matches the entry amino acid.

Default value: false

Type: Boolean

Whether the match allows arbitrary insertion of whitespace characters in the subject. For instance, the contents amino acid matches the entry aminoacid.

Default value: WORD

Type: Subject

Specifies the contents to match.

Default value: false

Type: Boolean

Whether the match allows case substitution on the first character of words.
