-
Notifications
You must be signed in to change notification settings - Fork 6
Configure
A configuration of TextAE can be written inside an annotation JSON file, or in a separate JSON file. The latter is more convenient if you want to share a configuration across multiple annotation JSON files.
In case you have prepared a configuration in a separate JSON file, you can put it in any location which is accessible through the net (a github repository would be a good idea, because you can manage revision history of your configuration).
An example of TextAE configuration can be found at: http://textae.pubannotation.org/examples/config-example.json
Then, when you open an instance of TextAE, the configuration can be specified using the parameter, configure, e.g.,
http://textae.pubannotation.org/editor.html?config=http://textae.pubannotation.org/examples/config-example.json...
The elements of a TextAE configuration is described below:
You can predefine entity types to be assigned to spans through denotation-type of annotations. In following example, two entity types, Protein and Regulation, are defined with their color specification to be #0000FF (blue) and #FF0000 (red):
"entity types": [
{
"id": "Protein",
"color": "#0000FF",
"default": true
},
{
"id": "Regulation",
"color": "#FF0000",
}
]
Optionally, you can specify a dominant type to be a default type (Protein in above example). Then, the type will be preselected when you create a denotation-type annotation.
You can predefine relation types to be used for relation-type of annotations. In following example, two relation types, "themeOf" and "causeOf", are defined with their color specification to be #0000FF (blue) and #FF0000 (red):
"relation types": [
{
"id": "themeOf",
"color": "#0000FF",
"default": true
},
{
"id": "causeOf",
"color": "#FF0000"
}
]
TextAE features the function, Boundary Detection (). When the feature is on, word boundaries are automatically detected to help selecting spans to be annotated.
The set of Delimiter characters defines the characters to be regarded as word boundaries. Following is the default set of delimiter characters:
"delimiter characters": [
" ", ".", "!", "?", ",", ":", ";", "-", "/", "&",
"(", ")", "{", "}", "[", "]",
"\\", "\"", "'", "\n", "–"
]
If necessary, you can re-defined it in your configuration file.
The set of non-edge characters defines the characters which cannot appear at the edge of any span. Following is the default set of non-edge characters:
"non-edge characters": [
" ",
"\n"
],