-
Notifications
You must be signed in to change notification settings - Fork 154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some spans are missing begin
offset field
#1496
Comments
Hm, ok. We use the It arrives in INCEpTION via a module we re-use from WebAnno: JsonFormatSupport: So, you could add However, I wonder if it wouldn't make sense to report the issue upstream to the UIMA issue tracker to suggest always including begin/end offsets even while other 0 values might still be omitted. |
We could also consider changing the default for |
- Disable omission of default values for legacy UIMA JSON format - Add option to switch to previous behavior of omitting default values - Updated documentation
…s-are-missing-begin-offset-field #1496 - Some spans are missing begin offset field
* main: (189 commits) No issue. Minor additions to BioC format description #4062 - ViewportTracker should focus on block-like elements #4032 - Allow using externalized strings from backend code #4060 - Clean up redundant code in annotation handlers #4026: Support for error tracking with Sentry #3673 - Update dependencies update dead link to the new file #4055 - Editor scrolls up when left sidebar is opened/closed [maven-release-plugin] prepare for next development iteration [maven-release-plugin] prepare release inception-28.2 #4052 - Admins no longer see all projects in the project overview #3673 - Update dependencies #4048 - Document navigation options not visible to manager when viewing other users document #3673 - Update dependencies #3673 - Update dependencies #1496 - Some spans are missing begin offset field #1511 - External recommender fails when CAS contains control characters #1496 - Some spans are missing begin offset field #4040 - Ability to store preferences from client-side code #1066 - Recommender status info ... % Conflicts: % inception/inception-api-annotation/src/main/java/de/tudarmstadt/ukp/clarin/webanno/api/annotation/page/AnnotationPageBase.java % inception/inception-brat-editor/src/main/java/de/tudarmstadt/ukp/clarin/webanno/brat/annotation/BratAnnotationEditor.java % inception/inception-diam/src/main/java/de/tudarmstadt/ukp/inception/diam/service/DiamWebsocketController.java % inception/inception-documents/src/test/java/de/tudarmstadt/ukp/inception/documents/DocumentServiceImplConcurrencyTest.java % inception/inception-external-search-solr/pom.xml % inception/inception-html-editor/src/main/java/de/tudarmstadt/ukp/inception/htmleditor/docview/HtmlDocumentViewControllerImpl.java % inception/inception-html-editor/src/main/resources/META-INF/spring/org.springframework.boot.autoconfigure.AutoConfiguration.imports % inception/inception-preferences/src/main/java/de/tudarmstadt/ukp/inception/preferences/config/PreferencesServiceAutoConfig.java % inception/inception-recommendation/src/main/java/de/tudarmstadt/ukp/inception/recommendation/service/LearningRecordServiceImpl.java % inception/inception-recommendation/src/main/java/de/tudarmstadt/ukp/inception/recommendation/service/RecommendationServiceImpl.java % inception/inception-recommendation/src/main/java/de/tudarmstadt/ukp/inception/recommendation/tasks/TrainingTask.java % inception/inception-recommendation/src/test/java/de/tudarmstadt/ukp/inception/recommendation/footer/RecommendationEventWebsocketControllerImplTest.java % inception/inception-support/pom.xml % inception/inception-ui-core/pom.xml % inception/inception-websocket/pom.xml % inception/pom.xml
Describe the bug
When we process annotated documents exported from Inception as JSON files, we use the begin-end offsets of each span to identify words and phrases. In some annotated documents, one of the spans will only list the end offset, which causes an error when we attempt to process such documents. We suspect that the issue might come from annotating spans that begin at the first index of the document.
Snippet
In this example snippet from exported JSON, span 5368 lacks a
begin
field:To Reproduce
Steps to reproduce the behavior:
end
, but notbegin
.begin
field.Speculation as to cause
Because this only happens when
end
offsets are small, it might be the case that this is triggered whenbegin
would have a value of zero. Perhaps something like @JsonInclude.NON_DEFAULT is being used? If you can point us to the relevant code, we are happy to look into this.The text was updated successfully, but these errors were encountered: