Entity Localization Bug: Sentence. Runaway sentences in many papers #191
Labels
bad-entity-detection
An issue or task related to an entity that was detected in the wrong place
bug
Something isn't working
entity-localization
An issue or task related to entity localization
sentences
An issue or task related to sentences
Milestone
Description: In some papers, it appears that the end of a sentence is never detected, such that that sentence bounding boxes start somewhere in the middle of the paper, and then continue throughout the entire rest of the paper until the end.
I have inspected 24 papers from PR #188, and observed this problem in the following papers:
_How to fix: One fix is to fix the sentence segmenter; I would guess that the command for undoing the color at the end of a sentence is placed in a part of the TeX that is invalid, and hence doesn't successfully reverse the color. Another simpler fix is to do outlier detection, not uploading sentences which contain more than some threshold number of bounding boxes (e.g., more than 100).
The text was updated successfully, but these errors were encountered: