Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider document structure when splitting sentences in custom XML formats #4901

Closed
reckart opened this issue Jun 25, 2024 · 0 comments
Closed
Assignees
Labels
⭐️ Enhancement New feature or request
Milestone

Comments

@reckart
Copy link
Member

reckart commented Jun 25, 2024

Is your feature request related to a problem? Please describe.
Projects employing a custom XML format encode speech turns as distinct <p> elements. The existing automatic detection of sentence boundaries within the annotation tool lacks the required accuracy. To address this, it is necessary to enable the specification that the end of a <p> element consistently indicates the end of a sentence.

Describe the solution you'd like
Enhance the annotation tool to support customizable sentence boundary detection, tailored for projects utilizing a custom XML format.

@reckart reckart added the ⭐️ Enhancement New feature or request label Jun 25, 2024
@reckart reckart added this to the 33.1 milestone Jun 25, 2024
@reckart reckart self-assigned this Jun 25, 2024
reckart added a commit that referenced this issue Jun 25, 2024
…m XML formats

- Added options to define block elements in XML files which are considered when doing sentence splitting
reckart added a commit that referenced this issue Jun 25, 2024
…m XML formats

- Added options to define block elements in XML files which are considered when doing sentence splitting
reckart added a commit that referenced this issue Jun 25, 2024
…-document-structure-when-splitting-sentences-in-custom-XML-formats

#4901 - Consider document structure when splitting sentences in custom XML formats
@reckart reckart closed this as completed Jun 25, 2024
reckart added a commit that referenced this issue Jun 25, 2024
* release/33.x:
  #4901 - Consider document structure when splitting sentences in custom XML formats
  #4880 - Upgrade dependencies

% Conflicts:
%	pom.xml
reckart added a commit that referenced this issue Jul 3, 2024
…de-to-proper-curation-page

* main: (24 commits)
  #4919 - Unable to import project with knowledge-based exported by INCEpTION 33.0 (again)
  #4921 - Constraints do not work properly when rules are distributed over multiple rulesets
  #4915 - Support for MySQL 8
  #4051 - Postgres support
  #3407 - Remove old database migrations that seem to get always executed
  #3407 - Remove old database migrations that seem to get always executed
  #4909 - Upgrade dependencies
  #4911 - No results when searching as curator after re-indexing project
  #4909 - Upgrade dependencies
  [maven-release-plugin] prepare for next development iteration
  [maven-release-plugin] prepare release inception-33.1
  #4901 - Consider document structure when splitting sentences in custom XML formats
  #4880 - Upgrade dependencies
  #4898 - Unable to import project with knowledge-based exported by INCEpTION 33.0
  #4896 - Ability to search for additional languages in knowledge-base
  No issue: Trim LLM responses
  #4892 - Curation sidebar state not cleared when logging out
  #4891 - Exception in sidebar curation mode when document-level annotations exist
  #4727 - Switching between editors can cause the old and the new editors to be loaded and initialized
  #4889 - Document sometimes not rendered in by-line brat render modes
  ...

% Conflicts:
%	inception/inception-ui-annotation/src/main/java/de/tudarmstadt/ukp/clarin/webanno/ui/annotation/AnnotationPage.java
reckart added a commit that referenced this issue Jul 3, 2024
…de-to-proper-curation-page

* main: (24 commits)
  #4919 - Unable to import project with knowledge-based exported by INCEpTION 33.0 (again)
  #4921 - Constraints do not work properly when rules are distributed over multiple rulesets
  #4915 - Support for MySQL 8
  #4051 - Postgres support
  #3407 - Remove old database migrations that seem to get always executed
  #3407 - Remove old database migrations that seem to get always executed
  #4909 - Upgrade dependencies
  #4911 - No results when searching as curator after re-indexing project
  #4909 - Upgrade dependencies
  [maven-release-plugin] prepare for next development iteration
  [maven-release-plugin] prepare release inception-33.1
  #4901 - Consider document structure when splitting sentences in custom XML formats
  #4880 - Upgrade dependencies
  #4898 - Unable to import project with knowledge-based exported by INCEpTION 33.0
  #4896 - Ability to search for additional languages in knowledge-base
  No issue: Trim LLM responses
  #4892 - Curation sidebar state not cleared when logging out
  #4891 - Exception in sidebar curation mode when document-level annotations exist
  #4727 - Switching between editors can cause the old and the new editors to be loaded and initialized
  #4889 - Document sometimes not rendered in by-line brat render modes
  ...

% Conflicts:
%	inception/inception-ui-annotation/src/main/java/de/tudarmstadt/ukp/clarin/webanno/ui/annotation/AnnotationPage.java
@reckart reckart added this to Kanban Aug 7, 2024
@reckart reckart moved this to 🍹 Done in Kanban Aug 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
⭐️ Enhancement New feature or request
Projects
Archived in project
Development

No branches or pull requests

1 participant