Skip to content

Apache UIMA Java SDK 3.4.0

Compare
Choose a tag to compare
@reckart reckart released this 02 Feb 08:23
· 1305 commits to main since this release

Apache UIMA (Unstructured Information Management Architecture) v3.4.0 Release Notes

Contents

What is UIMA?
Major Changes in this Release
List of JIRA Issues Fixed in this Release
How to Get Involved
How to Report Issues

What is UIMA?

Unstructured Information Management applications are software systems that analyze large volumes of
unstructured information in order to discover knowledge that is relevant to an end user. UIMA is a
framework and SDK for developing such applications. An example UIM application might ingest plain
text and identify entities, such as persons, places, organizations; or relations, such as works-for
or located-at. UIMA enables such an application to be decomposed into components, for example
"language identification" -> "language specific segmentation" -> "sentence boundary detection" ->
"entity detection (person/place names etc.)". Each component must implement interfaces defined by
the framework and must provide self-describing metadata via XML descriptor files. The framework
manages these components and the data flow between them. Components are written in Java or C++; the
data that flows between components is designed for efficient mapping between these languages. UIMA
additionally provides capabilities to wrap components as network services, and can scale to very
large volumes by replicating processing pipelines over a cluster of networked nodes.

Apache UIMA is an Apache-licensed open source implementation of the UIMA specification (that
specification is, in turn, being developed concurrently by a technical committee within
OASIS, a standards organization). We invite and encourage you to
participate in both the implementation and specification efforts.

UIMA is a component framework for analysing unstructured content such as text, audio and video. It
comprises an SDK and tooling for composing and running analytic components written in Java and C++,
with some support for Perl, Python and TCL.

Notable changes in this release

This is a feature fix release.

What's Changed

Improvements

  • ⭐️ [UIMA-6474] Switch to getDataPathElements() in UIMA context by @reckart in #216
  • ⭐️ Issue #226: Provide SPI interfaces to locate descriptors by @reckart in #227, #237, #249
  • ⭐️ Issue #225: Improve deployment in OSGi environments by @reckart in #232, #244, #250
  • ⭐️ Issue #245: Utility method for loading type systems in SPI providers by @reckart in #246
  • ⭐️ Issue #247: RelativePathResolver should consider TCCL by @reckart in #248
  • ⭐️ Issue #268: UIMA components log and then re-throw exceptions which usually leads to errors being logged twice by @reckart in #279

Bugs fixed

  • 🦟 Issue #252: Potential failure to look up FsGenerator3 in OSGI-like contexts by @reckart in #253
  • 🦟 Issue #265: Unable to release without auto-staging by @reckart in #282
  • 🦟 Issue #266: Unable to install UIMA 3.3.1 Eclipse Plugins in Eclipse 2022-09 by @reckart in #274
  • 🦟 Issue #267: UIMA Log4jLogger_impl not compatible with log4j 2.18.0+ by @reckart in #269, #280
  • 🦟 Issue #272: select on FSArray seems broken by @reckart in #277, #278, #291
  • 🦟 Issue #275: Improved error message made it even more misleading by @reckart in #276
  • 🦟 Issue #285: NPE while deserializing an XMI in a PEAR context by @reckart in #287

Refactorings

  • ⚙️ [UIMA-6440] Stage release artifacts as part of the build by @reckart in #199
  • ⚙️ [UIMA-6443] Fresh Eclipse update site for every release by @reckart in #200
  • ⚙️ [UIMA-6462] Avoid deploy broken checksum files for p2content.xml and artifacts.xml in feature modules by @reckart in #205
  • ⚙️ [UIMA-6463] Use toolchains to ensure compatibility with Java 1.8 by @reckart in #206
  • ⚙️ [UIMA-6436] Move maintainer documentation from website into maintainer guide by @reckart in #197, #217
  • ⚙️ Issue #230: Remove version overrides in Maven plugin modules by @reckart in #231
  • ⚙️ Issue #228: Move the UimaDecompiler class by @reckart in #229
  • ⚙️ Issue #283: Update issue report generation by @reckart in #284
  • ⚙️ Issue #292: Make CAS serialization tests more robust by @reckart in #293
  • ⚙️ Issue #298 - SHA512 checksums contain wrong file name by @reckart in #299
  • ⚙️ Issue #300: Differences between source ZIP and repository tag by @reckart in #301
  • 🩹 [UIMA-6459] Upgrade dependencies by @reckart in #204, #207
  • 🩹 Issue #270: Update dependencies by @reckart in #271, #273, #290

For a full list of issues affecting this release, please see:

How to Get Involved

The Apache UIMA project really needs and appreciates any contributions, including documentation
help, source code and feedback. If you are interested in contributing, please visit
http://uima.apache.org/get-involved.html.

How to Report Issues

The Apache UIMA project uses GitHub for issue tracking. Please report any issues you find at
https://github.com/apache/uima-uimaj/issues.

Download

The official release comes with verifiable signatures and hashes and can be downloaded from the Apache UIMA homepage: https://uima.apache.org/downloads.cgi

Note that in particular the Source code" links below do not provide official release artifacts. It is only as a convenience by GitHub.

Convenience binaries are provided via Maven Central: https://search.maven.org/search?q=g:org.apache.uima