From 04f866735a7128a1ff05448cab748dcd28bfaff5 Mon Sep 17 00:00:00 2001 From: acameron103 Date: Fri, 6 Oct 2023 13:29:15 -0700 Subject: [PATCH] changelog --- CHANGELOG.md | 3 ++- unstructured/__version__.py | 2 +- 2 files changed, 3 insertions(+), 2 deletions(-) diff --git a/CHANGELOG.md b/CHANGELOG.md index e2250111b1..7722f5c142 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -1,4 +1,4 @@ -## 0.10.20-dev4 +## 0.10.20-dev5 ### Enhancements @@ -6,6 +6,7 @@ * **Adds data source properties to the Jira connector** These properties (date_created, date_modified, version, source_url, record_locator) are written to element metadata during ingest, mapping elements to information about the document source from which they derive. This functionality enables downstream applications to reveal source document applications, e.g. a link to a GDrive doc, Salesforce record, etc. * **Improve title detection in pptx documents** The default title textboxes on a pptx slide are now categorized as titles. * **Improve hierarchy detection in pptx documents** List items, and other slide text are properly nested under the slide title. This will enable better chunking of pptx documents. +* **Applies `max_characters=` argument to all element types in `add_chunking_strategy` decorator** Previously this argument was only utilized in chunking Table elements and now applies to all partitioned elements if `add_chunking_strategy` decorator is utilized, further preparing the elements for downstream processing. ### Features diff --git a/unstructured/__version__.py b/unstructured/__version__.py index 085ba4d4be..adcfc625cb 100644 --- a/unstructured/__version__.py +++ b/unstructured/__version__.py @@ -1 +1 @@ -__version__ = "0.10.20-dev4" # pragma: no cover +__version__ = "0.10.20-dev5" # pragma: no cover