Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Backport 2.x][FEATURE] Support batch ingestion in TextEmbeddingProcessor & SparseE… #762

Merged

Conversation

chishui
Copy link
Contributor

@chishui chishui commented May 27, 2024

…ncodingProcessor (#744)

  • Support batch ingestion in TextEmbeddingProcess & SparseEncodingProcessor

Signed-off-by: Liyun Xiu [email protected]

  • Update Changelog

Signed-off-by: Liyun Xiu [email protected]

  • Add UT and IT

Signed-off-by: Liyun Xiu [email protected]

  • Add comments

Signed-off-by: Liyun Xiu [email protected]

  • Sort texts by length before sending for inference

Signed-off-by: Liyun Xiu [email protected]

  • Make consistent check for inferenceList

Signed-off-by: Liyun Xiu [email protected]


Signed-off-by: Liyun Xiu [email protected]
(cherry picked from commit afd1215)

Description

[Describe what this change achieves]

Issues Resolved

[List any issues this PR will resolve]

Check List

  • New functionality includes testing.
    • All tests pass
  • New functionality has been documented.
    • New functionality has javadoc added
  • Commits are signed as per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

…ncodingProcessor (opensearch-project#744)

* Support batch ingestion in TextEmbeddingProcess & SparseEncodingProcessor

Signed-off-by: Liyun Xiu <[email protected]>

* Update Changelog

Signed-off-by: Liyun Xiu <[email protected]>

* Add UT and IT

Signed-off-by: Liyun Xiu <[email protected]>

* Add comments

Signed-off-by: Liyun Xiu <[email protected]>

* Sort texts by length before sending for inference

Signed-off-by: Liyun Xiu <[email protected]>

* Make consistent check for inferenceList

Signed-off-by: Liyun Xiu <[email protected]>

---------

Signed-off-by: Liyun Xiu <[email protected]>
(cherry picked from commit afd1215)
@chishui chishui changed the title [FEATURE] Support batch ingestion in TextEmbeddingProcessor & SparseE… [Backport 2.x][FEATURE] Support batch ingestion in TextEmbeddingProcessor & SparseE… May 27, 2024
@zhichao-aws zhichao-aws merged commit 15b4a0f into opensearch-project:2.x May 27, 2024
118 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants