Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] parquet records are not completely ingested into the open search severless sink #3856

Open
jw-amazon opened this issue Dec 13, 2023 · 2 comments
Labels
bug Something isn't working question Further information is requested

Comments

@jw-amazon
Copy link

jw-amazon commented Dec 13, 2023

Describe the bug
I am seeing parquet records are not completely ingested into the open search severless sink sometimes.

To Reproduce
Steps to reproduce the behavior:

  1. Go to AWS console
  2. Click on Open Search Ingestion Pipeline
  3. Check the document metrics and no documents failed to ingest and dlq is empty, however, I still see some parquet records were not completely ingested.

Expected behavior
The parquet record read count need to be the same with document write count. Would also like to see a metrics to reflect how many parquet records are ingested, so I can be confident that all records have been successfully read.

Screenshots

Environment (please complete the following information):

Additional context
Opened a internal ticket as well, will link this issue to the internal ticket.

@dlvenable
Copy link
Member

@jw-amazon , Do you have any samples of the counts that you are seeing? Also, which metrics specifically are you comparing?

@dlvenable dlvenable added the question Further information is requested label Dec 19, 2023
@jw-amazon
Copy link
Author

Hello, @dlvenable , I created this issue based on an internal ticket I created, I will ping you the internal ticket. This issue happened multiple times to us.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working question Further information is requested
Projects
Development

No branches or pull requests

2 participants