-
Notifications
You must be signed in to change notification settings - Fork 25k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Transform API update documents already inserted #80277
Comments
Hey @mjrlgue transform by default creates document IDs based on the values of the group_by keys. So, in your case document IDs match 1-1 the This is by design. There are two ways forward if you want a new occurrence of
I suggest option 1, as your destination data seems to be time based (being you have the concept of "new" vs "old" |
Pinging @elastic/ml-core (Team:ML) |
@mjrlgue I am going to close this issue as it seems things are working as designed. Please feel free to reopen, open a http://discuss.elastic.co/ ticket, or a separate issue. |
Hey @benwtrent Thank you for your reply and sorry for my late reply ^^. Well we tried to add, as you suggested, the we added the date histogram as follow:
But I like the idea of providing our own I think this detail about Transform API that generates ids based on the value of group_by keys should be specified in the doc. Because it will save time for further users. I think ill do a PR for that for the doc what do you think? Thanks :) |
As part of the effort of making JDBC driver self sufficient, remove the ES lib geo dependencies without any replacement. Currently the JDBC driver takes the WKT text and instantiates a geo object based on the ES lib geo. Moving forward the driver will return the WKT string representation without any conversion letting the user pick the geo library desired. That can be ES lib geo, jts, spatial4j or others. Note this is a breaking change. Relates elastic#80277
As part of the effort of making JDBC driver self sufficient, remove the ES lib geo dependencies without any replacement. Currently the JDBC driver takes the WKT text and instantiates a geo object based on the ES lib geo. Moving forward the driver will return the WKT string representation without any conversion letting the user pick the geo library desired. That can be ES lib geo, jts, spatial4j or others. Note this is a breaking change. Relates #80277
As part of the effort of making JDBC driver self sufficient, remove the ES lib geo dependencies without any replacement. Currently the JDBC driver takes the WKT text and instantiates a geo object based on the ES lib geo. Moving forward the driver will return the WKT string representation without any conversion letting the user pick the geo library desired. That can be ES lib geo, jts, spatial4j or others. Note this is a breaking change. Relates elastic#80277
As part of the effort of making JDBC driver self sufficient, remove the ES lib geo dependencies without any replacement. Currently the JDBC driver takes the WKT text and instantiates a geo object based on the ES lib geo. Moving forward the driver will return the WKT string representation without any conversion letting the user pick the geo library desired. That can be ES lib geo, jts, spatial4j or others. Note this is a breaking change. Relates #80277
Remove JDBC driver dependency on XContent by cloning the necessary classes into the driver. This has the advantage of keeping the parsing/writing code style in sync and hopefully making maintenance easier in the future at the cost of bringing over a lot of code that is potentially unnecessary. The imported classes were kept as close as possible to the original and placed under a different package. Noteable exception is JDBC XContentBuilder which exposes its internal generator to allow unwrapping inside sql-action. The bridging between XContent in ES and JDBC is done in sql-action through ProtoShim which relies on delegation to allow ES XContent to be used inside the JDBC classes. Fix elastic#80277
Remove JDBC driver dependency on XContent by cloning the necessary classes into the driver. This has the advantage of keeping the parsing/writing code style in sync and hopefully making maintenance easier in the future at the cost of bringing over a lot of code that is potentially unnecessary. The imported classes were kept as close as possible to the original and placed under a different package. Noteable exception is JDBC XContentBuilder which exposes its internal generator to allow unwrapping inside sql-action. The bridging between XContent in ES and JDBC is done in sql-action through ProtoShim which relies on delegation to allow ES XContent to be used inside the JDBC classes. Fix #80277
Remove JDBC driver dependency on XContent by cloning the necessary classes into the driver. This has the advantage of keeping the parsing/writing code style in sync and hopefully making maintenance easier in the future at the cost of bringing over a lot of code that is potentially unnecessary. The imported classes were kept as close as possible to the original and placed under a different package. Noteable exception is JDBC XContentBuilder which exposes its internal generator to allow unwrapping inside sql-action. The bridging between XContent in ES and JDBC is done in sql-action through ProtoShim which relies on delegation to allow ES XContent to be used inside the JDBC classes. Fix elastic#80277
Remove JDBC driver dependency on XContent by cloning the necessary classes into the driver. This has the advantage of keeping the parsing/writing code style in sync and hopefully making maintenance easier in the future at the cost of bringing over a lot of code that is potentially unnecessary. The imported classes were kept as close as possible to the original and placed under a different package. Noteable exception is JDBC XContentBuilder which exposes its internal generator to allow unwrapping inside sql-action. The bridging between XContent in ES and JDBC is done in sql-action through ProtoShim which relies on delegation to allow ES XContent to be used inside the JDBC classes. Fix #80277
As part of the effort of making JDBC driver self sufficient, remove the ES lib geo dependencies without any replacement. Currently the JDBC driver takes the WKT text and instantiates a geo object based on the ES lib geo. Moving forward the driver will return the WKT string representation without any conversion letting the user pick the geo library desired. That can be ES lib geo, jts, spatial4j or others. Note this is a breaking change. Relates elastic#80277
Remove JDBC driver dependency on XContent by cloning the necessary classes into the driver. This has the advantage of keeping the parsing/writing code style in sync and hopefully making maintenance easier in the future at the cost of bringing over a lot of code that is potentially unnecessary. The imported classes were kept as close as possible to the original and placed under a different package. Noteable exception is JDBC XContentBuilder which exposes its internal generator to allow unwrapping inside sql-action. The bridging between XContent in ES and JDBC is done in sql-action through ProtoShim which relies on delegation to allow ES XContent to be used inside the JDBC classes. Fix elastic#80277
Elasticsearch version (
bin/elasticsearch --version
): 7.15License: Basic
Plugins installed: none
JVM version (
java -version
):OS version (
uname -a
if on a Unix-like system):Description of the problem including expected versus actual behavior:
Steps to reproduce:
Hey I am facing a big issue (I don't know if it's designed to behave like this or not) as we are going to production using Transform API. What we are facing is the aggregation calculated with a transform API are updated once a term aggregation already exist in the destination index.
Here is the transformation:
The following response is:
The problem is, when I insert a new document, let's say having
os=os3
, what I expect is having a new insertion in the destination index as follow:But it gets updated like this:
From the doc of Transform API, what I understand is that it will calculate aggregations and store them in the destination index, but here it's not inserting but updating documents already inserted in the destination index with the same term.
I added an ingest pipeline to insert a date field
ingestedAt
but still updating. is it supposed to behave like this ?Thanks.
The text was updated successfully, but these errors were encountered: