Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ML] GET _transform only returns 100 #43052

Closed
sophiec20 opened this issue Jun 10, 2019 · 8 comments · Fixed by #66177
Closed

[ML] GET _transform only returns 100 #43052

sophiec20 opened this issue Jun 10, 2019 · 8 comments · Fixed by #66177

Comments

@sophiec20
Copy link
Contributor

sophiec20 commented Jun 10, 2019

Found in 7.2.0-BC5

GET _data_frame/transforms/_all will only return up to 100 documents.

GET _data_frame/transforms/_all?size=200 will return up to 200 documents.

If I had 105 data frame transforms, and I ran GET _data_frame/transforms/_all without specifying the size, then unfortunately the response does not give any indication that I need to page through the results. As a consequence, the UI only ever supports 100 transforms.

There is a hard-coded defensive upper limit for 10k data frame transforms (same as for anomaly detection). Consider how we can improve usability for cases where there are more than 100 transforms.

@sophiec20 sophiec20 added :ml Machine learning team-discuss :ml/Transform Transform labels Jun 10, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/ml-core

@benwtrent
Copy link
Member

A couple of thoughts:

@droberts195
Copy link
Contributor

  • We should return the total count of the transforms that match the id pattern supplied

Yes, agreed, this is how to solve the problem of knowing whether it's worth trying to get another page.

@peteharverson
Copy link

Note that the data frame _stats endpoint also uses a default size of 100:

GET _data_frame/transforms/_stats
GET _data_frame/transforms/_all/_stats

For a system with over 100 transform stats docs, both the above return 100 docs, with count returned as 100.

@benwtrent
Copy link
Member

Cool, I will get a PR together for fixing this behavior in 7.3. Returning the total count of docs makes sense :)

@benwtrent
Copy link
Member

#43241 <- PR so that the count is the total count given the id pattern.

@droberts195
Copy link
Contributor

#43241 fixed many of the problems, but I don't think it fixed the HLRC entirely. GetDataFrameTransformStatsResponse in the HLRC still assumes that count is the number of hits returned. However, please don't work on this immediately because #44350 is making major changes to the stats response and any fix to the reporting of count will clash horribly.

@sophiec20 sophiec20 removed the :ml Machine learning label Aug 19, 2019
@hendrikmuhs hendrikmuhs changed the title [ML] GET _data_frame only returns 100 [ML] GET _transform only returns 100 Sep 24, 2020
@hendrikmuhs
Copy link

#62809 fixed HLRC, I wonder if this issue can be closed now.

@elastic/ml-qa would one of you be able to verify that we haven't missed anything? If so, we can close this issue.

hendrikmuhs pushed a commit to hendrikmuhs/elasticsearch that referenced this issue Dec 10, 2020
hendrikmuhs pushed a commit that referenced this issue Dec 15, 2020
add hlrc integration tests for get and getStats

closes #43052
hendrikmuhs pushed a commit that referenced this issue Dec 15, 2020
add hlrc integration tests for get and getStats

closes #43052
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

6 participants