-
Notifications
You must be signed in to change notification settings - Fork 8.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ML] Error in experimental file data visualizer #29578
Comments
Pinging @elastic/ml-ui |
There are certain log structures that are slow to analyse, including those that have multiple timestamps on the same line that nearly match our supported timestamps but not quite - see elastic/elasticsearch#35137 It’s going to be hard to debug this problem without the file that causes it. @wolframhaussig would you be able to supply us with an anonymised version of the file, with everything confidential or identifying individuals removed? If this is not possible something else you could do to help narrow down the problem is to call the backend API directly, gradually increasing |
Is there a minimum number of lines which can be visualized? I reduced the logfile to 9 rows and get the error message:
I added an anonymized version of the file with which the error is reproducable: I will now try to reproduce the timeout error. |
@wolframhaussig thanks for uploading that sample. I can see what the problem is now. The file structure finder endpoint only recognises a small set of timestamp formats, and yours is not one of them. To see the formats it recognises search for "the formats it knows" in https://www.elastic.co/guide/en/elasticsearch/reference/current/ml-find-file-structure.html When analysing a semi-structured log file the structure finder groups the lines into messages by assuming the timestamp is on the first line of each message. But for your file it didn't recognise the timestamp at all so it thinks the whole file is leftovers from a previous incomplete message before the first complete message. This is why you get the "Number of messages analyzed must be positive" error - it thinks there are 0 complete messages after grouping the lines into messages. Since this is a backend problem I will open an issue in the elasticsearch repo to improve the timestamp recognition. |
Given what the problem is, please don't spend any more time doing this. |
Actually that statement I made was wrong, and it's why you don't get a more useful "this file does not contain a timestamp" error.
|
I opened elastic/elasticsearch#38086. Anyone who encounters the problem reported in this issue should subscribe to elastic/elasticsearch#38086 to follow progress towards improving things. Since this issue is in the UI repo I will close it now. |
Kibana version: 6.5.4
Elasticsearch version: 6.5.4
Server OS version: RedHat 7
Browser version: Chrome 71
Browser OS version: Windows 10
Original install method (e.g. download page, yum, from source, etc.): download page
Description of the problem including expected versus actual behavior:
I tried the experimental File Data Visualizer with a logback generated logfile of 390KB. Instead of showing the data an error occurs:
We have a 3 node elasticsearch cluster - all nodes were available, management console shows green status.
Errors in browser console (if relevant):
Provide logs and/or server output (if relevant):
Elasticsearch:
Kibana:
The text was updated successfully, but these errors were encountered: