Filebeat: osquery module #5971

tsg · 2018-01-02T09:50:25Z

This adds a Filebeat module for centralising osquery logs.

osqueryd writes the results in JSON, which makes it fairly easy to ingest to the Elastic stack. The module uses the JSON decoding support in Filebeat, and then renames the fields to match the Beats naming conventions (most fields prefixed with osquery.result). There is an option (use_namespace: false) to leave the fields as they were in the original JSON, but changing that settings makes the sample dashboards unusable.

Another issue is that the osquery JSON format represents all data as strings (the numbers are quoted). This is both bad and good. It's bad because number aggregations don't work on what should be numbers, but it's good because it means there can't be any type conflicts. We could, potentially, do the type conversions ourselves based on the osquery schema, but that seems risky with regards to the schema changes. The osquery JSON output should also be fixed once osquery switches to RapidJSON, which seems to be in progress.

ruflin · 2018-01-02T11:25:39Z

filebeat/module/osquery/result/_meta/fields.yml

+    - name: calendar_time
+      tupe: keyword
+      description: >
+        String representation of the collection time, as formatted by osquery.


Interesting that this is called calendar_time. I like the description with collection time. I wonder if we should have a common field in the future for the collection_time (or a different name).

In osquery, they have unixTime and calendarTime which are different representations of the same moment. These get translated to unix_time and calendar_time by the module, and @timestamp is computed from unix_time. So there's quite a bit of redundancy, we could consider dropping the calendar_time by default.

Ok, my assumption was that the calendar_time is the time when the entry was actually read and unix_time when it was created. Like for a log line where we have @timestamp potentially from the log line and beat.read_time. Perhaps beat.read_time should be event.collection_timestamp?

I added read_time (btw, it's not prefixed by beat) as being the read time when filebeat reads the log lines.

andrewkroh · 2018-01-04T05:41:05Z

filebeat/module/osquery/result/_meta/fields.yml

+    - name: unix_time
+      type: long
+      description: >
+        Unix timestamp of the event. Used for computing the `@timestamp` column.


Is this in seconds since epoch?

Yes, I'll update the docs for clarity.

tsg · 2018-01-09T13:26:58Z

Added another dashboard, addressed comments, and rebased to master. If green, this is good for merging from my POV.

This adds a Filebeat module for centralising osquery logs. osqueryd writes the results in JSON, which makes it fairly easy to ingest to the Elastic stack. The module uses the JSON decoding support in Filebeat, and then renames the fields to match the Beats naming conventions (most fields prefixed with osquery.result). There is an option (use_namespace: false) to leave the fields as they were in the original JSON, but changing that settings makes the sample dashboards unusable. This module comes with two dashboards, one for the compliance pack in osquery, the other for the ossec-rootkit pack.

…cumented

ruflin · 2018-01-12T01:46:50Z

filebeat/module/kafka/log/ingest/pipeline.json

@@ -45,7 +45,7 @@
    {
      "rename": {
        "field": "@timestamp",
-        "target_field": "beat.read_time"
+        "target_field": "read_timestamp"


@tsg This seems to be a breaking change? Also saw there is no CHANGELOG for this PR.

etursunbaev · 2018-02-20T12:23:57Z

Hi, all!
Cannot find information how to add osquery module to filebeat?
In /usr/share/filebeat there is no osquery module

/usr/share/filebeat/module# ls -l
total 20
drwxr-xr-x 4 root root 4096 Feb 19 17:51 apache2
drwxr-xr-x 3 root root 4096 Feb 19 17:51 auditd
drwxr-xr-x 4 root root 4096 Feb 19 17:51 mysql
drwxr-xr-x 4 root root 4096 Feb 19 17:51 nginx
drwxr-xr-x 4 root root 4096 Feb 19 17:51 system

@tsg ^^^

jjqq2013 · 2018-03-15T08:46:12Z

@etursunbaev your filebeat version seems old.

To use osquery module in older version, you can reference this:
94ad82c#commitcomment-27831592

https://github.com/jjqq2013/misc/tree/master/elasticsearch5.4.0

jjqq2013 · 2018-03-15T08:50:25Z

Hi, all, do you know what will be the document type of osquery module?
When I use output to kafka, I need set the topic

topic: '%{[fields.log_topic]}'

What is the value when use osquery module?

jjqq2013 · 2018-03-15T15:32:31Z

Oh, no problem. I found it: I need define fields. log_topic in prospector.

tsg added Filebeat Filebeat in progress Pull request is currently in progress. module review labels Jan 2, 2018

ruflin reviewed Jan 2, 2018

View reviewed changes

andrewkroh reviewed Jan 4, 2018

View reviewed changes

tsg force-pushed the module/osquery branch from e1d54b2 to 25e5687 Compare January 9, 2018 13:26

tsg removed the in progress Pull request is currently in progress. label Jan 9, 2018

tsg added 4 commits January 10, 2018 09:54

Set read_timestamp

dd179ca

Clarified the unix_timestamp column

58f99d9

make update

4a105ed

tsg force-pushed the module/osquery branch from 7e7caeb to 4a105ed Compare January 10, 2018 07:54

Rename Kafka module beat.read_time to read_timestamp, which is do…

6df40d2

…cumented

kvch merged commit 94ad82c into elastic:master Jan 10, 2018

ruflin reviewed Jan 12, 2018

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Filebeat: osquery module #5971

Filebeat: osquery module #5971

tsg commented Jan 2, 2018

ruflin Jan 2, 2018

tsg Jan 3, 2018

ruflin Jan 4, 2018

tsg Jan 9, 2018

andrewkroh Jan 4, 2018

tsg Jan 8, 2018

tsg commented Jan 9, 2018

ruflin Jan 12, 2018

etursunbaev commented Feb 20, 2018 •

edited

Loading

jjqq2013 commented Mar 15, 2018

jjqq2013 commented Mar 15, 2018

jjqq2013 commented Mar 15, 2018

Filebeat: osquery module #5971

Filebeat: osquery module #5971

Conversation

tsg commented Jan 2, 2018

ruflin Jan 2, 2018

Choose a reason for hiding this comment

tsg Jan 3, 2018

Choose a reason for hiding this comment

ruflin Jan 4, 2018

Choose a reason for hiding this comment

tsg Jan 9, 2018

Choose a reason for hiding this comment

andrewkroh Jan 4, 2018

Choose a reason for hiding this comment

tsg Jan 8, 2018

Choose a reason for hiding this comment

tsg commented Jan 9, 2018

ruflin Jan 12, 2018

Choose a reason for hiding this comment

etursunbaev commented Feb 20, 2018 • edited Loading

jjqq2013 commented Mar 15, 2018

jjqq2013 commented Mar 15, 2018

jjqq2013 commented Mar 15, 2018

etursunbaev commented Feb 20, 2018 •

edited

Loading