Feature: Allow tag used for planting to be configurable #24

warmfusion · 2016-04-04T14:43:49Z

Scenario

All events from all systems are coming through to a Forest plugin to push into ElasticSearch using the first two elements of the tag as an index.

The rest of the tag is application specific meta data used by the various members of various teams.

Example of a tag; live.productA.haproxy.access or staging.productA.nginx.error.

Problem

As each unique tag results in a newly planted connection to ElasticSearch, a considerable number of new connections are established even tho the configuration is identical and the connection can be reused.

Proposal

Present an additional argument to the matching part of the configuration which defines a 'grove' of similar trees such that even though there may be hundreds of unique trees (based on unique tags) they are grouped into common groves (based on this new config) such that they share a common connection to ES (in this example).

Perhaps something that'd let me do this:

<match **>
    @type forest
    grove ${tag_parts[0..2]}
    subtype elasticsearch
    <template>
        logstash_format true
        logstash_prefix ${tag_parts[0..2]}
        hosts elasticsearch.priv.example.com
    </template>
</match>

I'd then expect for events tagged into the system, new planted trees only exist for the grove, and not for each tag.

input tag	"grove"
live.product.haproxy.access	live.product.haproxy
live.product.application.serviceA.event.subkey	live.product.application
live.product.application.serviceB.event.otherkey	live.product.application

I believe the change would be to the @mapping hash, and more specifically around here

The text was updated successfully, but these errors were encountered:

tagomoris · 2016-04-04T15:12:47Z

I can understand your problem, but in general, forest plugin cannot assure that grove configuration value has consistent unit for each plants with configured parameters. Misconfigured configuration might break behavior of output plugins.
So that, i think forest plugin cannot provide such options.

On the other hand, Fluentd v0.14 plugin API will provide variable tag handling in native. It'll satisfy your requirement, i think.

warmfusion · 2016-04-04T20:48:43Z

While I appreciate the concern around misconfiguration of plugins, I'd argue that any sufficiently advance plugin has scope for breaking itself. 😃

I don't think i'll be able to use 0.14 for a while yet; still working on transitioning from Ruby 1.9.3 😢

The impact of inconsistent hash keys on the mapping makes sense, and it absolutely follows that the possibility of having one plant when the output needs multiple would be remarkably confusing as events may not be handled consistently. That being said, would my suggested implementation provide a solution to my stated problem?

I'm wondering if I need to try and implement the changes myself to suit my use case, at least till we can get to 0.14.

tagomoris · 2016-04-05T02:13:00Z

My answer for this proposal is - I have no motivation to write it by myself, but I'll consider to merge pull-request for this if that code is good enough.
Thank you for detailed proposal.

macdjord · 2016-08-18T16:21:48Z

How about a simpler partial solution? Frequently, I write Forest configurations with no tag-specific content at all - i.e. I want a.** to do this, and b.c.* to do that, but all the tags in each category are handled exactly the same. This is easy to check for - if a config never uses __TAG__, ${tag}, etc., then anything matching that <case> or <template> will have the same config, guaranteed. In that situation, you could just create a single tree for all matching tags.

macdjord · 2016-08-18T16:28:40Z

More complete solution: Make TAG == grove. That is, if you define a grove, then TAG (and ${tag}, ${tag_parts[X]}, etc.) only contain the parts of the tag that were matched in the grove name.

macdjord · 2016-08-18T17:15:20Z

Another approach: When planting a new tree, cache the config used to initialize it. Every time a new tag comes in, generate the tree config from the , matching if any, and tag - but don't yet plant the tree. Compare this config to the configs of all previously created trees. If it is identical to one of them, forward this new tag to that existing tree. Only if the new config is distinct from all previous configs do you actually create a new tree for it.

This approach would be completely automatic - the user need to manually define 'groves' at all - and would be perfectly functionally identical to the current system.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature: Allow tag used for planting to be configurable #24

Feature: Allow tag used for planting to be configurable #24

warmfusion commented Apr 4, 2016

tagomoris commented Apr 4, 2016

warmfusion commented Apr 4, 2016

tagomoris commented Apr 5, 2016

macdjord commented Aug 18, 2016

macdjord commented Aug 18, 2016 •

edited

Loading

macdjord commented Aug 18, 2016

Feature: Allow tag used for planting to be configurable #24

Feature: Allow tag used for planting to be configurable #24

Comments

warmfusion commented Apr 4, 2016

Scenario

Problem

Proposal

tagomoris commented Apr 4, 2016

warmfusion commented Apr 4, 2016

tagomoris commented Apr 5, 2016

macdjord commented Aug 18, 2016

macdjord commented Aug 18, 2016 • edited Loading

macdjord commented Aug 18, 2016

macdjord commented Aug 18, 2016 •

edited

Loading