ClickHouse exporter produces duplicates and poor compression without sorting attributes #33634

JustinMason · 2024-06-18T17:14:32Z

Component(s)

exporter/clickhouse

Is your feature request related to a problem? Please describe.

The default table created by the exporter isn't a good pattern for optimizing compression and removing duplicates. ClickHouse does not sort the map values, so even though there may be duplicate records the order of their attributes may be different. This causes ClickHouse to treat them as unique records for storage and merge trees. This also effects ClickHouses compression so the same data takes up a lot more disk.

Describe the solution you'd like

We identified this issue and the solution was to use a NULL Engine for the primary table the Exporter writes to, then using a Materialized View we explicitly sort the attributes before insert.
mapSort(Attributes) as Attributes,

After this the compression rate for billions of rows was greater than 250, making the storage needed much less. It also eliminated duplicates and helped streamline the increase functions so we could avoid extra processing.

This makes the initial table creation a bit trickier but it is critical in my experience.

Describe alternatives you've considered

No response

Additional context

No response

The text was updated successfully, but these errors were encountered:

github-actions · 2024-06-18T17:14:47Z

Pinging code owners:

exporter/clickhouse: @hanjm @dmitryax @Frapschen @SpencerTorres

See Adding Labels via Comments if you do not have permissions to add labels yourself.

SpencerTorres · 2024-06-18T17:36:22Z

Interesting information, we actually have a PR open to update the table (#33611)

Considering the complexity of the materialized view required, it might be best to do this in the exporter code. Maps are unpredictable in Go, so we would need to convert it to a slice and sort it. Any thoughts on this approach?

hanjm · 2024-06-19T03:12:44Z

Yes, map sort can improve compression, clickhouse-go sdk support column.IterableOrderedMap ClickHouse/clickhouse-go#1152, exporter can use this sdk type to write clickhouse map type with order, welcome a PR to try it.

crobert-1 · 2024-06-20T14:53:04Z

Removing needs triage based on response from code owners.

…pen-telemetry#33634)

github-actions · 2024-10-14T03:35:34Z

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

exporter/clickhouse: @hanjm @dmitryax @Frapschen @SpencerTorres

See Adding Labels via Comments if you do not have permissions to add labels yourself.

…pen-telemetry#33634)

hanjm · 2024-10-21T05:45:38Z

still valid

SpencerTorres · 2024-10-29T02:21:29Z

This can be closed once #35725 is merged

zdyj3170101136 · 2024-11-14T03:17:50Z

is this is an optimization?

here is my table definition:

CREATE TABLE default.test
(
    `timestamp` DateTime('UTC') CODEC(Delta(4), ZSTD(1)),
    `service` LowCardinality(String) CODEC(ZSTD(1)),
    `operation` LowCardinality(String) CODEC(ZSTD(1)),
    `tags.key` Array(LowCardinality(String)) CODEC(ZSTD(1)),
    `tags.value` Array(String) CODEC(ZSTD(1))
)
ENGINE = MergeTree
PARTITION BY toDate(timestamp)
ORDER BY (-toUnixTimestamp(timestamp), service, operation)
SETTINGS index_granularity = 8192 

 CREATE TABLE default.test_1
(
    `timestamp` DateTime('UTC') CODEC(Delta(4), ZSTD(1)),
    `service` LowCardinality(String) CODEC(ZSTD(1)),
    `operation` LowCardinality(String) CODEC(ZSTD(1)),
    `tags` Map(LowCardinality(String), String) CODEC(ZSTD(1))
)
ENGINE = MergeTree
PARTITION BY toDate(timestamp)
ORDER BY (-toUnixTimestamp(timestamp), service, operation)
SETTINGS index_granularity = 8192

 CREATE TABLE default.test_2
(
    `timestamp` DateTime('UTC') CODEC(Delta(4), ZSTD(1)),
    `service` LowCardinality(String) CODEC(ZSTD(1)),
    `operation` LowCardinality(String) CODEC(ZSTD(1)),
    `tags` Map(LowCardinality(String), String) CODEC(ZSTD(1))
)
ENGINE = MergeTree
PARTITION BY toDate(timestamp)
ORDER BY (-toUnixTimestamp(timestamp), service, operation)
SETTINGS index_granularity = 8192

here is my insert:

insert into test_1 SELECT
                  timestamp, service, operation,
                  mapFromArrays(tags.key, tags.value) AS tags
              FROM test;
insert into test_2 select timestamp, service, operation, mapSort(tags) select * from test_1

seems the compressed bytes is no difference!

SELECT
                  name,
                  type,
                  data_compressed_bytes,
                  data_uncompressed_bytes,
                  (data_compressed_bytes / data_uncompressed_bytes) * 100 AS compression_rate,table
              FROM system.columns
              WHERE table like 'test%' and name like 'tag%'

SELECT
    name,
    type,
    data_compressed_bytes,
    data_uncompressed_bytes,
    (data_compressed_bytes / data_uncompressed_bytes) * 100 AS compression_rate,
    table
FROM system.columns
WHERE (table LIKE 'test%') AND (name LIKE 'tag%')

Query id: c45cc475-1420-41ad-aaf7-ed715cf4e064

┌─name───────┬─type────────────────────────────────┬─data_compressed_bytes─┬─data_uncompressed_bytes─┬──compression_rate─┬─table──┐
│ tags.key   │ Array(LowCardinality(String))       │              14258905 │               365525854 │ 3.900929262311497 │ test   │
│ tags.value │ Array(String)                       │             205625812 │              3444912929 │ 5.968969789308716 │ test   │
│ tags       │ Map(LowCardinality(String), String) │             219946760 │              3810479168 │ 5.772154899758791 │ test_1 │
│ tags       │ Map(LowCardinality(String), String) │             219910517 │              3810470959 │ 5.771216192596627 │ test_2 │
└────────────┴─────────────────────────────────────┴───────────────────────┴─────────────────────────┴───────────────────┴────────┘

clickhouse server version:

ClickHouse server version 23.8.16 revision 54465.

@JustinMason @SpencerTorres cc

earwin · 2024-11-14T14:17:16Z

@zdyj3170101136 this is both an optimization and correctness issue.
E.g. GROUP BY ResourceAttributes will yield duplicate entries for various reorderings of the same set of attributes.

Your example is incomplete without knowing the layout the tags.key has in the original default.test table. Is it already de-facto sorted or not?
Also the layout of the tags across the records, when these records are sorted according to (-toUnixTimestamp(timestamp), service, operation). If there are no contiguous runs of the same tagsets, there would be little advantage compression-wise.

Besides compression it is also a question of query efficiency, e.g. otel_metrics_gauge is at the moment ORDER BY (ServiceName, MetricName, Attributes, toUnixTimestamp64Nano(TimeUnix)).
Since the Attributes field is part of the sort key, compression won't be affected even if you write randomly-ordered keys to this map. But if you want to query the table by Attributes, you'll need to read more non-contiguous blocks.

…pen-telemetry#33634)

…3634 (#35725) #### Description Our attributes are stored as Map(String, String) in CH. By default the order of keys is undefined and as described in #33634 leads to worse compression and duplicates in `group by` (unless carefully accounted for). This PR uses the `column.IterableOrderedMap` facility from clickhouse-go to ensure fixed attribute key order. It is a reimplementation of #34598 that uses less allocations and is (arguably) somewhat more straightforward. I'm **opening this as a draft**, because this PR (and #34598) are blocked by ClickHouse/clickhouse-go#1365 (fixed in ClickHouse/clickhouse-go#1418) In addition, I'm trying to add the implementation of `column.IterableOrderedMap` used to clickhouse-go upstream: ClickHouse/clickhouse-go#1417 If it is accepted, I will amend this PR accordingly. #### Link to tracking issue Fixes #33634 #### Testing The IOM implementation was used in production independently. I'm planning to build otelcollector with this PR and cut over my production to it in the next few of days.

…en-telemetry#33634 (open-telemetry#35725) #### Description Our attributes are stored as Map(String, String) in CH. By default the order of keys is undefined and as described in open-telemetry#33634 leads to worse compression and duplicates in `group by` (unless carefully accounted for). This PR uses the `column.IterableOrderedMap` facility from clickhouse-go to ensure fixed attribute key order. It is a reimplementation of open-telemetry#34598 that uses less allocations and is (arguably) somewhat more straightforward. I'm **opening this as a draft**, because this PR (and open-telemetry#34598) are blocked by ClickHouse/clickhouse-go#1365 (fixed in ClickHouse/clickhouse-go#1418) In addition, I'm trying to add the implementation of `column.IterableOrderedMap` used to clickhouse-go upstream: ClickHouse/clickhouse-go#1417 If it is accepted, I will amend this PR accordingly. #### Link to tracking issue Fixes open-telemetry#33634 #### Testing The IOM implementation was used in production independently. I'm planning to build otelcollector with this PR and cut over my production to it in the next few of days.

JustinMason added enhancement New feature or request needs triage New item requiring triage labels Jun 18, 2024

github-actions bot added the exporter/clickhouse label Jun 18, 2024

crobert-1 removed the needs triage New item requiring triage label Jun 20, 2024

This was referenced Jun 20, 2024

Weekly Report: 2024-06-13 - 2024-06-20 LucaLanziani/opentelemetry-collector-contrib#14

Closed

Weekly Report: 2024-06-13 - 2024-06-20 LucaLanziani/opentelemetry-collector-contrib#15

Closed

JustinMason mentioned this issue Aug 11, 2024

Initial update to have attributes use the ordered map so the order is maintained into clickhouse #34598

Closed

Frapschen assigned JustinMason Aug 12, 2024

earwin added a commit to rainsouthafrica/opentelemetry-collector-contrib that referenced this issue Oct 9, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion (o…

0576d04

…pen-telemetry#33634)

earwin mentioned this issue Oct 9, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion #33634 #35725

Merged

earwin added a commit to rainsouthafrica/opentelemetry-collector-contrib that referenced this issue Oct 9, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion (o…

5bf53bd

…pen-telemetry#33634)

earwin added a commit to rainsouthafrica/opentelemetry-collector-contrib that referenced this issue Oct 9, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion (o…

b093216

…pen-telemetry#33634)

earwin added a commit to rainsouthafrica/opentelemetry-collector-contrib that referenced this issue Oct 9, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion (o…

4e716f6

…pen-telemetry#33634)

earwin added a commit to rainsouthafrica/opentelemetry-collector-contrib that referenced this issue Oct 10, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion (o…

142c36d

…pen-telemetry#33634)

earwin added a commit to rainsouthafrica/opentelemetry-collector-contrib that referenced this issue Oct 10, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion (o…

1055b0b

…pen-telemetry#33634)

github-actions bot added the Stale label Oct 14, 2024

earwin added a commit to rainsouthafrica/opentelemetry-collector-contrib that referenced this issue Oct 16, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion (o…

636bdaf

…pen-telemetry#33634)

earwin added a commit to rainsouthafrica/opentelemetry-collector-contrib that referenced this issue Oct 17, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion (o…

361f6b5

…pen-telemetry#33634)

github-actions bot removed the Stale label Oct 22, 2024

earwin added a commit to rainsouthafrica/opentelemetry-collector-contrib that referenced this issue Nov 26, 2024

[exporter/clickhouseexporter] Sort attribute maps before insertion (o…

a15186c

…pen-telemetry#33634)

github-actions bot mentioned this issue Dec 3, 2024

Weekly Report: 2024-11-26 - 2024-12-03 #36628

Closed

github-actions bot mentioned this issue Dec 10, 2024

Weekly Report: 2024-12-03 - 2024-12-10 #36739

Closed

TylerHelmuth closed this as completed in #35725 Dec 12, 2024

Fiery-Fenix mentioned this issue Jan 6, 2025

[exporter/clickhouse] Nil pointer exception on Metrics without service.name Resource Attributes #37030

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ClickHouse exporter produces duplicates and poor compression without sorting attributes #33634

ClickHouse exporter produces duplicates and poor compression without sorting attributes #33634

JustinMason commented Jun 18, 2024

github-actions bot commented Jun 18, 2024

SpencerTorres commented Jun 18, 2024

hanjm commented Jun 19, 2024

crobert-1 commented Jun 20, 2024

github-actions bot commented Oct 14, 2024

hanjm commented Oct 21, 2024

SpencerTorres commented Oct 29, 2024

zdyj3170101136 commented Nov 14, 2024 •

edited

Loading

earwin commented Nov 14, 2024

ClickHouse exporter produces duplicates and poor compression without sorting attributes #33634

ClickHouse exporter produces duplicates and poor compression without sorting attributes #33634

Comments

JustinMason commented Jun 18, 2024

Component(s)

Is your feature request related to a problem? Please describe.

Describe the solution you'd like

Describe alternatives you've considered

Additional context

github-actions bot commented Jun 18, 2024

SpencerTorres commented Jun 18, 2024

hanjm commented Jun 19, 2024

crobert-1 commented Jun 20, 2024

github-actions bot commented Oct 14, 2024

hanjm commented Oct 21, 2024

SpencerTorres commented Oct 29, 2024

zdyj3170101136 commented Nov 14, 2024 • edited Loading

earwin commented Nov 14, 2024

zdyj3170101136 commented Nov 14, 2024 •

edited

Loading