Padding for IO buffers. #2980

amosbird · 2018-08-28T12:21:49Z

Testing data

select 'aaaaaaaa','bbbbbbbb','cccccccc','dddddddd','eeeeeeee','ffffffff','gggg','hhh' from numbers(3000000) into outfile '/tmp/test.tsv'

Testing command

echo "select count() from file('/tmp/test.tsv', CSV, 'a String, b String, c String, d String, e String, f String, g String, h String') where not ignore(e)" | clickhouse-benchmark

TSV parser has less overhead than CSV, using it would better unveil the benefits of memcpySmall.

Before

QPS: 1.662, RPS: 4985463.906, MiB/s: 603.823, result RPS: 1.662, result MiB/s: 0.000.
0.000%  0.559 sec.
10.000% 0.564 sec.
20.000% 0.568 sec.
30.000% 0.572 sec.
40.000% 0.575 sec.
50.000% 0.581 sec.
60.000% 0.592 sec.
70.000% 0.624 sec.
80.000% 0.639 sec.
90.000% 0.664 sec.
95.000% 0.686 sec.
99.000% 0.711 sec.
99.900% 0.715 sec.
99.990% 0.716 sec.

After

QPS: 1.861, RPS: 5582303.107, MiB/s: 676.110, result RPS: 1.861, result MiB/s: 0.000.
0.000%  0.510 sec.
10.000% 0.514 sec.
20.000% 0.517 sec.
30.000% 0.521 sec.
40.000% 0.523 sec.
50.000% 0.527 sec.
60.000% 0.530 sec.
70.000% 0.539 sec.
80.000% 0.558 sec.
90.000% 0.584 sec.
95.000% 0.589 sec.
99.000% 0.608 sec.
99.900% 0.655 sec.
99.990% 0.663 sec.

I hereby agree to the terms of the CLA available at: https://yandex.ru/legal/cla/?lang=en

alexey-milovidov · 2018-08-28T14:08:14Z

dbms/src/Common/PODArray.h

        insert_assume_reserved(from_begin, from_end);
    }

+    /// Works under assumption, that it's possible to read up to 15 excessive bytes after `from_end`


And if the PODArray is padded.

alexey-milovidov · 2018-08-28T14:11:22Z

dbms/src/IO/ReadHelpers.cpp

@@ -177,7 +189,10 @@ void readStringInto(Vector & s, ReadBuffer & buf)
    {
        char * next_pos = find_first_symbols<'\t', '\n'>(buf.position(), buf.buffer().end());

-        appendToStringOrVector(s, buf.position(), next_pos);
+        if (buf.isPadded())


Maybe we can put it inside one another inline function?

Yeah, there is indeed a chance.

btw, recent revisions to FunctionComparisor make g++ 8.2 mad. It takes 4 minutes to compile the translation unit.

Yes, something is over-complicated in implementation of DECIMAL data type. We will address it.

alexey-milovidov · 2018-08-28T20:13:13Z

dbms/src/Common/PODArray.h

        insert_assume_reserved(from_begin, from_end);
    }

+    /// Works under assumption, that it's possible to read up to 15 excessive bytes after `from_end` and this PODArray is padded.


We can enable this method only if the PODArray has enough padding.
By static_assert, for example.

PS. We can name this method insertSmallAllowReadWriteOverflow15 instead of insertSmall to be more precise.

Testing data ``` select 'aaaaaaaa','bbbbbbbb','cccccccc','dddddddd','eeeeeeee','ffffffff','gggg','hhh' from numbers(3000000) into outfile '/tmp/test.tsv' ``` Testing command ``` echo "select count() from file('/tmp/test.tsv', CSV, 'a String, b String, c String, d String, e String, f String, g String, h String') where not ignore(e)" | clickhouse-benchmark ``` TSV parser has less overhead than CSV, using it would better unveil the benefits of memcpySmall. Before ``` QPS: 1.662, RPS: 4985463.906, MiB/s: 603.823, result RPS: 1.662, result MiB/s: 0.000. 0.000% 0.559 sec. 10.000% 0.564 sec. 20.000% 0.568 sec. 30.000% 0.572 sec. 40.000% 0.575 sec. 50.000% 0.581 sec. 60.000% 0.592 sec. 70.000% 0.624 sec. 80.000% 0.639 sec. 90.000% 0.664 sec. 95.000% 0.686 sec. 99.000% 0.711 sec. 99.900% 0.715 sec. 99.990% 0.716 sec. ``` After ``` QPS: 1.861, RPS: 5582303.107, MiB/s: 676.110, result RPS: 1.861, result MiB/s: 0.000. 0.000% 0.510 sec. 10.000% 0.514 sec. 20.000% 0.517 sec. 30.000% 0.521 sec. 40.000% 0.523 sec. 50.000% 0.527 sec. 60.000% 0.530 sec. 70.000% 0.539 sec. 80.000% 0.558 sec. 90.000% 0.584 sec. 95.000% 0.589 sec. 99.000% 0.608 sec. 99.900% 0.655 sec. 99.990% 0.663 sec. ```

alexey-milovidov · 2018-08-30T20:14:22Z

Ok.

But the code in Memory is a little bit tangled (from my point of view).
Because capacity is first initialized to value without padding and then padded.

amosbird force-pushed the memcpySmall branch from 35fc520 to 339d5be Compare August 28, 2018 12:21

alexey-milovidov reviewed Aug 28, 2018

View reviewed changes

amosbird force-pushed the memcpySmall branch from 339d5be to 210091b Compare August 28, 2018 15:01

alexey-milovidov reviewed Aug 28, 2018

View reviewed changes

amosbird force-pushed the memcpySmall branch from 210091b to 8851fbc Compare August 29, 2018 02:02

alexey-milovidov merged commit e0b1b5f into ClickHouse:master Aug 30, 2018

alexey-milovidov added a commit that referenced this pull request Aug 30, 2018

Added comments #2980

06053d9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Padding for IO buffers. #2980

Padding for IO buffers. #2980

amosbird commented Aug 28, 2018

alexey-milovidov Aug 28, 2018

amosbird Aug 28, 2018

alexey-milovidov Aug 28, 2018

amosbird Aug 28, 2018

amosbird Aug 28, 2018

alexey-milovidov Aug 28, 2018

alexey-milovidov Aug 28, 2018

alexey-milovidov commented Aug 30, 2018

Padding for IO buffers. #2980

Padding for IO buffers. #2980

Conversation

amosbird commented Aug 28, 2018

alexey-milovidov Aug 28, 2018

Choose a reason for hiding this comment

amosbird Aug 28, 2018

Choose a reason for hiding this comment

alexey-milovidov Aug 28, 2018

Choose a reason for hiding this comment

amosbird Aug 28, 2018

Choose a reason for hiding this comment

amosbird Aug 28, 2018

Choose a reason for hiding this comment

alexey-milovidov Aug 28, 2018

Choose a reason for hiding this comment

alexey-milovidov Aug 28, 2018

Choose a reason for hiding this comment

alexey-milovidov commented Aug 30, 2018