feat: scalar regex match physical expr #12270

zhuliquan · 2024-08-31T04:29:05Z

Which issue does this PR close?

Rationale for this change

This PR is successor of PR #11455

BinaryExpr will compile literal regex pattern when it evaluating RecordBatch every time, Sometime, the time of compiling regex pattern is also expensive. In our approach, literal regex pattern will be compiled once and cached to be reused in execution. It's will save compile time of pre execution and speed up execution.

What changes are included in this PR?

Introducing a new physical expr ScalarRegexMatchExpr to handle regexp match with literal regrex pattern.
Introducing a message PhysicalScalarRegexMatchExprNode in proto to handle ScalarRegexMatchExpr and add arm in func parse_physical_expr and serialize_physical_expr.
Changing BinaryExpr arm in create_physical_expr. Creating ScalarRegexMatchExpr instead of BinaryExpr when Rhs is string literal expr and op is RegexMatch | RegexIMatch | RegexNotMatch | RegexNotIMatch.

Are these changes tested?

Yes, test mod in scalar_regex_match.rs

Are there any user-facing changes?

alamb · 2024-09-07T10:37:44Z

Thank you for this PR @zhuliquan . Have you run any benchmarks that show this approach is noticeably faster than the existing approach? It makes sense that it would be faster as it does not re-compile the regular expression for each batch, but I think it would help to quantify this difference

zhuliquan · 2024-09-17T16:47:02Z

Thank you for this PR @zhuliquan . Have you run any benchmarks that show this approach is noticeably faster than the existing approach? It makes sense that it would be faster as it does not re-compile the regular expression for each batch, but I think it would help to quantify this difference

yeah add benchmarks scalar_regex_match.rs which includes scalar_regex_match / binary_regex_match two type cases, these cases are varied from record_batch size and record_batch iterator time and regexp pattern. According to below benchmarks. almost scalar_regex_match is better than binary_expr in preformance of running time, and are even orders of magnitude different.

scalar_regex_match_pattern_url_rows_1000_iter_20
                        time:   [486.65 µs 487.36 µs 488.20 µs]
Found 12 outliers among 100 measurements (12.00%)
  7 (7.00%) high mild
  5 (5.00%) high severe

Benchmarking binary_regex_match_pattern_url_rows_1000_iter_20: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 7.9s, enable flat sampling, or reduce sample count to 50.
binary_regex_match_pattern_url_rows_1000_iter_20
                        time:   [1.5255 ms 1.5324 ms 1.5426 ms]
Found 17 outliers among 100 measurements (17.00%)
  3 (3.00%) high mild
  14 (14.00%) high severe

Benchmarking scalar_regex_match_pattern_url_rows_1000_iter_50: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.4s, enable flat sampling, or reduce sample count to 60.
scalar_regex_match_pattern_url_rows_1000_iter_50
                        time:   [1.2263 ms 1.2307 ms 1.2358 ms]
Found 6 outliers among 100 measurements (6.00%)
  4 (4.00%) high mild
  2 (2.00%) high severe

binary_regex_match_pattern_url_rows_1000_iter_50
                        time:   [3.8231 ms 3.8478 ms 3.8821 ms]
Found 10 outliers among 100 measurements (10.00%)
  3 (3.00%) high mild
  7 (7.00%) high severe

scalar_regex_match_pattern_url_rows_1000_iter_100
                        time:   [2.4275 ms 2.4302 ms 2.4334 ms]
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe

binary_regex_match_pattern_url_rows_1000_iter_100
                        time:   [7.8337 ms 7.8562 ms 7.8837 ms]
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe

scalar_regex_match_pattern_url_rows_2000_iter_10
                        time:   [479.66 µs 479.97 µs 480.34 µs]
Found 8 outliers among 100 measurements (8.00%)
  8 (8.00%) high severe

Benchmarking binary_regex_match_pattern_url_rows_2000_iter_10: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.3s, enable flat sampling, or reduce sample count to 60.
binary_regex_match_pattern_url_rows_2000_iter_10
                        time:   [1.0449 ms 1.0459 ms 1.0472 ms]
Found 14 outliers among 100 measurements (14.00%)
  3 (3.00%) high mild
  11 (11.00%) high severe

scalar_regex_match_pattern_url_rows_2000_iter_20
                        time:   [960.57 µs 963.14 µs 966.35 µs]
Found 16 outliers among 100 measurements (16.00%)
  1 (1.00%) low mild
  6 (6.00%) high mild
  9 (9.00%) high severe

binary_regex_match_pattern_url_rows_2000_iter_20
                        time:   [2.0784 ms 2.0906 ms 2.1056 ms]
Found 10 outliers among 100 measurements (10.00%)
  1 (1.00%) high mild
  9 (9.00%) high severe

scalar_regex_match_pattern_url_rows_2000_iter_50
                        time:   [2.4077 ms 2.4122 ms 2.4172 ms]
Found 3 outliers among 100 measurements (3.00%)
  2 (2.00%) high mild
  1 (1.00%) high severe

binary_regex_match_pattern_url_rows_2000_iter_50
                        time:   [5.1844 ms 5.1936 ms 5.2067 ms]
Found 8 outliers among 100 measurements (8.00%)
  1 (1.00%) high mild
  7 (7.00%) high severe

scalar_regex_match_pattern_url_rows_2000_iter_100
                        time:   [4.8069 ms 4.8147 ms 4.8250 ms]
Found 4 outliers among 100 measurements (4.00%)
  2 (2.00%) high mild
  2 (2.00%) high severe

binary_regex_match_pattern_url_rows_2000_iter_100
                        time:   [10.039 ms 10.057 ms 10.080 ms]
Found 6 outliers among 100 measurements (6.00%)
  2 (2.00%) high mild
  4 (4.00%) high severe

scalar_regex_match_pattern_url_rows_100_iter_10
                        time:   [27.600 µs 27.626 µs 27.656 µs]
Found 17 outliers among 100 measurements (17.00%)
  8 (8.00%) high mild
  9 (9.00%) high severe

binary_regex_match_pattern_url_rows_100_iter_10
                        time:   [563.57 µs 567.55 µs 572.23 µs]
Found 18 outliers among 100 measurements (18.00%)
  3 (3.00%) high mild
  15 (15.00%) high severe

scalar_regex_match_pattern_url_rows_100_iter_20
                        time:   [55.174 µs 55.222 µs 55.278 µs]
Found 14 outliers among 100 measurements (14.00%)
  5 (5.00%) high mild
  9 (9.00%) high severe

Benchmarking binary_regex_match_pattern_url_rows_100_iter_20: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.6s, enable flat sampling, or reduce sample count to 60.
binary_regex_match_pattern_url_rows_100_iter_20
                        time:   [1.1047 ms 1.1079 ms 1.1124 ms]
Found 15 outliers among 100 measurements (15.00%)
  1 (1.00%) high mild
  14 (14.00%) high severe

scalar_regex_match_pattern_url_rows_100_iter_50
                        time:   [138.07 µs 138.15 µs 138.23 µs]
Found 17 outliers among 100 measurements (17.00%)
  7 (7.00%) high mild
  10 (10.00%) high severe

binary_regex_match_pattern_url_rows_100_iter_50
                        time:   [2.6605 ms 2.6693 ms 2.6795 ms]
Found 11 outliers among 100 measurements (11.00%)
  4 (4.00%) high mild
  7 (7.00%) high severe

scalar_regex_match_pattern_url_rows_100_iter_100
                        time:   [276.31 µs 276.56 µs 276.82 µs]
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) high mild
  6 (6.00%) high severe

binary_regex_match_pattern_url_rows_100_iter_100
                        time:   [5.3683 ms 5.3892 ms 5.4134 ms]
Found 8 outliers among 100 measurements (8.00%)
  1 (1.00%) high mild
  7 (7.00%) high severe

scalar_regex_match_pattern_phone_rows_10_iter_10
                        time:   [4.2460 µs 4.2593 µs 4.2732 µs]
Found 4 outliers among 100 measurements (4.00%)
  3 (3.00%) high mild
  1 (1.00%) high severe

binary_regex_match_pattern_phone_rows_10_iter_10
                        time:   [5.7370 ms 5.9557 ms 6.2181 ms]
Found 11 outliers among 100 measurements (11.00%)
  5 (5.00%) high mild
  6 (6.00%) high severe

scalar_regex_match_pattern_phone_rows_10_iter_20
                        time:   [8.5372 µs 8.5639 µs 8.5909 µs]
Found 5 outliers among 100 measurements (5.00%)
  2 (2.00%) high mild
  3 (3.00%) high severe

binary_regex_match_pattern_phone_rows_10_iter_20
                        time:   [10.707 ms 10.727 ms 10.756 ms]
Found 3 outliers among 100 measurements (3.00%)
  1 (1.00%) high mild
  2 (2.00%) high severe

scalar_regex_match_pattern_phone_rows_10_iter_50
                        time:   [21.362 µs 21.419 µs 21.478 µs]
Found 4 outliers among 100 measurements (4.00%)
  2 (2.00%) high mild
  2 (2.00%) high severe

binary_regex_match_pattern_phone_rows_10_iter_50
                        time:   [27.042 ms 27.118 ms 27.210 ms]
Found 3 outliers among 100 measurements (3.00%)
  3 (3.00%) high severe

scalar_regex_match_pattern_phone_rows_10_iter_100
                        time:   [42.551 µs 42.648 µs 42.747 µs]
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe

Benchmarking binary_regex_match_pattern_phone_rows_10_iter_100: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.6s, or reduce sample count to 80.
binary_regex_match_pattern_phone_rows_10_iter_100
                        time:   [53.956 ms 54.130 ms 54.321 ms]
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high severe

scalar_regex_match_pattern_phone_rows_1000_iter_10
                        time:   [64.245 µs 64.429 µs 64.697 µs]
Found 14 outliers among 100 measurements (14.00%)
  4 (4.00%) high mild
  10 (10.00%) high severe

Benchmarking binary_regex_match_pattern_phone_rows_1000_iter_10: Collecting 100 samples in estimated 5.5002 s (1000 iterationsbinary_regex_match_pattern_phone_rows_1000_iter_10
                        time:   [5.3799 ms 5.3920 ms 5.4059 ms]
Found 2 outliers among 100 measurements (2.00%)
  1 (1.00%) high mild
  1 (1.00%) high severe

scalar_regex_match_pattern_phone_rows_1000_iter_20
                        time:   [128.51 µs 129.09 µs 129.87 µs]
Found 19 outliers among 100 measurements (19.00%)
  2 (2.00%) high mild
  17 (17.00%) high severe

binary_regex_match_pattern_phone_rows_1000_iter_20
                        time:   [10.641 ms 10.663 ms 10.693 ms]
Found 8 outliers among 100 measurements (8.00%)
  3 (3.00%) high mild
  5 (5.00%) high severe

scalar_regex_match_pattern_phone_rows_1000_iter_50
                        time:   [321.31 µs 322.78 µs 324.74 µs]
Found 16 outliers among 100 measurements (16.00%)
  4 (4.00%) high mild
  12 (12.00%) high severe

binary_regex_match_pattern_phone_rows_1000_iter_50
                        time:   [26.925 ms 26.999 ms 27.099 ms]
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high severe

Benchmarking scalar_regex_match_pattern_phone_rows_1000_iter_100: Collecting 100 samples in estimated 6.5876 s (10k iterationsscalar_regex_match_pattern_phone_rows_1000_iter_100
                        time:   [643.86 µs 646.24 µs 648.94 µs]
Found 22 outliers among 100 measurements (22.00%)
  4 (4.00%) high mild
  18 (18.00%) high severe

Benchmarking binary_regex_match_pattern_phone_rows_1000_iter_100: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.5s, or reduce sample count to 90.
Benchmarking binary_regex_match_pattern_phone_rows_1000_iter_100: Collecting 100 samples in estimated 5.4971 s (100 iterationsbinary_regex_match_pattern_phone_rows_1000_iter_100
                        time:   [53.220 ms 53.332 ms 53.478 ms]
Found 7 outliers among 100 measurements (7.00%)
  3 (3.00%) high mild
  4 (4.00%) high severe

scalar_regex_match_pattern_phone_rows_2000_iter_10
                        time:   [125.23 µs 125.32 µs 125.43 µs]
Found 12 outliers among 100 measurements (12.00%)
  2 (2.00%) high mild
  10 (10.00%) high severe

binary_regex_match_pattern_phone_rows_2000_iter_10
                        time:   [5.5073 ms 5.5882 ms 5.6784 ms]
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) high mild
  13 (13.00%) high severe

scalar_regex_match_pattern_phone_rows_2000_iter_20
                        time:   [250.82 µs 251.30 µs 251.90 µs]
Found 16 outliers among 100 measurements (16.00%)
  1 (1.00%) high mild
  15 (15.00%) high severe

binary_regex_match_pattern_phone_rows_2000_iter_20
                        time:   [10.726 ms 10.748 ms 10.782 ms]
Found 6 outliers among 100 measurements (6.00%)
  4 (4.00%) high mild
  2 (2.00%) high severe

scalar_regex_match_pattern_phone_rows_2000_iter_50
                        time:   [626.83 µs 627.60 µs 628.49 µs]
Found 15 outliers among 100 measurements (15.00%)
  6 (6.00%) high mild
  9 (9.00%) high severe

binary_regex_match_pattern_phone_rows_2000_iter_50
                        time:   [26.689 ms 26.736 ms 26.810 ms]
Found 10 outliers among 100 measurements (10.00%)
  7 (7.00%) high mild
  3 (3.00%) high severe

Benchmarking scalar_regex_match_pattern_phone_rows_2000_iter_100: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.9s, enable flat sampling, or reduce sample count to 50.
Benchmarking scalar_regex_match_pattern_phone_rows_2000_iter_100: Collecting 100 samples in estimated 6.9075 s (5050 iterationscalar_regex_match_pattern_phone_rows_2000_iter_100
                        time:   [1.2532 ms 1.2547 ms 1.2565 ms]
Found 17 outliers among 100 measurements (17.00%)
  4 (4.00%) high mild
  13 (13.00%) high severe

Benchmarking binary_regex_match_pattern_phone_rows_2000_iter_100: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.6s, or reduce sample count to 80.
Benchmarking binary_regex_match_pattern_phone_rows_2000_iter_100: Collecting 100 samples in estimated 5.6216 s (100 iterationsbinary_regex_match_pattern_phone_rows_2000_iter_100
                        time:   [54.064 ms 54.181 ms 54.348 ms]
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high severe

scalar_regex_match_pattern_phone_rows_100_iter_10
                        time:   [9.7657 µs 9.7984 µs 9.8394 µs]
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) high mild
  12 (12.00%) high severe

binary_regex_match_pattern_phone_rows_100_iter_10
                        time:   [5.2486 ms 5.2581 ms 5.2729 ms]
Found 5 outliers among 100 measurements (5.00%)
  2 (2.00%) high mild
  3 (3.00%) high severe

scalar_regex_match_pattern_phone_rows_100_iter_20
                        time:   [19.694 µs 19.978 µs 20.470 µs]
Found 11 outliers among 100 measurements (11.00%)
  2 (2.00%) high mild
  9 (9.00%) high severe

binary_regex_match_pattern_phone_rows_100_iter_20
                        time:   [10.634 ms 10.714 ms 10.813 ms]
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe

scalar_regex_match_pattern_phone_rows_100_iter_50
                        time:   [49.043 µs 49.112 µs 49.185 µs]
Found 5 outliers among 100 measurements (5.00%)
  2 (2.00%) high mild
  3 (3.00%) high severe

binary_regex_match_pattern_phone_rows_100_iter_50
                        time:   [26.416 ms 26.471 ms 26.537 ms]
Found 3 outliers among 100 measurements (3.00%)
  2 (2.00%) high mild
  1 (1.00%) high severe

scalar_regex_match_pattern_phone_rows_100_iter_100
                        time:   [98.263 µs 98.523 µs 98.826 µs]
Found 7 outliers among 100 measurements (7.00%)
  7 (7.00%) high severe

Benchmarking binary_regex_match_pattern_phone_rows_100_iter_100: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.5s, or reduce sample count to 90.
binary_regex_match_pattern_phone_rows_100_iter_100
                        time:   [52.802 ms 52.914 ms 53.059 ms]
Found 2 outliers among 100 measurements (2.00%)
  2 (2.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_10_iter_10: Collecting 100 samples in estimated 5.0029 s (1.2M iterationscalar_regex_match_pattern_zip_code_rows_10_iter_10
                        time:   [4.2069 µs 4.2121 µs 4.2181 µs]
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_10_iter_10: Collecting 100 samples in estimated 5.2384 s (1300 iterationbinary_regex_match_pattern_zip_code_rows_10_iter_10
                        time:   [4.0685 ms 4.1297 ms 4.2001 ms]
Found 12 outliers among 100 measurements (12.00%)
  3 (3.00%) high mild
  9 (9.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_10_iter_20: Collecting 100 samples in estimated 5.0209 s (581k iterationscalar_regex_match_pattern_zip_code_rows_10_iter_20
                        time:   [8.4201 µs 8.5101 µs 8.7093 µs]
Found 12 outliers among 100 measurements (12.00%)
  5 (5.00%) high mild
  7 (7.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_10_iter_20: Collecting 100 samples in estimated 5.6356 s (700 iterationsbinary_regex_match_pattern_zip_code_rows_10_iter_20
                        time:   [7.8804 ms 7.9021 ms 7.9259 ms]
Found 2 outliers among 100 measurements (2.00%)
  1 (1.00%) high mild
  1 (1.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_10_iter_50: Collecting 100 samples in estimated 5.1036 s (232k iterationscalar_regex_match_pattern_zip_code_rows_10_iter_50
                        time:   [21.478 µs 21.512 µs 21.548 µs]
Found 8 outliers among 100 measurements (8.00%)
  5 (5.00%) high mild
  3 (3.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_10_iter_50: Collecting 100 samples in estimated 5.9688 s (300 iterationsbinary_regex_match_pattern_zip_code_rows_10_iter_50
                        time:   [19.532 ms 19.594 ms 19.680 ms]
Found 12 outliers among 100 measurements (12.00%)
  5 (5.00%) high mild
  7 (7.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_10_iter_100: Collecting 100 samples in estimated 5.0465 s (116k iteratioscalar_regex_match_pattern_zip_code_rows_10_iter_100
                        time:   [42.538 µs 42.661 µs 42.804 µs]
Found 5 outliers among 100 measurements (5.00%)
  3 (3.00%) high mild
  2 (2.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_10_iter_100: Collecting 100 samples in estimated 7.9481 s (200 iterationbinary_regex_match_pattern_zip_code_rows_10_iter_100
                        time:   [38.765 ms 38.821 ms 38.898 ms]
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) high mild
  6 (6.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_1000_iter_10: Collecting 100 samples in estimated 5.2511 s (76k iteratioscalar_regex_match_pattern_zip_code_rows_1000_iter_10
                        time:   [64.238 µs 64.353 µs 64.503 µs]
Found 18 outliers among 100 measurements (18.00%)
  5 (5.00%) high mild
  13 (13.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_1000_iter_10: Collecting 100 samples in estimated 5.2693 s (1300 iteratibinary_regex_match_pattern_zip_code_rows_1000_iter_10
                        time:   [3.9515 ms 3.9651 ms 3.9822 ms]
Found 5 outliers among 100 measurements (5.00%)
  4 (4.00%) high mild
  1 (1.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_1000_iter_20: Collecting 100 samples in estimated 5.3484 s (40k iteratioscalar_regex_match_pattern_zip_code_rows_1000_iter_20
                        time:   [128.38 µs 128.71 µs 129.29 µs]
Found 11 outliers among 100 measurements (11.00%)
  1 (1.00%) low severe
  2 (2.00%) high mild
  8 (8.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_1000_iter_20: Collecting 100 samples in estimated 5.6914 s (700 iteratiobinary_regex_match_pattern_zip_code_rows_1000_iter_20
                        time:   [7.8980 ms 7.9112 ms 7.9278 ms]
Found 8 outliers among 100 measurements (8.00%)
  3 (3.00%) high mild
  5 (5.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_1000_iter_50: Collecting 100 samples in estimated 6.6129 s (20k iteratioscalar_regex_match_pattern_zip_code_rows_1000_iter_50
                        time:   [320.93 µs 321.28 µs 321.75 µs]
Found 11 outliers among 100 measurements (11.00%)
  3 (3.00%) high mild
  8 (8.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_1000_iter_50: Collecting 100 samples in estimated 6.0956 s (300 iteratiobinary_regex_match_pattern_zip_code_rows_1000_iter_50
                        time:   [19.761 ms 19.796 ms 19.842 ms]
Found 14 outliers among 100 measurements (14.00%)
  4 (4.00%) high mild
  10 (10.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_1000_iter_100: Collecting 100 samples in estimated 6.5983 s (10k iteratiscalar_regex_match_pattern_zip_code_rows_1000_iter_100
                        time:   [641.75 µs 642.84 µs 644.18 µs]
Found 15 outliers among 100 measurements (15.00%)
  4 (4.00%) high mild
  11 (11.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_1000_iter_100: Collecting 100 samples in estimated 7.6057 s (200 iteratibinary_regex_match_pattern_zip_code_rows_1000_iter_100
                        time:   [37.290 ms 37.361 ms 37.453 ms]
Found 11 outliers among 100 measurements (11.00%)
  5 (5.00%) high mild
  6 (6.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_2000_iter_10: Collecting 100 samples in estimated 5.1504 s (40k iteratioscalar_regex_match_pattern_zip_code_rows_2000_iter_10
                        time:   [125.26 µs 126.37 µs 128.03 µs]
Found 13 outliers among 100 measurements (13.00%)
  4 (4.00%) high mild
  9 (9.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_2000_iter_10: Collecting 100 samples in estimated 5.1009 s (1200 iteratibinary_regex_match_pattern_zip_code_rows_2000_iter_10
                        time:   [4.1292 ms 4.2475 ms 4.4111 ms]
Found 18 outliers among 100 measurements (18.00%)
  3 (3.00%) high mild
  15 (15.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_2000_iter_20: Collecting 100 samples in estimated 5.2205 s (20k iteratioscalar_regex_match_pattern_zip_code_rows_2000_iter_20
                        time:   [250.40 µs 250.84 µs 251.66 µs]
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) low severe
  2 (2.00%) high mild
  10 (10.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_2000_iter_20: Collecting 100 samples in estimated 5.7687 s (700 iteratiobinary_regex_match_pattern_zip_code_rows_2000_iter_20
                        time:   [8.0311 ms 8.0417 ms 8.0569 ms]
Found 13 outliers among 100 measurements (13.00%)
  7 (7.00%) high mild
  6 (6.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_2000_iter_50: Collecting 100 samples in estimated 6.8124 s (10k iteratioscalar_regex_match_pattern_zip_code_rows_2000_iter_50
                        time:   [626.11 µs 632.16 µs 640.81 µs]
Found 17 outliers among 100 measurements (17.00%)
  7 (7.00%) high mild
  10 (10.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_2000_iter_50: Collecting 100 samples in estimated 5.8554 s (300 iteratiobinary_regex_match_pattern_zip_code_rows_2000_iter_50
                        time:   [19.017 ms 19.142 ms 19.301 ms]
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_2000_iter_100: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.5s, enable flat sampling, or reduce sample count to 60.
Benchmarking scalar_regex_match_pattern_zip_code_rows_2000_iter_100: Collecting 100 samples in estimated 6.4886 s (5050 iteratscalar_regex_match_pattern_zip_code_rows_2000_iter_100
                        time:   [1.2522 ms 1.2538 ms 1.2558 ms]
Found 16 outliers among 100 measurements (16.00%)
  7 (7.00%) high mild
  9 (9.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_2000_iter_100: Collecting 100 samples in estimated 8.2551 s (200 iteratibinary_regex_match_pattern_zip_code_rows_2000_iter_100
                        time:   [40.160 ms 40.214 ms 40.289 ms]
Found 13 outliers among 100 measurements (13.00%)
  7 (7.00%) high mild
  6 (6.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_100_iter_10: Collecting 100 samples in estimated 5.0502 s (500k iteratioscalar_regex_match_pattern_zip_code_rows_100_iter_10
                        time:   [9.7926 µs 9.8117 µs 9.8327 µs]
Found 10 outliers among 100 measurements (10.00%)
  2 (2.00%) high mild
  8 (8.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_100_iter_10: Collecting 100 samples in estimated 5.2189 s (1300 iteratiobinary_regex_match_pattern_zip_code_rows_100_iter_10
                        time:   [3.8927 ms 3.8968 ms 3.9020 ms]
Found 7 outliers among 100 measurements (7.00%)
  1 (1.00%) high mild
  6 (6.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_100_iter_20: Collecting 100 samples in estimated 5.0642 s (252k iteratioscalar_regex_match_pattern_zip_code_rows_100_iter_20
                        time:   [19.607 µs 19.636 µs 19.667 µs]
Found 8 outliers among 100 measurements (8.00%)
  3 (3.00%) high mild
  5 (5.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_100_iter_20: Collecting 100 samples in estimated 5.3264 s (700 iterationbinary_regex_match_pattern_zip_code_rows_100_iter_20
                        time:   [7.3950 ms 7.4103 ms 7.4329 ms]
Found 7 outliers among 100 measurements (7.00%)
  3 (3.00%) high mild
  4 (4.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_100_iter_50: Collecting 100 samples in estimated 5.1970 s (101k iteratioscalar_regex_match_pattern_zip_code_rows_100_iter_50
                        time:   [48.754 µs 48.951 µs 49.214 µs]
Found 17 outliers among 100 measurements (17.00%)
  1 (1.00%) high mild
  16 (16.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_100_iter_50: Collecting 100 samples in estimated 5.7735 s (300 iterationbinary_regex_match_pattern_zip_code_rows_100_iter_50
                        time:   [18.503 ms 18.541 ms 18.584 ms]
Found 14 outliers among 100 measurements (14.00%)
  3 (3.00%) high mild
  11 (11.00%) high severe

Benchmarking scalar_regex_match_pattern_zip_code_rows_100_iter_100: Collecting 100 samples in estimated 5.0773 s (50k iteratioscalar_regex_match_pattern_zip_code_rows_100_iter_100
                        time:   [97.465 µs 97.681 µs 97.959 µs]
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) high mild
  13 (13.00%) high severe

Benchmarking binary_regex_match_pattern_zip_code_rows_100_iter_100: Collecting 100 samples in estimated 7.9996 s (200 iteratiobinary_regex_match_pattern_zip_code_rows_100_iter_100
                        time:   [39.223 ms 39.299 ms 39.388 ms]
Found 13 outliers among 100 measurements (13.00%)
  2 (2.00%) high mild
  11 (11.00%) high severe

scalar_regex_match_pattern_ip_rows_10_iter_10
                        time:   [4.2223 µs 4.2432 µs 4.2691 µs]
Found 12 outliers among 100 measurements (12.00%)
  2 (2.00%) high mild
  10 (10.00%) high severe

binary_regex_match_pattern_ip_rows_10_iter_10
                        time:   [630.65 µs 631.87 µs 633.45 µs]
Found 17 outliers among 100 measurements (17.00%)
  6 (6.00%) high mild
  11 (11.00%) high severe

scalar_regex_match_pattern_ip_rows_10_iter_20
                        time:   [8.5427 µs 8.5514 µs 8.5612 µs]
Found 17 outliers among 100 measurements (17.00%)
  10 (10.00%) high mild
  7 (7.00%) high severe

Benchmarking binary_regex_match_pattern_ip_rows_10_iter_20: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.9s, enable flat sampling, or reduce sample count to 50.
binary_regex_match_pattern_ip_rows_10_iter_20
                        time:   [1.3461 ms 1.3498 ms 1.3541 ms]
Found 16 outliers among 100 measurements (16.00%)
  3 (3.00%) high mild
  13 (13.00%) high severe

scalar_regex_match_pattern_ip_rows_10_iter_50
                        time:   [21.348 µs 21.451 µs 21.582 µs]
Found 5 outliers among 100 measurements (5.00%)
  2 (2.00%) high mild
  3 (3.00%) high severe

binary_regex_match_pattern_ip_rows_10_iter_50
                        time:   [3.2852 ms 3.2901 ms 3.2973 ms]
Found 8 outliers among 100 measurements (8.00%)
  3 (3.00%) high mild
  5 (5.00%) high severe

scalar_regex_match_pattern_ip_rows_10_iter_100
                        time:   [42.674 µs 42.773 µs 42.878 µs]
Found 4 outliers among 100 measurements (4.00%)
  1 (1.00%) high mild
  3 (3.00%) high severe

binary_regex_match_pattern_ip_rows_10_iter_100
                        time:   [6.6613 ms 6.6750 ms 6.6914 ms]
Found 16 outliers among 100 measurements (16.00%)
  6 (6.00%) high mild
  10 (10.00%) high severe

scalar_regex_match_pattern_ip_rows_1000_iter_10
                        time:   [64.227 µs 64.372 µs 64.650 µs]
Found 15 outliers among 100 measurements (15.00%)
  6 (6.00%) high mild
  9 (9.00%) high severe

binary_regex_match_pattern_ip_rows_1000_iter_10
                        time:   [680.25 µs 681.96 µs 684.16 µs]
Found 14 outliers among 100 measurements (14.00%)
  3 (3.00%) high mild
  11 (11.00%) high severe

scalar_regex_match_pattern_ip_rows_1000_iter_20
                        time:   [128.24 µs 128.43 µs 128.68 µs]
Found 17 outliers among 100 measurements (17.00%)
  1 (1.00%) high mild
  16 (16.00%) high severe

Benchmarking binary_regex_match_pattern_ip_rows_1000_iter_20: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 7.0s, enable flat sampling, or reduce sample count to 50.
binary_regex_match_pattern_ip_rows_1000_iter_20
                        time:   [1.3580 ms 1.3619 ms 1.3669 ms]
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) high mild
  13 (13.00%) high severe

scalar_regex_match_pattern_ip_rows_1000_iter_50
                        time:   [321.58 µs 322.41 µs 323.41 µs]
Found 20 outliers among 100 measurements (20.00%)
  2 (2.00%) high mild
  18 (18.00%) high severe

binary_regex_match_pattern_ip_rows_1000_iter_50
                        time:   [3.4025 ms 3.4131 ms 3.4271 ms]
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) high mild
  6 (6.00%) high severe

scalar_regex_match_pattern_ip_rows_1000_iter_100
                        time:   [643.86 µs 650.60 µs 659.63 µs]
Found 14 outliers among 100 measurements (14.00%)
  2 (2.00%) high mild
  12 (12.00%) high severe

binary_regex_match_pattern_ip_rows_1000_iter_100
                        time:   [7.0150 ms 7.0464 ms 7.0832 ms]
Found 17 outliers among 100 measurements (17.00%)
  7 (7.00%) high mild
  10 (10.00%) high severe

scalar_regex_match_pattern_ip_rows_2000_iter_10
                        time:   [126.59 µs 128.20 µs 130.80 µs]
Found 12 outliers among 100 measurements (12.00%)
  2 (2.00%) high mild
  10 (10.00%) high severe

binary_regex_match_pattern_ip_rows_2000_iter_10
                        time:   [751.90 µs 754.29 µs 757.16 µs]
Found 17 outliers among 100 measurements (17.00%)
  4 (4.00%) high mild
  13 (13.00%) high severe

scalar_regex_match_pattern_ip_rows_2000_iter_20
                        time:   [250.91 µs 252.22 µs 254.68 µs]
Found 14 outliers among 100 measurements (14.00%)
  8 (8.00%) high mild
  6 (6.00%) high severe

Benchmarking binary_regex_match_pattern_ip_rows_2000_iter_20: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 7.6s, enable flat sampling, or reduce sample count to 50.
binary_regex_match_pattern_ip_rows_2000_iter_20
                        time:   [1.4612 ms 1.4642 ms 1.4681 ms]
Found 17 outliers among 100 measurements (17.00%)
  7 (7.00%) high mild
  10 (10.00%) high severe

scalar_regex_match_pattern_ip_rows_2000_iter_50
                        time:   [626.52 µs 627.26 µs 628.41 µs]
Found 14 outliers among 100 measurements (14.00%)
  6 (6.00%) high mild
  8 (8.00%) high severe

binary_regex_match_pattern_ip_rows_2000_iter_50
                        time:   [3.7769 ms 3.7918 ms 3.8091 ms]
Found 14 outliers among 100 measurements (14.00%)
  2 (2.00%) high mild
  12 (12.00%) high severe

Benchmarking scalar_regex_match_pattern_ip_rows_2000_iter_100: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.5s, enable flat sampling, or reduce sample count to 60.
scalar_regex_match_pattern_ip_rows_2000_iter_100
                        time:   [1.2529 ms 1.2550 ms 1.2588 ms]
Found 16 outliers among 100 measurements (16.00%)
  1 (1.00%) low severe
  6 (6.00%) high mild
  9 (9.00%) high severe

binary_regex_match_pattern_ip_rows_2000_iter_100
                        time:   [7.6793 ms 7.7249 ms 7.7985 ms]
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe

scalar_regex_match_pattern_ip_rows_100_iter_10
                        time:   [9.7378 µs 9.7840 µs 9.8396 µs]
Found 19 outliers among 100 measurements (19.00%)
  1 (1.00%) high mild
  18 (18.00%) high severe

binary_regex_match_pattern_ip_rows_100_iter_10
                        time:   [680.50 µs 681.41 µs 682.54 µs]
Found 22 outliers among 100 measurements (22.00%)
  5 (5.00%) low mild
  4 (4.00%) high mild
  13 (13.00%) high severe

scalar_regex_match_pattern_ip_rows_100_iter_20
                        time:   [19.498 µs 19.608 µs 19.747 µs]
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) high mild
  13 (13.00%) high severe

Benchmarking binary_regex_match_pattern_ip_rows_100_iter_20: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.9s, enable flat sampling, or reduce sample count to 50.
binary_regex_match_pattern_ip_rows_100_iter_20
                        time:   [1.3570 ms 1.3602 ms 1.3641 ms]
Found 14 outliers among 100 measurements (14.00%)
  5 (5.00%) high mild
  9 (9.00%) high severe

scalar_regex_match_pattern_ip_rows_100_iter_50
                        time:   [48.605 µs 48.712 µs 48.856 µs]
Found 14 outliers among 100 measurements (14.00%)
  1 (1.00%) low severe
  3 (3.00%) high mild
  10 (10.00%) high severe

binary_regex_match_pattern_ip_rows_100_iter_50
                        time:   [3.3835 ms 3.3895 ms 3.3988 ms]
Found 4 outliers among 100 measurements (4.00%)
  2 (2.00%) high mild
  2 (2.00%) high severe

scalar_regex_match_pattern_ip_rows_100_iter_100
                        time:   [97.812 µs 97.954 µs 98.104 µs]
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe

binary_regex_match_pattern_ip_rows_100_iter_100
                        time:   [6.4822 ms 6.5068 ms 6.5354 ms]
Found 15 outliers among 100 measurements (15.00%)
  5 (5.00%) high mild
  10 (10.00%) high severe

scalar_regex_match_pattern_email_rows_10_iter_10
                        time:   [23.695 µs 23.897 µs 24.154 µs]
Found 15 outliers among 100 measurements (15.00%)
  6 (6.00%) high mild
  9 (9.00%) high severe

binary_regex_match_pattern_email_rows_10_iter_10
                        time:   [371.65 µs 373.00 µs 374.81 µs]
Found 14 outliers among 100 measurements (14.00%)
  3 (3.00%) high mild
  11 (11.00%) high severe

scalar_regex_match_pattern_email_rows_10_iter_20
                        time:   [46.839 µs 46.864 µs 46.891 µs]
Found 10 outliers among 100 measurements (10.00%)
  4 (4.00%) high mild
  6 (6.00%) high severe

binary_regex_match_pattern_email_rows_10_iter_20
                        time:   [745.12 µs 746.30 µs 747.77 µs]
Found 10 outliers among 100 measurements (10.00%)
  3 (3.00%) high mild
  7 (7.00%) high severe

scalar_regex_match_pattern_email_rows_10_iter_50
                        time:   [117.18 µs 117.26 µs 117.35 µs]
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe

Benchmarking binary_regex_match_pattern_email_rows_10_iter_50: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 9.7s, enable flat sampling, or reduce sample count to 50.
binary_regex_match_pattern_email_rows_10_iter_50
                        time:   [1.8642 ms 1.8683 ms 1.8734 ms]
Found 14 outliers among 100 measurements (14.00%)
  1 (1.00%) high mild
  13 (13.00%) high severe

scalar_regex_match_pattern_email_rows_10_iter_100
                        time:   [234.00 µs 234.13 µs 234.30 µs]
Found 12 outliers among 100 measurements (12.00%)
  5 (5.00%) high mild
  7 (7.00%) high severe

binary_regex_match_pattern_email_rows_10_iter_100
                        time:   [3.6968 ms 3.7092 ms 3.7239 ms]
Found 9 outliers among 100 measurements (9.00%)
  1 (1.00%) high mild
  8 (8.00%) high severe

Benchmarking scalar_regex_match_pattern_email_rows_1000_iter_10: Collecting 100 samples in estimated 5.0188 s (2500 iterationsscalar_regex_match_pattern_email_rows_1000_iter_10
                        time:   [1.9898 ms 2.0103 ms 2.0497 ms]
Found 4 outliers among 100 measurements (4.00%)
  1 (1.00%) high mild
  3 (3.00%) high severe

Benchmarking binary_regex_match_pattern_email_rows_1000_iter_10: Collecting 100 samples in estimated 5.2012 s (2200 iterationsbinary_regex_match_pattern_email_rows_1000_iter_10
                        time:   [2.3288 ms 2.3336 ms 2.3400 ms]
Found 13 outliers among 100 measurements (13.00%)
  2 (2.00%) high mild
  11 (11.00%) high severe

Benchmarking scalar_regex_match_pattern_email_rows_1000_iter_20: Collecting 100 samples in estimated 5.2299 s (1300 iterationsscalar_regex_match_pattern_email_rows_1000_iter_20
                        time:   [3.9735 ms 3.9766 ms 3.9801 ms]
Found 20 outliers among 100 measurements (20.00%)
  1 (1.00%) high mild
  19 (19.00%) high severe

Benchmarking binary_regex_match_pattern_email_rows_1000_iter_20: Collecting 100 samples in estimated 5.2181 s (1100 iterationsbinary_regex_match_pattern_email_rows_1000_iter_20
                        time:   [4.6979 ms 4.7036 ms 4.7102 ms]
Found 11 outliers among 100 measurements (11.00%)
  3 (3.00%) high mild
  8 (8.00%) high severe

scalar_regex_match_pattern_email_rows_1000_iter_50
                        time:   [9.9446 ms 9.9522 ms 9.9609 ms]
Found 3 outliers among 100 measurements (3.00%)
  1 (1.00%) high mild
  2 (2.00%) high severe

binary_regex_match_pattern_email_rows_1000_iter_50
                        time:   [11.709 ms 11.724 ms 11.740 ms]
Found 15 outliers among 100 measurements (15.00%)
  3 (3.00%) high mild
  12 (12.00%) high severe

Benchmarking scalar_regex_match_pattern_email_rows_1000_iter_100: Collecting 100 samples in estimated 6.0043 s (300 iterationsscalar_regex_match_pattern_email_rows_1000_iter_100
                        time:   [19.917 ms 19.941 ms 19.966 ms]
Found 10 outliers among 100 measurements (10.00%)
  10 (10.00%) high mild

Benchmarking binary_regex_match_pattern_email_rows_1000_iter_100: Collecting 100 samples in estimated 7.1549 s (300 iterationsbinary_regex_match_pattern_email_rows_1000_iter_100
                        time:   [23.450 ms 23.474 ms 23.503 ms]
Found 11 outliers among 100 measurements (11.00%)
  2 (2.00%) high mild
  9 (9.00%) high severe

Benchmarking scalar_regex_match_pattern_email_rows_2000_iter_10: Collecting 100 samples in estimated 5.2380 s (1300 iterationsscalar_regex_match_pattern_email_rows_2000_iter_10
                        time:   [3.9789 ms 3.9868 ms 3.9967 ms]
Found 15 outliers among 100 measurements (15.00%)
  7 (7.00%) high mild
  8 (8.00%) high severe

Benchmarking binary_regex_match_pattern_email_rows_2000_iter_10: Collecting 100 samples in estimated 5.3289 s (1200 iterationsbinary_regex_match_pattern_email_rows_2000_iter_10
                        time:   [4.3518 ms 4.3916 ms 4.4376 ms]
Found 18 outliers among 100 measurements (18.00%)
  4 (4.00%) high mild
  14 (14.00%) high severe

scalar_regex_match_pattern_email_rows_2000_iter_20
                        time:   [7.9475 ms 7.9527 ms 7.9590 ms]
Found 14 outliers among 100 measurements (14.00%)
  4 (4.00%) high mild
  10 (10.00%) high severe

binary_regex_match_pattern_email_rows_2000_iter_20
                        time:   [8.6322 ms 8.6389 ms 8.6473 ms]
Found 10 outliers among 100 measurements (10.00%)
  3 (3.00%) high mild
  7 (7.00%) high severe

scalar_regex_match_pattern_email_rows_2000_iter_50
                        time:   [19.879 ms 19.894 ms 19.911 ms]
Found 4 outliers among 100 measurements (4.00%)
  1 (1.00%) high mild
  3 (3.00%) high severe

binary_regex_match_pattern_email_rows_2000_iter_50
                        time:   [21.612 ms 21.639 ms 21.672 ms]
Found 15 outliers among 100 measurements (15.00%)
  8 (8.00%) high mild
  7 (7.00%) high severe

Benchmarking scalar_regex_match_pattern_email_rows_2000_iter_100: Collecting 100 samples in estimated 8.0362 s (200 iterationsscalar_regex_match_pattern_email_rows_2000_iter_100
                        time:   [39.893 ms 39.926 ms 39.961 ms]
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high severe

Benchmarking binary_regex_match_pattern_email_rows_2000_iter_100: Collecting 100 samples in estimated 8.6913 s (200 iterationsbinary_regex_match_pattern_email_rows_2000_iter_100
                        time:   [42.957 ms 42.982 ms 43.013 ms]
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe

scalar_regex_match_pattern_email_rows_100_iter_10
                        time:   [201.92 µs 202.26 µs 202.66 µs]
Found 5 outliers among 100 measurements (5.00%)
  3 (3.00%) high mild
  2 (2.00%) high severe

binary_regex_match_pattern_email_rows_100_iter_10
                        time:   [537.45 µs 538.49 µs 539.77 µs]
Found 16 outliers among 100 measurements (16.00%)
  4 (4.00%) high mild
  12 (12.00%) high severe

scalar_regex_match_pattern_email_rows_100_iter_20
                        time:   [403.24 µs 403.62 µs 404.01 µs]
Found 6 outliers among 100 measurements (6.00%)
  3 (3.00%) high mild
  3 (3.00%) high severe

Benchmarking binary_regex_match_pattern_email_rows_100_iter_20: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.5s, enable flat sampling, or reduce sample count to 60.
binary_regex_match_pattern_email_rows_100_iter_20
                        time:   [1.0758 ms 1.0787 ms 1.0823 ms]
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) high mild
  13 (13.00%) high severe

Benchmarking scalar_regex_match_pattern_email_rows_100_iter_50: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.2s, enable flat sampling, or reduce sample count to 60.
scalar_regex_match_pattern_email_rows_100_iter_50
                        time:   [1.0078 ms 1.0091 ms 1.0108 ms]
Found 10 outliers among 100 measurements (10.00%)
  5 (5.00%) high mild
  5 (5.00%) high severe

binary_regex_match_pattern_email_rows_100_iter_50
                        time:   [2.6853 ms 2.6906 ms 2.6973 ms]
Found 11 outliers among 100 measurements (11.00%)
  3 (3.00%) high mild
  8 (8.00%) high severe

Benchmarking scalar_regex_match_pattern_email_rows_100_iter_100: Collecting 100 samples in estimated 5.1116 s (2500 iterationsscalar_regex_match_pattern_email_rows_100_iter_100
                        time:   [2.0232 ms 2.0263 ms 2.0296 ms]
Found 2 outliers among 100 measurements (2.00%)
  2 (2.00%) high mild

Benchmarking binary_regex_match_pattern_email_rows_100_iter_100: Collecting 100 samples in estimated 5.4667 s (1000 iterationsbinary_regex_match_pattern_email_rows_100_iter_100
                        time:   [5.4049 ms 5.4203 ms 5.4381 ms]
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe

feat: scalar regex match physical expr

b770697

github-actions bot added physical-expr Physical Expressions core Core DataFusion crate proto Related to proto crate labels Aug 31, 2024

alamb mentioned this pull request Sep 7, 2024

Add documentation about performance PRs, add (TBD) section on feature criteria #12372

Merged

milenkovicm mentioned this pull request Sep 13, 2024

Specialized / Pre-compiled / Prepared ScalarUDFs #8051

Open

Merge branch 'main' into feature-scalar_regexp_match_expr

a600f4d

bench: add scalar regex match benchmarks

e9fc6c7

zhuliquan force-pushed the feature-scalar_regexp_match_expr branch from d49edca to e9fc6c7 Compare September 17, 2024 17:03

zhuliquan and others added 5 commits September 18, 2024 20:52

feat: apply scalar_regex_match optimize to similar_to case

47269be

Merge branch 'main' into feature-scalar_regexp_match_expr

7589539

minor: regen datafusion protobuf

7bd1fe4

Merge branch 'main' into feature-scalar_regexp_match_expr

aa2eed2

Merge branch 'main' into feature-scalar_regexp_match_expr

f1a81a7

zhuliquan force-pushed the feature-scalar_regexp_match_expr branch from 9f02ab6 to f1a81a7 Compare October 29, 2024 16:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: scalar regex match physical expr #12270

feat: scalar regex match physical expr #12270

zhuliquan commented Aug 31, 2024

alamb commented Sep 7, 2024

zhuliquan commented Sep 17, 2024

feat: scalar regex match physical expr #12270

Are you sure you want to change the base?

feat: scalar regex match physical expr #12270

Conversation

zhuliquan commented Aug 31, 2024

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

alamb commented Sep 7, 2024

zhuliquan commented Sep 17, 2024