Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[fix](merge-iterator) fix NOT_IMPLEMENTED_ERROR when read next block view #32961

Merged
merged 1 commit into from
Mar 28, 2024

Conversation

liaoxin01
Copy link
Contributor

@liaoxin01 liaoxin01 commented Mar 28, 2024

Proposed changes

Issue Number: close #xxx

W0327 06:03:44.308210 6588 beta_rowset_reader.cpp:350] failed to read next block view: [NOT_IMPLEMENTED_ERROR]to be implemented

Further comments

If this is a relatively large or complex change, kick off the discussion at [email protected] by explaining why you chose the solution you did and what alternatives you considered, etc...

@doris-robot
Copy link

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

@liaoxin01
Copy link
Contributor Author

run buildall

Copy link
Contributor

PR approved by anyone and no changes requested.

Copy link
Contributor

clang-tidy review says "All clean, LGTM! 👍"

@doris-robot
Copy link

TPC-H: Total hot run time: 37834 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 77d79e3617dbde0033f1c438591480f7b1264a01, data reload: false

------ Round 1 ----------------------------------
q1	18418	4938	4232	4232
q2	2528	163	159	159
q3	11195	1133	1224	1133
q4	10630	789	798	789
q5	7640	2974	2946	2946
q6	210	131	135	131
q7	1080	613	582	582
q8	9517	2044	2018	2018
q9	7061	6487	6319	6319
q10	8402	3460	3613	3460
q11	455	226	230	226
q12	372	204	194	194
q13	17811	2858	2863	2858
q14	230	203	202	202
q15	518	464	476	464
q16	504	369	368	368
q17	954	642	587	587
q18	7222	6429	6375	6375
q19	3405	1395	1449	1395
q20	565	256	248	248
q21	3507	2871	2853	2853
q22	343	300	295	295
Total cold run time: 112567 ms
Total hot run time: 37834 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4139	4074	4051	4051
q2	322	229	240	229
q3	3023	2880	2860	2860
q4	1837	1575	1521	1521
q5	5206	5264	5240	5240
q6	198	117	114	114
q7	2198	1819	1791	1791
q8	3159	3311	3262	3262
q9	8410	8367	8434	8367
q10	3809	3821	3741	3741
q11	538	448	449	448
q12	730	556	564	556
q13	16914	2863	2858	2858
q14	281	251	260	251
q15	517	468	460	460
q16	468	420	426	420
q17	1756	1514	1470	1470
q18	7477	7107	7091	7091
q19	1614	1563	1511	1511
q20	1924	1737	1689	1689
q21	4761	4580	4657	4580
q22	510	458	455	455
Total cold run time: 69791 ms
Total hot run time: 52965 ms

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 35.62% (8833/24797)
Line Coverage: 27.34% (72388/264813)
Region Coverage: 26.56% (37537/141349)
Branch Coverage: 23.37% (19148/81946)
Coverage Report: http://coverage.selectdb-in.cc/coverage/77d79e3617dbde0033f1c438591480f7b1264a01_77d79e3617dbde0033f1c438591480f7b1264a01/report/index.html

@doris-robot
Copy link

TPC-DS: Total hot run time: 180647 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 77d79e3617dbde0033f1c438591480f7b1264a01, data reload: false

query1	964	383	346	346
query2	6548	1981	1904	1904
query3	6707	215	209	209
query4	31675	21339	21317	21317
query5	4314	394	384	384
query6	284	194	168	168
query7	4629	288	282	282
query8	239	173	167	167
query9	9394	2342	2328	2328
query10	542	240	252	240
query11	17283	14174	14339	14174
query12	133	92	88	88
query13	1629	415	419	415
query14	9593	7156	7278	7156
query15	219	181	178	178
query16	7954	266	260	260
query17	1950	580	548	548
query18	2015	284	289	284
query19	262	155	152	152
query20	94	84	90	84
query21	209	138	133	133
query22	5008	4777	4825	4777
query23	33458	32518	32784	32518
query24	12068	2828	2852	2828
query25	664	381	382	381
query26	1885	152	152	152
query27	3206	342	361	342
query28	7736	1913	1901	1901
query29	1081	618	608	608
query30	303	147	149	147
query31	982	713	729	713
query32	98	57	59	57
query33	775	255	247	247
query34	1106	474	486	474
query35	812	623	598	598
query36	1023	880	875	875
query37	277	68	66	66
query38	3585	3464	3431	3431
query39	1523	1441	1413	1413
query40	301	115	112	112
query41	50	48	48	48
query42	110	95	95	95
query43	503	470	463	463
query44	1223	750	723	723
query45	285	251	246	246
query46	1116	682	695	682
query47	1899	1903	1808	1808
query48	451	363	345	345
query49	1224	354	341	341
query50	760	368	363	363
query51	6816	6790	6846	6790
query52	107	90	96	90
query53	344	275	270	270
query54	330	251	233	233
query55	86	78	84	78
query56	251	233	222	222
query57	1230	1132	1127	1127
query58	235	218	206	206
query59	2791	2762	2537	2537
query60	267	248	253	248
query61	130	94	94	94
query62	617	407	434	407
query63	308	275	278	275
query64	6295	4039	3919	3919
query65	3114	3034	3065	3034
query66	1431	380	361	361
query67	15621	15219	15077	15077
query68	8732	539	533	533
query69	617	369	387	369
query70	1230	1117	1174	1117
query71	512	265	261	261
query72	6271	2524	2342	2342
query73	890	316	309	309
query74	8087	6302	6337	6302
query75	3775	2204	2276	2204
query76	5376	874	900	874
query77	643	256	250	250
query78	10903	10167	10215	10167
query79	8810	518	518	518
query80	1588	375	366	366
query81	514	227	220	220
query82	380	84	87	84
query83	221	144	142	142
query84	300	77	77	77
query85	1086	327	316	316
query86	356	310	263	263
query87	3719	3524	3576	3524
query88	4437	2307	2307	2307
query89	506	376	375	375
query90	2095	182	183	182
query91	186	148	148	148
query92	64	51	50	50
query93	6612	499	483	483
query94	1215	183	182	182
query95	432	337	337	337
query96	599	261	270	261
query97	2705	2511	2513	2511
query98	230	221	213	213
query99	1082	821	842	821
Total cold run time: 314295 ms
Total hot run time: 180647 ms

@doris-robot
Copy link

Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit 77d79e3617dbde0033f1c438591480f7b1264a01 with default session variables
Stream load json:         18 seconds loaded 2358488459 Bytes, about 124 MB/s
Stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
Stream load parquet:      31 seconds loaded 861443392 Bytes, about 26 MB/s
Insert into select:       13.8 seconds inserted 10000000 Rows, about 724K ops/s

@liaoxin01
Copy link
Contributor Author

run p1

Copy link
Contributor

@dataroaring dataroaring left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Mar 28, 2024
Copy link
Contributor

PR approved by at least one committer and no changes requested.

@dataroaring dataroaring merged commit 634b8aa into apache:master Mar 28, 2024
27 of 32 checks passed
Jibing-Li added a commit that referenced this pull request Mar 29, 2024
* [fix](merge cloud) Fix cloud be set be tag map (#32864)

* [chore] Add gavinchou to collaborators (#32881)

* [chore](show) support statement to show views from table (#32358)

MySQL [test]> show views;
+----------------+
| Tables_in_test |
+----------------+
| t1_view        |
| t2_view        |
+----------------+
2 rows in set (0.00 sec)

MySQL [test]> show views like '%t1%';
+----------------+
| Tables_in_test |
+----------------+
| t1_view        |
+----------------+
1 row in set (0.01 sec)

MySQL [test]> show views where create_time > '2024-03-18';
+----------------+
| Tables_in_test |
+----------------+
| t2_view        |
+----------------+
1 row in set (0.02 sec)

* [Enhancement](ranger) Disable some permission operations when Ranger or LDAP are enabled (#32538)

Disable some permission operations when Ranger or LDAP are enabled.

* [chore](ci) exclude unstable trino_connector case (#32892)

Co-authored-by: stephen <[email protected]>

* [fix](Nereids) NPE when create table with implicit index type (#32893)

* [improvement](mtmv) Support more join types for query rewriting by materialized view (#32685)

This pattern of rewriting is supported for multi-table joins and supported join types is as following:

INNER JOIN
LEFT OUTER JOIN
RIGHT OUTER JOIN
FULL OUTER JOIN
LEFT SEMI JOIN
RIGHT SEMI JOIN
LEFT ANTI JOIN
RIGHT ANTI JOIN

* [Serde](Variant) support arrow serialization for varint type (#32780)

* [fix](multicatalog) fix no data error when read hive table on cosn (#32815)

Currently, when reading a hive on cosn table, doris return empty result, but the table has data.
iceberg on cosn is ok.
The reason is misuse of cosn's file sytem. according to cosn's doc, its fs.cosn.impl should be org.apache.hadoop.fs.CosFileSystem

* [fix](nereids)EliminateGroupByConstant should replace agg's output after removing constant group by keys (#32878)

* [Fix](executor)Fix regression test for test_active_queries/test_backend_active_tasks #32899

* [fix](iceberg) fix iceberg catalog bug and p2 test cases (#32898)

1. Fix iceberg catalog bug

    This PR #30198 change the logic of `IcebergHMSExternalCatalog.java`,
    to get locationUrl by calling hive metastore's `getCatalog()` method.
    But this method only exists in hive 3+. So it will fail if we using hive 2.x.

    I temporary remove this logic, because this logic is only used from iceberg table writing.
    Which is still under development. We will rethink this logic later.

2. Fix test cases

    Some of P2 test cases missed `order_qt`. And because the output format of the floating point
    type is changed, some result in `out` files need to be regenerated.

* [revert](jni) revert part of #32455 (#32904)

* [fix](spill) Avoid releasing resources while spill tasks are executing (#32783)

* [chore](log) print query id before logging profile in be.INFO (#32922)

* [fix](grace-exit) Stop incorrectly of reportwork cause heap use after free #32929

* [improvement](decommission be) decommission check replica num (#32748)

* [fix](arrow-flight) Fix reach limit of connections error (#32911)

Fix Reach limit of connections error
in fe.conf , arrow_flight_token_cache_size is mandatory less than qe_max_connection/2. arrow flight sql is a stateless protocol, connection is usually not actively disconnected, bearer token is evict from the cache will unregister ConnectContext.

Fix ConnectContext.command not be reset to COM_SLEEP in time, this will result in frequent kill connection after query timeout.

Fix bearer token evict log and exception.

TODO: use arrow flight session: https://mail.google.com/mail/u/0/#inbox/FMfcgzGxRdxBLQLTcvvtRpqsvmhrHpdH

* [bugfix](cloud) few variable not initialized (#32868)

../../cloud/src/recycler/meta_checker.cpp
can cause uninitialised memory read.

* [fix](arrow-flight) Fix arrow flight sql compatible with JDK 17 and upgrade arrow 15.0.2 (#32796)

--add-opens=java.base/java.nio=ALL-UNNAMED, see: https://arrow.apache.org/docs/java/install.html#java-compatibility
groovy use flight sql connection to execute query SUM(MAX(c1) OVER (PARTITION BY)) report error: AGGREGATE clause must not contain analytic expressions, but no problem in Java execute it with jdbc::arrow-flight-sql.
groovy not support print arrow array type, throw IndexOutOfBoundsException.
"arrow_flight_sql" not support two phase read
./run-regression-test.sh --run --clean -g arrow_flight_sql

* [fix](spill) SpillStream's writer maybe may not have been finalized (#32931)

* [improvement](spill) Disable DistinctStreamingAgg when spill is enabled (#32932)

* [Improve](inverted_index) update clucene and improve array inverted index writer  (#32436)

* [Performance](exec) replace SipHash in function by XXHash (#32919)

* [feature](agg) add aggregate function sum0 (#32541)

* [improvement](mtmv) Support to get tables in materialized view when collecting table in plan (#32797)

Support to get tables in materialized view when collecting table in plan

table scehma as fllowing:

create materialized view mv1
BUILD IMMEDIATE REFRESH COMPLETE ON MANUAL
DISTRIBUTED BY RANDOM BUCKETS 1 
PROPERTIES ('replication_num' = '1')
 as 
select 
  t1.c1, 
  t3.c2 
from 
  table1 t1 
  inner join table3 t3 on t1.c1 = t3.c2

if get table from the plan as follwoing, we can get [table1, table3, table2], the mv1 is expanded to get base tables;

SELECT 
  mv1.*, 
  uuid() 
FROM 
  mv1 LEFT SEMI 
  JOIN table2 ON mv1.c1 = table2.c1 
WHERE 
  mv1.c1 IN (
    SELECT 
      c1 
    FROM 
      table2
  ) 
  OR mv1.c1 < 10

* [enhance](mtmv)support olap table partition column is null (#32698)

* [enhancement](cloud) add table version to cloud (#32738)

Add table version to cloud.

In Fe:
Get: If Fe is cloud mode, get table version from meta service.
Update: Op drop/replace temp partition, commit transaction.

In meta service:
Add: create Index. init value is 1.
Remove: by recycler.
Update: commit/drop partition rpc, commit txn rpc. Atomic++.

* [fix](cloud) schema change from not null to null (#32913)

1. Use equals instead of == for type comparing
2. null bitmap size is reisze by size of ref column.

* [feature](Nereids): add ColumnPruningPostProcessor. (#32800)

* [case](rowpolicy)fix row policy has been exist (#32880)

* [fix](pipeline) fix use error row desc when origin block clear (#32803)

* [fix](Nereids) support variant column with index when create table (#32948)

* [opt](Nereids) support create table with variant type (#32953)

* [test](insert-overwrite) Add insert overwrite auto detect concurrency cases (#32935)

* [fix](compile) fe cannot compile in idea (#32955)

* [enhancement](plsql) Support select * from routines (#32866)

Support show of plsql procedure using select * from routines.

* [fix](trino-connector) fix `NoClassDefFoundError` of hudi `Utils` class (#32846)

Due to the change of this PR #32455 , the `trino-connector-scanner` package cannot access the `hudi_scanner` package, so the exception NoclassDeffounderror will appear.

We need to write a separate Utils class.

* [exec](column) change some complex column move to noexcept (#32954)

* [Enhancement](data skew) extends show data skew (#32732)

* [chore](test) let suite compatible with Nereids (#32964)

* Support identical column name in different index. (#32792)

* Limit the max string length to 1024 while collecting column stats to control BE memory usage. (#32470)

* [fix](merge-iterator) fix NOT_IMPLEMENTED_ERROR when read next block view (#32961)

* [improvement](executor)Add tag property for workload group #32874

* [fix](auth)unified workload and resource permission logic (#32907)

- `Grant resource` can no longer grant global `usage_priv`
-  `grant resource %` instead of `grant resource *`

before change:
```
grant usage_priv on resource * to f;
show grants for f\G
*************************** 1. row ***************************
      UserIdentity: 'f'@'%'
           Comment: 
          Password: No
             Roles: 
       GlobalPrivs: Usage_priv 
      CatalogPrivs: NULL
     DatabasePrivs: internal.information_schema: Select_priv ; internal.mysql: Select_priv 
        TablePrivs: NULL
          ColPrivs: NULL
     ResourcePrivs: NULL
 CloudClusterPrivs: NULL
WorkloadGroupPrivs: normal: Usage_priv 
```
after change
```
grant usage_priv on resource '%' to f;
show grants for f\G
*************************** 1. row ***************************
      UserIdentity: 'f'@'%'
           Comment: 
          Password: No
             Roles: 
       GlobalPrivs: NULL
      CatalogPrivs: NULL
     DatabasePrivs: internal.information_schema: Select_priv ; internal.mysql: Select_priv 
        TablePrivs: NULL
          ColPrivs: NULL
     ResourcePrivs: %: Usage_priv 
 CloudClusterPrivs: NULL
WorkloadGroupPrivs: normal: Usage_priv 

```

---------

Co-authored-by: yujun <[email protected]>
Co-authored-by: Gavin Chou <[email protected]>
Co-authored-by: xy720 <[email protected]>
Co-authored-by: yongjinhou <[email protected]>
Co-authored-by: Dongyang Li <[email protected]>
Co-authored-by: stephen <[email protected]>
Co-authored-by: morrySnow <[email protected]>
Co-authored-by: seawinde <[email protected]>
Co-authored-by: lihangyu <[email protected]>
Co-authored-by: Yulei-Yang <[email protected]>
Co-authored-by: starocean999 <[email protected]>
Co-authored-by: wangbo <[email protected]>
Co-authored-by: Mingyu Chen <[email protected]>
Co-authored-by: Jerry Hu <[email protected]>
Co-authored-by: zhiqiang <[email protected]>
Co-authored-by: Xinyi Zou <[email protected]>
Co-authored-by: Vallish Pai <[email protected]>
Co-authored-by: amory <[email protected]>
Co-authored-by: HappenLee <[email protected]>
Co-authored-by: Jensen <[email protected]>
Co-authored-by: zhangdong <[email protected]>
Co-authored-by: Yongqiang YANG <[email protected]>
Co-authored-by: jakevin <[email protected]>
Co-authored-by: Mryange <[email protected]>
Co-authored-by: zclllyybb <[email protected]>
Co-authored-by: Tiewei Fang <[email protected]>
Co-authored-by: Xin Liao <[email protected]>
liaoxin01 added a commit to liaoxin01/doris that referenced this pull request Apr 2, 2024
mongo360 pushed a commit to mongo360/doris that referenced this pull request Aug 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by one committer. dev/2.0.8-merged reviewed
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants