[Bug]: fulltext index search in tpch 100g lineitem oom #20213

Open
tom-csf opened this issue Nov 20, 2024 · 15 comments
Labels: kind/bug (Something isn't working), phase/testing, severity/s1 (High impact: Logical errors or data errors that must occur)

tom-csf commented Nov 20, 2024

Is there an existing issue for the same bug?

  • I have checked the existing issues.

Branch Name

main

Commit ID

c93bbbd

Other Environment Information

- Hardware parameters:
- OS type:
- Others:

Actual Behavior

(Two WeCom screenshots attached showing the failure.)

Expected Behavior

No response

Steps to Reproduce

1. Load 100G TPC-H data into MO.
2. set experimental_fulltext_index=1;
   create fulltext index fdx on lineitem(L_COMMENT);
3. select * from lineitem where match(l_comment) against('"olphins nag slyly after the regular packa"' in boolean mode);

Additional information

No response

@tom-csf tom-csf added kind/bug Something isn't working needs-triage severity/s0 Extreme impact: Cause the application to break down and seriously affect the use labels Nov 20, 2024
@tom-csf tom-csf added this to the 2.0.1 milestone Nov 20, 2024
@aronchanisme (Contributor)

Fulltext index issue. Hi Eric @cpegeric, could you please kindly help take a look? Thanks.

cpegeric (Contributor) commented Nov 20, 2024

mysql> select count(*) from lineitem where match(l_comment) against('deposits' in boolean mode);
ERROR 20101 (HY000): internal error: Invalid alloc size 1147486208


@ouyuanning (Contributor)

OOM during the hash build. Not sure whether this is a similar problem to #20236.
@badboynt1 please help locate it first.

badboynt1 (Contributor) commented Nov 21, 2024

This is not the same problem as #20236. I can create the index successfully on 129, but the query hangs when I run it; it looks like this query needs more than 10 minutes to run? There is no error and no OOM. @cpegeric

@badboynt1 badboynt1 assigned cpegeric and unassigned badboynt1 Nov 21, 2024
mergify bot pushed a commit that referenced this issue Nov 25, 2024
…rser (#20269)

bug fixes for #20217 #20213 #20175

1. limit the batch size to 8192 on both fulltext_index_scan() and fulltext_tokenize() function
2. In fulltext_index_scan function, create a new thread to evaluate the score in 8192 documents per batch instead of waiting for all results from SQL.  It will speed up and avoid OOM in the function.  However, the score will be calculated based on each mini-batch instead of complete batch.  I think it doesn't matter as long as we have the correct answer.
3. support json_value parser
4. Pre-allocation of memory in fulltext_tokenize() function to avoid malloc
5. add monpl tokenizer repo to matrixone
6. bug fix json tokenizer to truncate value and increase the limit to 127 bytes
7. pushdown limit

Approved by: @badboynt1, @zhangxu19830126, @m-schen, @fengttt, @aunjgr, @ouyuanning, @sukki37, @aressu1985, @heni02, @XuPeng-SH, @qingxinhome
@cpegeric cpegeric assigned badboynt1 and unassigned cpegeric Nov 28, 2024
@badboynt1 (Contributor)

select * from lineitem where match(l_comment) against('"olphins nag slyly after the regular packa"' in boolean mode);

This query still hangs in my local environment, or needs more than ten minutes to run.

Yet even a full table scan, select * from lineitem where l_comment like "%olphins nag slyly after the regular packa%";
returns within a few seconds. It looks like there is something wrong with the fulltext index implementation.
@cpegeric

@badboynt1 badboynt1 assigned cpegeric and unassigned badboynt1 Nov 29, 2024
mergify bot pushed a commit that referenced this issue Nov 29, 2024
…rser (#20230)

bug fixes for #20217 #20213 #20175 #20149
and add json_value parser

1. limit the batch size to 8192 on both fulltext_index_scan() and fulltext_tokenize() function
2. In fulltext_index_scan function, create a new thread to evaluate the score in 8192 documents per batch instead of waiting for all results from SQL.  It will speed up and avoid OOM in the function.  However, the score will be calculated based on each mini-batch instead of complete batch.  I think it doesn't matter as long as we have the correct answer.
3. support json_value parser
4. Pre-allocation of memory in fulltext_tokenize() function to avoid malloc
5.  bug fix #20149 Delete table.   pkPos, pkType is needed but (doc_id, INT) is given.
6. add monpl tokenizer repo to matrixone
7. bug fix json tokenizer to truncate value and increase the limit to 127 bytes
8. pushdown limit
9. bug fix #20311. data race occurred during bvt test
10. alter table drop column with fulltext index
11. SQL executor add streaming mode.

Approved by: @fengttt, @badboynt1, @zhangxu19830126, @m-schen, @aunjgr, @ouyuanning, @aressu1985, @XuPeng-SH, @sukki37, @qingxinhome
cpegeric (Contributor) commented Dec 2, 2024

The slow query in the fulltext index is:

select * from `__mo_index_secondary_019386fe-f00c-7b42-8f99-ba608027eee7` where word in ('olphins', 'nag', 'slyly', 'after', 'the', 'regular', 'packa') order by doc_id;

order by is slow when the number of rows is large.

mysql> select count(*) from `__mo_index_secondary_019386fe-f00c-7b42-8f99-ba608027eee7` where word in ('olphins', 'nag', 'slyly', 'after', 'the', 'regular', 'packa');
+-----------+
| count(*)  |
+-----------+
| 309164415 |
+-----------+

Possible solutions:

  1. make ORDER BY faster
  2. use stop words
  3. return a vector to represent the text. Right now multiple rows are returned per doc_id, which is why we need to sort by doc_id.
  4. use our own index file format, which would let us sort the index in advance instead of sorting on the fly
  5. use a disk hashtable to speed up the word -> doc_id lookup: https://github.com/rosedblabs/diskhash
  6. use a disk cache: https://github.com/peterbourgon/diskv

@fengttt fengttt modified the milestones: 2.0.1, 2.1.0 Dec 4, 2024
fengttt (Contributor) commented Dec 4, 2024

First, let's double-check the fulltext index table schema and its primary key / cluster by key.

Second, the following query is not going to work well

select * from `__mo_index_secondary_019386fe-f00c-7b42-8f99-ba608027eee7` where word in ('olphins', 'nag', 'slyly', 'after', 'the', 'regular', 'packa') order by doc_id;

I would rather issue the following

with 
kw1 as (select docid, to_array(position) as pos from __mo_index where word = 'olphins' group by docid),
kw2 as (select docid, to_array(position) as pos from __mo_index where word = 'nag' group by docid),
...
select kw1.docid, kw1.pos as pos1, kw2.pos as pos2 ... kw6.pos6
from kw1, kw2, ... kw6
where kw1.docid = kw2.docid and kw1.docid = kw3.docid ....    

Note that we need a new agg function to_array; otherwise the join could become a cross product of positions and explode. Of course, for a phrase query we don't need this to_array, since we can put the position check directly in the join condition:

where kw1.docid = kw2.docid ....     AND kw1.pos + 1 = kw2.pos and kw2.pos +1 = kw3.pos ...

But there could be more complex position processing inside the fulltext function -- such as 'foo NEAR bar', etc, so to_array seems to be a better solution.
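
A concrete sketch of that plan for the first three keywords of the query in this issue, assuming the proposed to_array aggregate existed (it does not yet) and using __mo_index as a stand-in name for the secondary index table:

-- hypothetical: to_array is the proposed aggregate; table and column names are illustrative
with
kw1 as (select docid, to_array(position) as pos from __mo_index where word = 'olphins' group by docid),
kw2 as (select docid, to_array(position) as pos from __mo_index where word = 'nag' group by docid),
kw3 as (select docid, to_array(position) as pos from __mo_index where word = 'slyly' group by docid)
select kw1.docid, kw1.pos as pos1, kw2.pos as pos2, kw3.pos as pos3
from kw1, kw2, kw3
where kw1.docid = kw2.docid and kw1.docid = kw3.docid;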

cpegeric (Contributor) commented Dec 5, 2024

Checked: CLUSTER BY word gives some speed improvement, but ORDER BY is not going to work.

Only phrase search can use JOIN. For an OR operation like natural language mode, we cannot use JOIN at all. I think we need to spill the data from SQL into temporary files when the data size is large:

  1. store the SQL result into a map, just like we do now
  2. when the data size exceeds the limit, sort the data by doc_id and save it to a temp file
  3. after getting all results from SQL, we will have several temp files, each sorted by doc_id
  4. use a heap to get the sorted doc_ids across words by taking the top item from each file
  5. merge the doc_id into the data structure
  6. when the number of doc_ids reaches the 8192 limit, process the batch
  7. go back to step 4 to read more data

Note: use an ordered map so we don't have to sort the keys before saving the file.
https://github.com/elliotchance/orderedmap

fengttt (Contributor) commented Dec 8, 2024

I think it should be CLUSTER BY at least (word, docid), or even (word, docid, position).

Next, I am really curious what the performance is if we just issue:

where ...
AND kw1.docid = kw2.docid ....     AND kw1.pos + 1 = kw2.pos and kw2.pos +1 = kw3.pos ...

I don't see any reason this could be bad.
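
A minimal sketch of that setup with illustrative names (the real table name, column names, and exact CLUSTER BY syntax may differ), for a three-word phrase "w1 w2 w3":

-- hypothetical index table clustered so that word lookups come back ordered
create table __mo_index (word varchar(256), doc_id bigint, pos int) cluster by (word, doc_id, pos);

-- phrase match: self-join on doc_id plus consecutive-position predicates
select kw1.doc_id
from __mo_index kw1, __mo_index kw2, __mo_index kw3
where kw1.word = 'w1' and kw2.word = 'w2' and kw3.word = 'w3'
  and kw1.doc_id = kw2.doc_id and kw2.doc_id = kw3.doc_id
  and kw1.pos + 1 = kw2.pos and kw2.pos + 1 = kw3.pos;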

fengttt (Contributor) commented Dec 8, 2024

> Checked: CLUSTER BY word gives some speed improvement, but ORDER BY is not going to work. Only phrase search can use JOIN. For an OR operation like natural language mode, we cannot use JOIN at all. I think we need to spill the data from SQL into temporary files when the data size is large. […]

I still don't see any reason/benefit for the ORDER BY. Just avoid it.

And for OR, it is not a join but a UNION ALL followed by a GROUP BY. Basically, translate every fulltext query into proper SQL -- our query engine should be faster than any hand-rolled code, and if it is not, we should optimize our query engine.
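
A minimal sketch of that translation for an OR over three keywords, reusing the illustrative __mo_index name:

-- hypothetical: one scan per keyword, UNION ALLed, then grouped per document;
-- matched_words (how many distinct search words a document contains) can feed the score
select doc_id, count(distinct word) as matched_words
from (
    select doc_id, word from __mo_index where word = 'hello'
    union all
    select doc_id, word from __mo_index where word = 'happy'
    union all
    select doc_id, word from __mo_index where word = 'world'
) t
group by doc_id;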

fengttt (Contributor) commented Dec 8, 2024

> But there could be more complex position processing inside the fulltext function -- such as 'foo NEAR bar', etc, so to_array seems to be a better solution.

Now I believe foo NEAR bar should simply be compiled to foo.pos > bar.pos - 100 and foo.pos < bar.pos + 100, supposing NEAR means a position distance within 100. Why not?
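
For illustration, that compilation could look like the following (the illustrative __mo_index table again; the window of 100 is only an assumed example):

-- hypothetical translation of 'foo NEAR bar' into a position-window join
select kw1.doc_id
from __mo_index kw1, __mo_index kw2
where kw1.word = 'foo' and kw2.word = 'bar'
  and kw1.doc_id = kw2.doc_id
  and kw2.pos > kw1.pos - 100
  and kw2.pos < kw1.pos + 100;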

cpegeric (Contributor) commented Dec 9, 2024

I changed my mind on the design.

  1. For large data scale, we need a hashtable with spill (do we have one?).
  2. For the OR operation:
  • ignore position
  • minimize the memory footprint of the hashtable. Say the search has 3 words ["hello", "happy", "world"] with internal indexes [0, 1, 2]. The SQL results for the individual words are grouped by doc_id into a single row holding a []uint8 value that stores, per word, how often that word appears in the document. Say doc_id=1 contains the words "hello" and "world": the two rows from SQL [{0, doc_id("hello")}, {2, doc_id("world")}] aggregate into the uint8 array [1, 0, 1], and the hashtable is defined as map[doc_id][]uint8. This minimizes the hashtable size, and []uint8 is similar to a vector, so the score can be evaluated per vector instead of per batch. This new GROUP BY operation is currently done in the table function; maybe it would be better to implement it in the query engine (see the sketch after this list)?
  3. For the AND operation (phrase search), use JOIN and filter on position to do the phrase search. We will get the answer immediately from SQL.
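
If that GROUP BY were pushed into the query engine, a rough SQL equivalent of the per-document word vector could look like this (illustrative __mo_index name; the actual design keeps the counts in a Go map[doc_id][]uint8 rather than in separate columns):

-- hypothetical: one row per doc_id, one occurrence count per search word
select doc_id,
       sum(case when word = 'hello' then 1 else 0 end) as w0,
       sum(case when word = 'happy' then 1 else 0 end) as w1,
       sum(case when word = 'world' then 1 else 0 end) as w2
from __mo_index
where word in ('hello', 'happy', 'world')
group by doc_id;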

cpegeric (Contributor) commented Dec 9, 2024

It is even worse with SELECT UNION: ERROR 20101 (HY000): internal error: mpool memory allocation exceed limit with requested size 1147486208

SELECT doc_id from table where word in (w1, ... wn) has a smaller memory footprint.
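
For reference, that narrower scan has roughly this shape, using the index table name from the earlier comment:

-- fetch only doc_id for the matched words instead of SELECT * or a UNION of per-word scans
select doc_id
from `__mo_index_secondary_019386fe-f00c-7b42-8f99-ba608027eee7`
where word in ('olphins', 'nag', 'slyly', 'after', 'the', 'regular', 'packa');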


@Ariznawlll (Contributor)

testing

@Ariznawlll (Contributor)

commit: 2b6ab7e
Test steps:

1. Load 100G TPC-H data into MO.
2. set experimental_fulltext_index=1;
   create fulltext index fdx on lineitem(L_COMMENT);
3. select * from lineitem where match(l_comment) against('"olphins nag slyly after the regular packa"' in boolean mode);

(Screenshot of the query result attached.)

The local test passed.

@Ariznawlll Ariznawlll added severity/s1 High impact: Logical errors or data errors that must occur and removed severity/s0 Extreme impact: Cause the application to break down and seriously affect the use labels Dec 26, 2024