Spike/dega 1748 #7

arodionov53 · 2024-03-20T15:34:59Z

Adding functionality for boolean expression factorization - running several matches against several be-trees in sequence.
See Troubleshooting tools: Targeting, Creatives, and Frequency Capping.

vs-ads · 2024-03-21T14:25:11Z

My comments are rather suggestions, they are not conditions for approval.
It is at your discretion to follow them.
In the order of importance.

It is not clear how the high-level business requirements the PR refers to,
https://adgear.atlassian.net/browse/PEDSP-1431,
https://adgear.atlassian.net/wiki/spaces/AGILE/pages/19867599792/Troubleshooting+tool+Internal+loss+reasons,
connected to the implementation.
If I wouldn't participate in one discussion before I wouldn't know how.
Would be beneficial to have a Tech document mapping business requirements to the implementation;
It is not clear what are use cases for the functionality introduced by the PR because of the absence of a Tech doc.
I can only guess what the use cases are.
I pushed the commit with the tests to verify that I understand the main use case properly.
If the tests do not reflect the intended use case please let me know. I will revoke the commit then.
7073b52
erl_betree:betree_make_event creates MEM_EVENT resource.
It could make search for the event at the same invocation as well to save one round trip to the NIF;
When Ids are returned from NIF they also can be kept with MEM_EVENT resource.
This would allow to avoid passing the Ids back to the NIF and unpack them from the list, like in
betree.c, ERL_NIF_TERM nif_betree_search_evt_ids(..., lines 1087-1101
c_src/build_betree script refers to v1.2.1 tag of http://github.com/adgear/be-tree
tag v1.3.0 on be-tree would reflect that there are major (feature level) changes.
When resolving merge conflicts I didn't notice that order of functions in betree.c get re-arranged and logically related functions are far from each other.
Let me know if you want me to address that and I'll put them in the logically consistent order.
I extended benchmarks to use erl_betree:betree_make_event functionality.
Let me know if you think that this commit should not be a part of the PR. I will revoke the commit then.
83bc082 (edited)

arodionov53 · 2024-03-21T16:50:44Z

Victor, thank you for your comments.

I can only agree. We need to defined what information we collecting, where we put it and for what reason. This will be very helpful for teams that will work with the data.
I think that we have to change your example to make it more realistic.
Not sure that I do understand you fully The reason for creating this structure is that bid request contains information about almost one hundred variables, some on which may have list of thousand elements long. Converting this information from Erlang to C structures takes considerable time. So not to repeat this process for every be-tree in the sequence I do it once at the very beginning.
[VS] My suggestion is to extend erl_betree:betree_make_event functionality:

MEM_EVENT resource will be returned as in current implementation;
before returning the MEM_EVENT to perform the first search.
[AR] The first step is the longest one and I want to make time spend in NIF shorter. For this reason I did not incorporate MEM_EVENT creation in the first step.

I considered this option. The reason why I do not go this way is: 1. these ids are required in Erlang for reporting. 2. The lists are short and can be converted to C structures fast. I have some ideas at this point how to improved performance close to your suggestion but decided to make later.
[VS] My suggestion is to extend:

along with returning the Ids to Erlang level;
save these Ids for the next use in MEM_EVENT;
[AR] - This definitely can be done but then there are two choices:
Creating a new copy of MEM_EVENT - then some extra cost will be paid to GC
Modify MEM_EVENT - I tried to follow principles of functional programming
But the main reason - I do not feel that this can give much .

You are right I will change this. The reason why I choose 1.2. is that you changes already made a huge change.
Please do. Thank you.
[VS] Ok.
What are results? I'm not sure that I understand your test.
[VS] The benchmarks:

provide observations for memory allocations;
collect statistics how much time is spent to evaluate events;
compare statistics for different implementations;
write evaluations outputs to file;
compare evaluations output for different implementations.
[AR] The problem is that we need events created in production. They may contain huge data structures (mostly lists).

arodionov53 added 9 commits February 5, 2024 15:46

new function betree_write_dot added

1706771

new functionality - search with id lists, added

12afc5e

new test added; using latest be-tree

cb3fcc2

new feature with using event created as C object is added

434ada3

change betree_make_event to make usage from rtb-gateway more convinient

090476a

some more simplifications for rtb_gateway usage

9f7b079

betree_make_event returns execution time

0f42d0a

memory allocation for event fixed

ef669ff

now all betree_search return sorted id list

48c2f5a

arodionov53 requested a review from a team as a code owner March 20, 2024 15:35

arodionov53 and others added 5 commits March 20, 2024 13:20

using v1.2.1 be-tree

765747d

resolve merge conflicts

a08897a

no changes almost

2a2d4bc

betree_search_tests - added tests with multiple Betrees

7073b52

added 'with_event' related benchmark functions

83bc082

betree.c - re-ordered functions according to 'nif_functions' table

a59adc4

vs-ads approved these changes Mar 21, 2024

View reviewed changes

be-tree tag changed

d7c5a14

saleyn approved these changes Mar 21, 2024

View reviewed changes

arodionov53 merged commit c1cd823 into master Mar 25, 2024
1 of 2 checks passed

arodionov53 deleted the spike/DEGA-1748 branch March 25, 2024 18:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spike/dega 1748 #7

Spike/dega 1748 #7

arodionov53 commented Mar 20, 2024

vs-ads commented Mar 21, 2024

arodionov53 commented Mar 21, 2024 •

edited

Loading

Spike/dega 1748 #7

Spike/dega 1748 #7

Conversation

arodionov53 commented Mar 20, 2024

vs-ads commented Mar 21, 2024

arodionov53 commented Mar 21, 2024 • edited Loading

arodionov53 commented Mar 21, 2024 •

edited

Loading