Use `Vector` instead of `List` in transform methods #62

pondzix · 2024-03-13T19:50:45Z

Currently, the transform functions in common-loaders return a List. We construct the list by concatenating two lists: 1 for atomic fields and 1 for the self-describing entity fields (this line).

The ::: operator is relatively expensive, especially for large lists. And this is a “hot” function which we invoke for every single event.

If we switch the implementation to use Vector instead of List then it should be much more efficient to concatenate the two lists.

Currently, the transform functions in common-loaders return a [List](https://github.com/snowplow-incubator/common-streams/blob/0.2.1/modules/loaders-common/src/main/scala/com/snowplowanalytics/snowplow/loaders/transform/Transform.scala#L83). We construct the list by concatenating two lists: 1 for atomic fields and 1 for the self-describing entity fields ([this line](https://github.com/snowplow-incubator/common-streams/blob/0.2.1/modules/loaders-common/src/main/scala/com/snowplowanalytics/snowplow/loaders/transform/Transform.scala#L86)). The ::: operator is relatively expensive, especially for large lists. And this is a “hot” function which we invoke for every single event. If we switch the implementation to use Vector instead of List then it should be much more efficient to concatenate the two lists.

In #62 the transform method was improved to avoid the expensive `:::` operator. But we were still converting a `List` to a `Vector` for every single event. That conversion can be eliminated fairly easily.

istreeter approved these changes Mar 18, 2024

View reviewed changes

pondzix force-pushed the use_vector_in_transform branch from 7bcc80f to ff178b6 Compare March 18, 2024 09:03

pondzix merged commit ff178b6 into develop Mar 18, 2024
1 check passed

istreeter deleted the use_vector_in_transform branch March 18, 2024 23:25

istreeter mentioned this pull request Mar 19, 2024

Use Vector instead of List in transform methods: part 2 #66

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use `Vector` instead of `List` in transform methods #62

Use `Vector` instead of `List` in transform methods #62

pondzix commented Mar 13, 2024 •

edited

Loading

Use Vector instead of List in transform methods #62

Use Vector instead of List in transform methods #62

Conversation

pondzix commented Mar 13, 2024 • edited Loading

Use `Vector` instead of `List` in transform methods #62

Use `Vector` instead of `List` in transform methods #62

pondzix commented Mar 13, 2024 •

edited

Loading