Add a feature to collapse structs or the output data #685

yruslan · 2024-06-04T11:54:27Z

Background

Currently, we have 2 options for schema transformation:

.option("schema_retention_policy", "keep_original") 
.option("schema_retention_policy", "collapse_root")

Field names in mainframe copybooks are usually unique, even if they are part of nested structs. Cobrix can remove all nesting until an array or a primitive is encountered.

Feature

Add a feature to collapse structs or the output data.

Example [Optional]

A simple example if applicable.

Proposed Solution [Optional]

Solution Ideas

Add a new option

.option("schema_retention_policy", "collapse_struct")

that unstructs on-fly.
OR

Add a method to SparkUtils that unstructs as a post-processing.

The text was updated successfully, but these errors were encountered:

This is similar to flattening, but does not flatten arrays, and it is more efficient.

yruslan added the enhancement New feature or request label Jun 4, 2024

yruslan added a commit that referenced this issue Jun 5, 2024

#685 Add methods for unstructing schemas and dataframes.

0903a48

This is similar to flattening, but does not flatten arrays, and it is more efficient.

yruslan added a commit that referenced this issue Jun 5, 2024

#685 Add methods for unstructing schemas and dataframes.

3d3076f

This is similar to flattening, but does not flatten arrays, and it is more efficient.

yruslan added a commit that referenced this issue Jun 5, 2024

#685 Add methods for unstructing schemas and dataframes.

061702c

This is similar to flattening, but does not flatten arrays, and it is more efficient.

yruslan mentioned this issue Jun 6, 2024

#685 Add methods for unstructing schemas and dataframes. #687

Merged

yruslan closed this as completed in #687 Jun 6, 2024

yruslan added a commit that referenced this issue Jun 6, 2024

#685 Add methods for unstructing schemas and dataframes.

6d1c729

This is similar to flattening, but does not flatten arrays, and it is more efficient.

yruslan mentioned this issue Jun 7, 2024

Release Cobrix v2.7.2 #688

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a feature to collapse structs or the output data #685

Add a feature to collapse structs or the output data #685

yruslan commented Jun 4, 2024

Add a feature to collapse structs or the output data #685

Add a feature to collapse structs or the output data #685

Comments

yruslan commented Jun 4, 2024

Background

Feature

Example [Optional]

Proposed Solution [Optional]