Destination S3, S3-Glue: Add option to convert objects to strings #27171
Labels
area/connectors
Connector related issues
community
connectors/destination/s3-glue
connectors/destination/s3
connectors/destinations-database
frozen
Not being actively worked on
team/destinations
Destinations team's backlog
type/enhancement
New feature or request
What area the feature impact?
Connectors
Revelant Information
Description/Context
Airbyte has a representation of data types that all source connectors map their own types into, and which the destination connectors need to map from into their own types (reference). Nested objects in the source data are represented as an
Object
.The S3 and S3-Glue connectors are relying on the upstream Apache Hive JsonSerDe library to handle serialization of JSON formatted data. By default, items being deserialized are expected to be wrapped in Hadoop Writable objects and objects being serialized are expected to be Java primitive objects.
The issue is that when nested documents are being serialized as an
object
, the data is escaped incorrectly and the resulting record in the Hive table is always{
. We need to create a configurable option that enables the S3 and S3-Glue destination connectors to serialize any root levelobject
as astring
(including _airbyte_data).Plan/Design
object
types asobject
( Options should beDefault
orStringify
)The text was updated successfully, but these errors were encountered: