-
Notifications
You must be signed in to change notification settings - Fork 596
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support geospatial data types during cdc replication #19519
Comments
How about ingesting these geo data types as strings? As far as I know, Iceberg or Parquet doesn't have official support for geospatial data yet, so I am wondering what format will be used in the downstream Iceberg in your case. The full support #16829 requires more effort, so it would be nice if there's a quick workaround. |
Hey so we currently use debezium and debezium automatically converts such values into wkb json object which works perfectly fine with iceberg tables. We have this running in production right now and it's working with zero issues something similar would be amazing. Strings work as well, we can translate them back elsewhere maybe? |
https://debezium.io/blog/2018/01/25/debezium-0-7-2-released/ The geometry columns are exported as structs as mentioned here should be easy enough to sink? |
I see. It sounds good to ingest the What do you think? cc. @StrikeW |
By the way, we rely on sea-schema to backfill data from upstream database. It doesn't support PostGIS's data types ( Update: Created an issue SeaQL/sea-schema#140 |
Is this only for the select * syntax? if so were happy to define the schema manually for the geospatial tables Also our case is specifically MySql only and it looks like they are implemented MySql geospatial types |
The challenge lies in the backfill part, we may need to decode the bytes to |
True. Good news is we are not blocked by them. |
Is your feature request related to a problem? Please describe.
We use geometric data types like point,polygon, lat long in our systems. Currently tables that contain this data can't be ingested using the select * statement in risingwave as the entire table is rejected automatically
Describe the solution you'd like
Support ingesting geospatial data types from source DB into risingwave and sinking them to downstream system like iceberg for further processing.
Describe alternatives you've considered
Currently there is no alternative as there's no workaround in risingwave
Additional context
Geospatial functions aren't as important as the ability to stream this data into the system and ingest into the lake house. Further processing can be done downstream by other engines like duckdb etc
The text was updated successfully, but these errors were encountered: