-
Notifications
You must be signed in to change notification settings - Fork 594
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
discussion: Support creating lakehouse catalog in risingwave. #8603
Comments
After DDL is supported, do you plan to support reading and writing to the |
The reason why RisingWave supports Iceberg is to fit into users' existing tech stack where they have used Spark and Iceberg already. Iceberg is not a out-of-the-box system that you can use without any other dependencies. It's still required a separate process to run compaction, serve ad-hoc queries, and to manage catalogs. It will be non-trivial to build a management system for Iceberg. Therefore, from the product's perspective, I would view lakehouse as a completely distinct product line that RisingWave should only provide minimal functions to integrate, instead of combining them together. Furthermore, you also need to consider what if the user's Iceberg is hosted by Tabular, Dremio, or any full-fledged lakehouse. It will then be unnecessary to create the lake in RisingWave. |
I think reading without optimization will not require much effort and is worth doing. DML statement maybe complicated to implement, so I don't think we should do it. |
Agreed. However, our goal is not to build a fully managed solution for Iceberg. Rather, we want to make RisingWave easier to use and experiment with. Adding support for DDL statements will simplify integration tests, as it eliminates the need to download other systems like Spark, Flink, or Hadoop to create a catalog for us. This will also make it easier for beginners who want to try out RisingWave with Iceberg. This is similar to other DML statements in our system, which are not intended for production use, but rather make tests and experimentation easier. cc @neverchanje |
@StrikeW's comment is also my major concern. If we support these |
This issue has been open for 60 days with no activity. Could you please update the status? Feel free to continue discussion or close as not planned. |
Is your feature request related to a problem? Please describe.
Currently, before risingwave can sink to iceberg, we need to rely on some external system(flink, spark, etc) to create table for us. There are two drawbacks with this problem:
Describe the solution you'd like
Support creating iceberg catalog in risingwave as following:
With this enabled, user can ingesting our solution to iceberg in one shot without any other dependencies.
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: