-
Notifications
You must be signed in to change notification settings - Fork 534
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[DISCUSSION]: move sqlparser to Apache (DataFusion) governance #1294
Comments
Thank you for restarting this conversation @alamb. I am also biased as the original author of sqlparser-rs and DataFusion, but I do think the project will have more success under ASF governance, so I am in favor of this proposal. |
Hello! I'm not against it, and would love to be added as a maintainer under the asf governance, if we need some balance to avoid too much bias towards datafusion. sqlparser-rs is a crucial component of SQLPage, and I'll keep making pull request either way. |
I am in support of this. |
Yes, thank you @lovasoa -- I would expect to discuss adding committers to DataFusion who focused on sqlparser-rs (as we have committers focused on another subproject, comet, and we had committers focused on datafusion when it was part of arrow) |
I just realized I never actually responded to #1243. So: I'm happy to participate as a maintainer of |
My preference would be to keep the project independent but I'm not able to commit much time to it and it's admittedly a preference mostly based on the idea of how things might appear were it to be moved under the DataFusion project. |
I like how @maxcountryman put it, but I believe the decision must be done by those maintaining the project. I will take the opportunity to express my gratitude to @alamb for keeping the project going for as long as he has. Thanks for doing this, Andrew! |
You have my full support. I would love to be involved in any way I can. |
@alamb I think this is a very good idea. One question though: If a fork is created now, it is just of the In looking into this further I see the workspace is defined in the datafusion/Cargo.toml workspace. So it looks like someone will need a fork of the whole DataFusion project to make something work in any case. This should be manageable. Unless advised otherwise that's what I will do. Regardless I am fully supportive of the idea of moving |
No I would expect that sqlparser-rs remains in its own repository (just the github organization would be different) |
@andygrove has started a vote thread here: https://lists.apache.org/thread/7tnq1v0dg2f2xw8c7611bk8mw8lz5dff |
I am in favor of this proposal |
We have started the ip clearance process: https://docs.google.com/document/d/1XPp1zG8yfkCfjusQH7_qr8FMBDDEQ5psR7HhZ1LuVC8/edit#heading=h.nlygv2z3fdg2 Mailing list thread: https://lists.apache.org/thread/1s1qv6gxy0ltzrqq9skxt7lomlq9w4gj |
I think moving sqlparser-rs into datafusion repo will get more visibility and seems reasonable. Not very sure separate repo can substantially improve the current situation. But we can do this in followup action. |
@iffyio @lovasoa and @jmhain -- we are trying to move this process along and part of doing so is to get documentation in place for the Apache Software Foundation that all the code in the repo is ok from an ip clearance perspective. As the most active contributors recently, would you be willing to submit an "Individual Contributor Licence Agreement" (ICLA )to the ASF? The document can be found here: https://www.apache.org/licenses/contributor-agreements.html This document re-affirms that you have donated all code under the Apache Software License, and is the first step in becoming a committer on the project. There is more details in @andygrove 's doc here: |
@alamb On it. |
@alamb I've signed the ICLA |
@iffyio has submitted an ICLA |
@andygrove what are the next steps here ? Is there anything i can do to help the ip clearance form (e.g. make ticket and @ mention the various sqlparser cotributors)? Help submit the ip clearance form? |
@alamb The only blocker has been waiting for the ICLAs to be filed. The next step is converting this doc to XML and creating a PR (IIRC). Out of the original active contributors, only @lovasoa is unable to complete an ICLA at this time (they are too busy, which perhaps means they are no longer an active contributor?). I don't know if there are new active contributors since this process started. I will need to check. |
@alamb @andygrove I have submitted iCLA to ASF for another project(Apache Kvrocks), so it should be good to count me(apache id: hulk) as recorded if requested iCLA for all contributors. |
I sent my completed ICLA :) |
Just sent my completed ICLA as well. |
Thanks, everyone. I will start on the next step of this process in the next day or two. |
Relevant mailing list discussion: https://lists.apache.org/thread/vv279vtm6q7cfnbgwx4dlm7yj9j93ly4 New repository is here: https://github.com/apache/datafusion-sqlparser-rs My planned next step (if no one beats me to it) is to create a PR with the code (and all the commits) from sqlarser-rs into the new repository: Some other things (I will file tickets to orgnaize this):
|
Why not migrating the existing repository instead of creating a new one ? This would keep the issues, pull requests, etc... |
🤔 that is a good idea. The trick is to figure out if the apache (destination organization) can accept such transfers. I will open a ticket to find out |
Yes, it can. I did migrate another project into ASF: https://issues.apache.org/jira/browse/INFRA-25024, hope this would help. |
Thanks @git-hulk and @lovasoa -- I have filed https://issues.apache.org/jira/browse/INFRA-26136 to request the transfer. |
Here is the XML form for the IP clearance process: As part of this process I would like to remind all contributors that they are responsible for ensuring that a Corporate CLA is recorded if such is required to authorize their contributions under their individual CLA |
HTML form: https://incubator.apache.org/ip-clearance/datafusion-sqlparser.html We need to do a couple more things before we can start the IP clearance vote:
|
Don't we need to complete the IP clearance process before moving the code into the new repo? |
We certainly need to complete the IP clearance process before making a new release. I am not sure about moving into the apache github organization I will create a PR later today to add Apache License headers to the files in sqlparser-rs |
Filed #1436 to track migration issues |
The PR is here: #1437 |
The IP clearance vote has passed. Thanks to everyone who helped with this process! https://lists.apache.org/thread/4qhyk23d0g01g3ccsffsxygwd6vj9b8k |
I see that the repo has now moved 🎉 |
I plan to review / update #1436 tomorrow with next steps |
Now that the repo has moved and the project has been accepted, let's track the transition work in #1436. Gere is the proposed plan going forward: Comments more than welcome |
@alamb as maintainer of other OSS project that uses sqlparser-rs (OpenLineage) just wanted to thank you for the continued maintanence, support and merging a few of our PRs :) |
@mobuchowski -- thank you for the kind words The last few months @iffyio has really been the key maintainer. 🙏 Thank you! |
(disclaimer: I am biased being the one who merges sqlparser prs and also am the Apache DataFusion PMC chair)
Problem Statement
sqlparser seems to have become the defacto sql parsing library in Rust (5.5M downloads at the time of this writing) 🎉
However the sqlparser-rs project doesn't have sufficient maintainer capacity. I (@alamb) do enough to keep it from going entirely dormant, but that is really not sufficient for a healthy project.
Here are the specific problems:
Challenges with current governance structure (or lack thereof)
Past discussions:
When DataFusion was part of the Apache Arrow project, we didn't have the correct space to bring SQL parser at that time
Now that DataFusion is its own top level project (with @andygrove and myself on the PMC) there is a natural space to do thos
Specific Proposal:
apache/datafusion-sqlparser
apache/datafuson-sqlparser
Benefits of ASF governance;
Drawbacks
There is plenty of experience with the ASF release process in DataFusion so I don't think that is a major hurdle. I also think DataFusion in general and sqlparser in particular has a long history of accepting features that benefit all users not just maintainers, so I am not worried about this either (but I am of course biased)
cc @Dandandan @tobyhede @andygrove @maxcountryman @nickolay
The text was updated successfully, but these errors were encountered: