Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[C++][Parquet] Add JSON canonical extension type #32538

Closed
asfimport opened this issue Jul 29, 2022 · 3 comments · Fixed by #13901
Closed

[C++][Parquet] Add JSON canonical extension type #32538

asfimport opened this issue Jul 29, 2022 · 3 comments · Fixed by #13901

Comments

@asfimport
Copy link
Collaborator

asfimport commented Jul 29, 2022

As a BigQuery developer, I would like the Arrow libraries to support the JSON logical Type. This would enable us to use the JSON type in the Arrow format of our ReadAPI. This would also enable us to use the JSON type to export data from BigQuery to Parquet.

Reporter: Pradeep Gollakota
Watchers: Rok Mihevc / @rok

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-17255. Please see the migration documentation for further details.

@asfimport
Copy link
Collaborator Author

David Li / @lidavidm:
Hey - I made a guess at the components, but you may want to follow up on the mailing list ([email protected]) with some more details (e.g. what languages you want to support, at least initially, and any capabilities such an extension type would have, beyond just wrapping a string). There have been other such discussions on 'common' extension types like UUIDs.

@asfimport
Copy link
Collaborator Author

@rok rok changed the title Support JSON logical type in Arrow [C++][Parquet] Add JSON canonical extension type Jun 2, 2024
@rok rok linked a pull request Jun 2, 2024 that will close this issue
@raulcd raulcd added this to the 18.0.0 milestone Jun 28, 2024
pitrou added a commit that referenced this issue Sep 11, 2024
Arrow now provides a canonical extension type for JSON data. This
extension is backed by utf8(). Parquet will recognize this extension
and appropriately propagate the LogicalType to the storage format.
* GitHub Issue: #32538

Lead-authored-by: Rok Mihevc <[email protected]>
Co-authored-by: Pradeep Gollakota <[email protected]>
Co-authored-by: Antoine Pitrou <[email protected]>
Co-authored-by: mwish <[email protected]>
Co-authored-by: Antoine Pitrou <[email protected]>
Signed-off-by: Antoine Pitrou <[email protected]>
@pitrou
Copy link
Member

pitrou commented Sep 11, 2024

Issue resolved by pull request 13901
#13901

khwilson pushed a commit to khwilson/arrow that referenced this issue Sep 14, 2024
…che#13901)

Arrow now provides a canonical extension type for JSON data. This
extension is backed by utf8(). Parquet will recognize this extension
and appropriately propagate the LogicalType to the storage format.
* GitHub Issue: apache#32538

Lead-authored-by: Rok Mihevc <[email protected]>
Co-authored-by: Pradeep Gollakota <[email protected]>
Co-authored-by: Antoine Pitrou <[email protected]>
Co-authored-by: mwish <[email protected]>
Co-authored-by: Antoine Pitrou <[email protected]>
Signed-off-by: Antoine Pitrou <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants