-
Notifications
You must be signed in to change notification settings - Fork 413
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to read table using VM Managed Identity on Azure #1462
Comments
Thanks for reporting @jhoekx! Absolutely we can. Just to clarify, were you planning to contribute that, or hoping we set that up? :) Either case is fine, as I was looking to do that quite soon anyhow. In the former case tough one word of caution - which you may have figured out already - in the 0.6 release the direct support |
This brings object_store 0.6, which fixes delta-io#1462. Tokio needs to be updated, because DataFusion is using JoinSet::spawn_blocking now.
I got started updating to the current DataFusion master, but got stuck in
I guess because This is outside of my current Rust knowledge. Curious to see how one tackles that. If you have a clear pointer, I can try, otherwise I hope this was at least a bit helpful.
|
Does anyone know when the next python release might be? I might build a wheel from |
@jhoekx In your initial comment you said you called I've been trying to figure out what the
|
You don't need to set any |
Yes, although it has always been a bit troublesome and required reading the source of this library and the Not sure which version of the Delta Lake Python library you're using, but for 0.5.5, we had this guidance in our documentation:
We did not have much luck with newer versions (tested 0.8.1/0.10). Current master should be great as mentioned in the comments here. I intend to verify that this week.
In my situation it always fetches the token from
Based on those two things you can know if you need to look deeper into this library or in your infrastructure. |
@Tom-Newton thank you for your reply and if this is a bug then I wasted a lot of time trying to make this work! But I do still wonder: if no @jhoekx thank you for the information and the good suggestions to debug this. |
Azure managed identity is all about configuring the environment (I'm not talking about environment variables. I don't really know how it works but azure does something a bit more complicated than that) so that code that runs there can authenticate completely automatically without needing to provide any credentials. |
Since reading using a VM Managed Identity works since Python deltalake 0.10.1, this issue can be closed. |
Environment
Delta-rs version: 0.10
Binding: Python
Environment:
Bug
What happened:
What you expected to happen:
The Delta Table can be opened.
How to reproduce it:
Create a VM on Azure with a Managed Identity
Assign the
Storage Blob Data Reader
role to the Identity in the container that contains the Delta TableCreate a virtualenv on the VM, try to open the table.
More details:
This was fixed in
object_store
already: apache/arrow-rs@2ec8571 . An incorrect resource was requested.The fixed version of
object_store
is already in DataFusion and should be in the 27.0 release. If not already planned, we could try to have a branch to prepare for an update to 27.0 that works with the current datafusion master?Reading the table using a SAS key works. Building the
deltalake
wheel with a patchedobject_store
with the commit above also works.The text was updated successfully, but these errors were encountered: