-
Notifications
You must be signed in to change notification settings - Fork 14.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
XComObjectStorageBackend returns the S3 path during deserialization instead of the data #39602
Comments
Thanks for opening your first issue here! Be sure to follow the issue template! If you are willing to raise PR to address this issue please do so, no need to wait for approval. |
cc @bolkedebruin I haven't looked at the code, but was expecting that during deserialization this works out of the box. |
cc @uranusjr airflow/airflow/providers/common/io/xcom/backend.py Lines 151 to 165 in f57de6c
|
I seem to recall @TJaniF has a very similar issue a while ago. I don’t remember the details but it was some sort of configuration issues for that instance. And if it is indeed not a bug in Airflow logic, we should try to detect the configuration issue and raise it better to the user, instead of returning a wrong value. |
Yes, I had the same issue and as far as I am aware this PR fixes it: #39313 so it should be fixed in 2.9.2 :) |
Good to hear Astronomer is now also in the time machine bussiness. |
Well thanks. That was quick :-). |
Note: @Uture I don't think the fix will do this for past xcom values. You will need to regenerate those. |
Great, thank you all for resolving this so quickly. |
Apache Airflow version
2.9.1
If "Other Airflow 2 version" selected, which one?
No response
What happened?
After configuring the object storage as XCom backend, the serialization works fine above the specified threshold, but once another task consumes the previously stored XCom, the deserialization doesn't seem to work. Instead of the deserialized data, the path of the object is returned.
What you think should happen instead?
The stored object should be deserialized and returned to the downstream task.
How to reproduce
Operating System
apache/airflow:2.9.0-python3.11
Versions of Apache Airflow Providers
No response
Deployment
Docker-Compose
Deployment details
No response
Anything else?
No response
Are you willing to submit PR?
Code of Conduct
The text was updated successfully, but these errors were encountered: