696 update dagster op for materialize alldocs #817
…on_referential_integrity notebook. Added necessary utils to util.py to support this.
Hi @mkshah605, thanks for implementing this. I have a question about something in the diff (CC: @dwinston):

    slots_to_include = ["id", "type"] + document_reference_ranged_slots[doc_type]
    new_doc = keyfilter(lambda slot: slot in slots_to_include, doc)
    new_doc["_type_and_ancestors"] = schema_view.class_ancestors(doc_type)

Will the documents in the |
The latter. I verified that the current direct usage of |
Thanks for confirming, @dwinston. I will update my mental model (and the one-liner description of it that I have ended up giving during various meetings) accordingly.
@dwinston - we should be careful when changing fields in all-docs, since other parts of the API depend on it. Let's do a sanity check during one of the sync or squad meetings if there is a potentially breaking change to the all-docs format or interface.
Gotcha. I'm glad we have a sanity-checking "happy path" automated test for |
In this branch, I updated materialize_alldocs in the dagster file.
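For readers following the discussion, here is a minimal sketch of what the slot filtering quoted from the diff appears to do to a single document. It is not the PR's actual code. `keyfilter` is re-implemented locally as a stand-in for the `toolz` function of the same name, `class_ancestors` is a plain dict standing in for `schema_view.class_ancestors`, and the sample document, slot names, and ancestor list are all hypothetical.

```python
def keyfilter(predicate, d):
    """Local stand-in for toolz.keyfilter: keep only items whose key passes predicate."""
    return {key: value for key, value in d.items() if predicate(key)}


# Hypothetical lookup tables; in the PR these are derived from the schema.
document_reference_ranged_slots = {"Biosample": ["part_of"]}
class_ancestors = {"Biosample": ["Biosample", "MaterialEntity", "NamedThing"]}


def to_alldocs_entry(doc, doc_type):
    """Reduce a document to id, type, and its document-referencing slots."""
    slots_to_include = ["id", "type"] + document_reference_ranged_slots[doc_type]
    new_doc = keyfilter(lambda slot: slot in slots_to_include, doc)
    # Record the class together with its ancestors, as the quoted diff does.
    new_doc["_type_and_ancestors"] = class_ancestors[doc_type]
    return new_doc


# Hypothetical source document with one slot ("name") that is not retained.
doc = {
    "id": "example:bsm-1",
    "type": "example:Biosample",
    "name": "sample A",
    "part_of": ["example:sty-1"],
}
entry = to_alldocs_entry(doc, "Biosample")
print(entry)
```

Under these assumptions, the resulting entry keeps only `id`, `type`, and `part_of`, plus the added `_type_and_ancestors` field. The `name` slot is dropped.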
Details
In this PR, I:
… pick function to use keyfilter directly

Related issue(s)
… alldocs collection in different ways #696

Related subsystem(s)
… docs directory)

Testing
I tested these changes by making sure the ensure_alldocs job runs in my local dagster environment:

Documentation
… docs directory)

Maintainability
… study_id: str)
… # TODO or # FIXME
… black to format all the Python files I created/modified