OpenLineage integration #6644
Replies: 2 comments
-
Hi @yonil7 Could you please elaborate? I'm not really familiar with OpenLineage. How do you see the integration, btw? |
Beta Was this translation helpful? Give feedback.
-
OpenLineage is a new open standard for lineage metadata collection. (inspired by opentelemetry - standard for telemetry data (metrics, logs, and traces) - https://openlineage.io/blog/openlineage-takes-inspiration-from-opentelemetry/) OpenLineage spec is a definition of a single json object - On top of this json schema definition, OpenLineage defines HTTP API and provide a python/java client library implementation for using this (very simple) API On the other hand, OpenLineage provides integrations with pipelines/workflows frameworks (Airflow, Spark, dbt and other frameworks are on the roadmap). There are currently 2 systems that implements the OpenLineage HTTP API that I know of: Some ways dvc can be involved in this initiative:
|
Beta Was this translation helpful? Give feedback.
-
now that OpenLineage is formally released and is part of LF AI & Data it would be nice to see some kind of integration between it and dvc
Beta Was this translation helpful? Give feedback.
All reactions