Тёмный

Cross-Platform Data Lineage with OpenLineage 

Databricks
Подписаться 113 тыс.
Просмотров 5 тыс.
50% 1

There are more data tools available than ever before, and it is easier to build a pipeline than it has ever been. These tools and advancements have created an explosion of innovation, resulting in data within today's organizations becoming increasingly distributed and can't be contained within a single brain, a single team, or a single platform. Data lineage can help by tracing the relationships between datasets and providing a map of your entire data universe.
OpenLineage provides a standard for lineage collection that spans multiple platforms, including Apache Airflow, Apache Spark™, Flink®, and dbt. This empowers teams to diagnose and address widespread data quality and efficiency issues in real time. In this session, we will show how to trace data lineage across Apache Spark and Apache Airflow. There will be a walk-through of the OpenLineage architecture and a live demo of a running pipeline with real-time data lineage.
Talk by: Julien Le Dem,Willy Lulciuc
Here’s more to explore:
Data, Analytics, and AI Governance: dbricks.co/44gu3YU
Connect with us: Website: databricks.com
Twitter: / databricks
LinkedIn: / databricks
Instagram: / databricksinc
Facebook: / databricksinc

Наука

Опубликовано:

 

26 июл 2023

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии    
Далее
Streaming Data Analytics with Power BI and Databricks
37:58
Connecting the Dots with DataHub: Lakehouse and Beyond
35:43
Редакция. News: 128-я неделя
57:38
Просмотров 762 тыс.
Data Lineage with OpenLineage and Airflow
50:44
Просмотров 9 тыс.
What is Data Lineage?
5:38
Просмотров 13 тыс.
OpenMetadata Overview
9:30
Просмотров 34 тыс.
Observability for Data Pipelines With OpenLineage
23:38
Проверил, как вам?
0:58
Просмотров 283 тыс.
iPhone 16 - 20+ КРУТЫХ ИЗМЕНЕНИЙ
5:20
Просмотров 100 тыс.