Тёмный

Data enrichment patterns with Apache Flink 

The ASF
Подписаться 18 тыс.
Просмотров 1
50% 1

Data enrichment is a critical step in stream processing. Real-time enrichment of streaming data with contextual information adds missing information, improves accuracy, increases trustworthiness, and facilitates better decision-making. Contextual data can be static or dynamic and obtained in various ways - APIs, databases, files and even as a stream. While there are multiple design patterns to perform data enrichment, it is not always obvious when one pattern is preferred over the other.
In this session we will show common enrichment patterns for streaming data, and how to implement them using Apache Flink. We will cover patterns in scenarios where reference data is static, available through external APIs, or available as a change data stream. We will dive into internal details about Flink state and how it stores reference data.
You will walk away with clear understanding of how to use these patterns in your architecture. You can make an informed decision about how stream data enrichment can meet the throughput and latency goals of your use case.
Subham Rakshit
Streaming Solution Architect, AWS
Subham Rakshit is a Streaming Solutions Architect for Analytics at AWS based in the UK. He works with customers to design and build streaming architectures so that they can get value from analysing their streaming data. His two little daughters keep her occupied most of the time outside work and loves solving jigsaw puzzles with them.

Опубликовано:

 

1 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии    
Далее
Cloud Design Patterns
3:40:10
Просмотров 910
Apache Spark Executor Tuning | Executor Cores & Memory
44:35