Тёмный

Real Time Streaming with Azure Databricks and Event Hubs 

Pathfinder Analytics
Подписаться 10 тыс.
Просмотров 18 тыс.
50% 1

🚀Join me in this tutorial as we build a real-time analytics streaming solution using Azure Databricks and Event Hubs.
⌚Timestamps:
Project Overview: 00:00:00
Solution Architecture: 00:00:44
Event Hubs Overview and Implementation: 00:04:11
Databricks Cluster and Library Set Up: 00:12:32
Importing the Project Notebook: 00:16:35
Initiating the Stream and Processing the Bronze Layer: 00:18:20
Processing the Silver Layer: 00:33:08
Processing the Gold Layer: 00:43:51
Near Real-Time Power BI Report: 00:54:43
Project Debrief: 00:59:41
🔥Project Overview:
We'll start by creating real time data in Azure Event Hubs and then store and process the data in our Databricks Lakehouse, implementing the Bronze, Silver, and Gold layers of the Medallion Architecture. The final step involves creating a near real-time report on Power BI, showcasing the power of streaming analytics.
Links and Resources
🔗 GitHub: github.com/malvik01/Real-Time...
🔗 Spark Structured Streaming API: spark.apache.org/docs/latest/...
🔗 Watermarking: www.databricks.com/blog/featu...
🔗 Check Out My Udemy Courses: www.pathfinder-analytics.net/...
📺 Don't forget to hit the like button, subscribe, and turn on notifications to stay updated with our latest tutorials and tech insights!
📬 For more information, queries, or feedback, feel free to drop a comment below.
🔖 Tags
azure data engineering, end-to-end azure data engineering project, azure data engineering project free
Hashtags
#azure #dataengineering #databricks #DataEngineeringProject #RealTimeDataEngineering #EventHubsTutorial #BigDataSolutions #CloudDataEngineering #DataScienceProject #AzureStreaming #AdvancedDataEngineering

Наука

Опубликовано:

 

10 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 20   
@RajeshRRajamani
@RajeshRRajamani 28 дней назад
Precise , Detailed and to the point . Thanks
@suniguha
@suniguha 3 месяца назад
Excellent presentation. Thank You.
@sraoarjun
@sraoarjun 3 месяца назад
You make perfect videos and this is really quality content !!!
@shubhampawade2933
@shubhampawade2933 2 месяца назад
Lovely! thanks a ton. You've earned a subscriber.
@Hitesh939
@Hitesh939 3 месяца назад
Awesome video Bhai :)
@ranjansrivastava9256
@ranjansrivastava9256 6 месяцев назад
Great Video !!! Could you please help us to understand on Tumbling window and Watermark concept while reading and writing real time data. If possible.
@ranjansrivastava9256
@ranjansrivastava9256 6 месяцев назад
Dear , you have used the the protocol AQMP, should it will be AMQP? Kindly clarify on that.
@monalisachatterjee7222
@monalisachatterjee7222 7 месяцев назад
its gem of a knowledge. I was searching for it. finally found one. just one question how to do similar thing if we dont have unity catalog
@pathfinder-analytics
@pathfinder-analytics 6 месяцев назад
Thank you! Without Unity Catalog you will need to use the Hive Metastore and Service Principals / Mount Points
@vemedia5850
@vemedia5850 5 месяцев назад
how is this done for this project? as also don't have unity catalog under a student account.@@pathfinder-analytics
@neelred10
@neelred10 4 месяца назад
Is eventhub necessary ? I think we can use autoloader directly to connect to Kafka . Please let me know if there are any limitations that would warrant use of event-hub in between
@SudarshanThakurIRONPULLER
@SudarshanThakurIRONPULLER 2 месяца назад
Very useful .Can you also explain how we can compress and send data to event hub and read in spark ?
@user-ce9tn6qh5r
@user-ce9tn6qh5r 2 месяца назад
does eventhub has to be in the same resource group as storage account?
@satwikkumar-eq6fm
@satwikkumar-eq6fm 3 месяца назад
Hi, while creating the cluster I'm unable to add unity catalog? Please help
@Ramakrishna410
@Ramakrishna410 3 месяца назад
How to capture batchid as newcolumn and write it to adls. Plz help me
@Ramakrishna410
@Ramakrishna410 2 месяца назад
How to trigger my pipeline when any new message reached to eventhub
@desifood1895
@desifood1895 5 месяцев назад
Did u use Autoloader here ...
@a2zhi976
@a2zhi976 4 месяца назад
can you please do similar video with Synpase spark cluster instead of databricks
@abhishekkumartiwari1207
@abhishekkumartiwari1207 6 месяцев назад
How to stop the job automatically
@user-ce9tn6qh5r
@user-ce9tn6qh5r 2 месяца назад
Hi, i mounted my container correctly, but the df.writestream give the following error for checkpointlocation, what should I check? nalysisException: [RequestId=0b4e8597-1d71-483d-9447-ab4e6c3d9470 ErrorClass=INVALID_STATE.UC_CLOUD_STORAGE_ACCESS_FAILURE] Failed to access cloud storage: [AbfsRestOperationException] error code: UNKNOWN, status code: -1 exceptionTraceId=de24bf5b-e34c-41a8-8855-710ae71aabd2 thank you.
Далее
Databricks Extension for VS Code: A Hands-On Tutorial
16:12
Дарю Самокат Скейтеру !
00:42
Просмотров 1,2 млн
Nurse's dream !! 😂😂
00:17
Просмотров 2,3 млн
Why I Quit the Scrum Alliance
7:58
Просмотров 10 тыс.
AZ-204 - Connect and Consume Services - Azure Event Hubs
1:02:21
Azure Event Hubs Deep Dive (Azure & AI Conference 2022)
1:03:03
Azure Event Hubs for Apache Kafka | Azure Friday
16:04
Streaming Data Analytics with Power BI and Databricks
37:58
Подключил AirPods к Xbox
0:45
Просмотров 27 тыс.