Тёмный

Row Format vs Column Format | Why Parquet is better than Avro | Why Columnar formats are preferred 

Learning Journal
Подписаться 74 тыс.
Просмотров 11 тыс.
50% 1

Learn more at www.scholarnest.com/
Best place to learn Data engineering, Bigdata, Apache Spark, Databricks, Apache Kafka, Confluent Cloud, AWS Cloud Computing, Azure Cloud, Google Cloud - Self-paced, Instructor-led, Certification courses, and practice tests.
========================================================
SPARK COURSES
-----------------------------
www.scholarnest.com/courses/s...
www.scholarnest.com/courses/s...
www.scholarnest.com/courses/s...
www.scholarnest.com/courses/s...
www.scholarnest.com/courses/d...
KAFKA COURSES
--------------------------------
www.scholarnest.com/courses/a...
www.scholarnest.com/courses/k...
www.scholarnest.com/courses/s...
AWS CLOUD
------------------------
www.scholarnest.com/courses/a...
www.scholarnest.com/courses/a...
PYTHON
------------------
www.scholarnest.com/courses/p...
========================================
We are also available on Udemy Platform
Check out the below link for our Courses on Udemy
www.learningjournal.guru/cour...
=======================================
You can also find us on Oreilly Learning
www.oreilly.com/library/view/...
www.oreilly.com/videos/apache...
www.oreilly.com/videos/kafka-...
==============================
Follow us on Social Media
/ scholarnest
/ scholarnesttechnologies
/ scholarnest
/ scholarnest
github.com/ScholarNest
github.com/learningJournal/
========================================

Наука

Опубликовано:

 

12 ноя 2022

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 9   
@SanjayKumar-rw2gj
@SanjayKumar-rw2gj 10 дней назад
Great explanation, to the point no exaggeration. Thanks for the video
@PANKAJKUMAR-fe8zn
@PANKAJKUMAR-fe8zn 20 дней назад
Wonderful explanation. I was studying data cloud in salesforce and they were mentioning this data format multiple time. I was clueless but I got clarity from your video. Thank you sir
@cheluveshab9525
@cheluveshab9525 Год назад
Pleasure do make a video on compression techniques
@MrSravan84
@MrSravan84 Год назад
Very nicely explained. But @8:40 you mentioned that the column 2 can go in the different same block or different block and @11:29 you mentioned that Spark knows that column 2 is stored in Block-2. These 2 statements are sort of causing confusion. i.e., if a column of each row can be spread across multiple blocks how does Spark know which block to search ?
@nindersingh
@nindersingh Год назад
In Block 1 R3C3 is mentioned as wrong 🚫, this must be R2C3. Because R3C3 is coming in Block 2 as expected.
@sumanthb3280
@sumanthb3280 Год назад
So, why is Avro used in some projects?
@sumitnekar8965
@sumitnekar8965 Год назад
One scenario i can think of,Avro over plain json offers benefits like schema evolution which can be beneficial in case of multiple producers and consumers setup. If you are using json data format with kafka topics in a data pipeline, avro format can be leveraged instead of json.
@josephjoestar995
@josephjoestar995 7 месяцев назад
@@sumitnekar8965could you explain further please? I’m doing some investigation work on choosing avro v parquet v delta tables for Azure Event Hubs output, your explanation would be appreciated 🙏
@user-gh4lv2ub2j
@user-gh4lv2ub2j 9 месяцев назад
As a mathematician I must inform you that having a row space vs a column space is an isomorphism. There is no difference; it's in your head.
Далее
what is Apache Parquet file | Lec-7
47:13
Просмотров 21 тыс.
Drive through the color🚗❓
00:13
Просмотров 5 млн
Different Data File Formats in Big Data Engineering
7:53
Column vs Row Oriented Databases Explained
34:16
Просмотров 74 тыс.
Choose a phone for your mom
0:20
Просмотров 7 млн