Apache Iceberg on AWS with S3 and Athena [FULL COURSE IN 30MIN]

Johnny Chivers

Подписаться 21 тыс.

Просмотров 22 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Опубликовано:

28 сен 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 33

@____prajwal____ Год назад

Great vid. Please make one for Hudi.

@pauldevillers797 Год назад

Would be nice for hudi

@JohnnyChivers Год назад

Added to the list!

@BenOgorek Год назад

Great video! Just a heads up that the timestamps are in UTC, so most of us will have to do the offset calculation (5 hours ahead for EST during daylight savings). Maybe there's an easier way to specify that. Also, I'm really curious about the distinction between avro and parquet. I noticed that avro files were used in the metadata but parquet were used for the data. I heard Iceberg can accept avro and was wondering if there are advantages to only using avro.

@RahulSinghPatel-st6yb 4 месяца назад

line 3:5: mismatched input 'SYSTEM_TIME'. Expecting: 'TIMESTAMP', 'VERSION' I'm getting this error while running the timestamp querry. can you please tell me why?

@tieduprightnowprcls Год назад

Iceberg table is suitable for transformed layer or curated layer data rather than implementing it for raw data layer, am I right?

@DataEngUncomplicated Год назад

Great job Johnny! Im excited about the potential of Iceberg on AWS too!

@terri1258 3 месяца назад

so useful!

@harivigsp7934 4 месяца назад

Can we create an iceberg table to S3 using multi region access point?

@deepg6139 4 месяца назад

For a very large dataset (like around 15 billion rows overall) is it going to give good performance if we use iceberg to select/delete/update ?

@alecbg919 6 месяцев назад

Around 26 minutes after you queried the deleted data it said it scanned 5.76MB. That seems like a lot for just metadata!

@tieduprightnowprcls Год назад

I failed to create nested y/m/d partition for iceberg table in Athena, how to accomplish this?

@thiagoa1851 Год назад

After running the SQL delete, iceberg stills query with the time travel feature?

@JohnnyChivers Год назад

Yes, the snapshots are still present.

@viewermm1588 3 месяца назад

Hi all, when creating iceberg table in Athena , I get " Exception encountered when executing query, this query ran against ...... database, unless qualified by the query . please post the error message on our forum ....., anyone know the solution ?

@mickyman753 3 месяца назад

Johnny the speed comes from partition by column we use while creating? Like if I used a different column insyead of date and and used the date related queries , will it still be faster or not?

@fancystacy 3 месяца назад

Thanks. that was fast and quite easy to uderstand. But if you would put cross links to your other videos like about Glue this would become even greater!

@HariPrasadEluri Год назад

is there any way that it wont create random prefixes while inserting the partitioned data at @18:10?

@sungkim1830 Год назад

Hello Johnny chivers. Is there a way to create iceberg table with existing metadata and data using Athena or Glue?

@gregf9160 Год назад

Great intro to Iceberg, Johnny. Quick question, as well as delete can it support Truncate? Deletes are fine for a relatively small number of rows (in traditional DBMS's this is also true) but on millions of rows, Delete takes forever compared with Truncate. With Iceberg updating all those Manifests as it's deleting each row, would that not also be bit of a bottleneck, or is that offset somewhat by the compute resources of AWS?

@faingtoku Год назад

Great video ! Would be great one using streaming from kinesis to iceberg. Like kinesis +EMR + glue catalog + iceberg

@federicomanueldlouky5231 Год назад

great explanations! love your videos!!! thanks! 🙂

@nic-tf5dx Год назад

Love it! Looking forward to more Apache Iceberg. Maybe in connection with Dremio

@naveenkumarmurugan1962 3 месяца назад

thank you

@flaviolanfranco Год назад

Nice tutorial! I love how you share your knowledge! Thanks!

@jeffschroeder8875 8 месяцев назад

Can you write me a snippet of code the moves an iceberg column to a different column position? I cannot for the life of me get it to work based on the AWS documention. Thanks. Tried several variants similar to: ALTER TABLE database.table_name CHANGE field1 string AFTER field2

@jeffschroeder8875 8 месяцев назад

ALTER TABLE database.table_name CHANGE field1 field1 string AFTER field2

@swapnilbhoite902 Год назад

What a fantastic video. Great learning :)

@jesper6988 Год назад

Love your vids, really appreciate the work you do!

@danilomenoli 5 месяцев назад

You are amazing❤

@lucasgambi 10 месяцев назад

U are the AWS GOAT!!

@wuerikehenriquedasilvacava928 Год назад

After populating the iceberg table, at 18:10, why it creates a folder with random chars before each partition folder? I'd like to have the partitions folders right after the data folder