Olena Kutsenko - ClickHouse: what is behind the fastest columnar database 

Plain Schwarz

ClickHouse, an open-source columnar database, is exceptional in many ways: it is exceptionally fast, exceptionally efficient, but also, at times, exceptionally confusing.
Its approach to handling data goes against many principles and concepts that we use in other databases. To give some examples: its primary index doesn't index each row and doesn't guarantee uniqueness; a secondary index is used to skip data and doesn't point to specific rows; JOINs are a complex topic, and transactions are only partially supported, not to mention that its SQL dialect holds a couple of surprises up its sleeve.
But, all that said, if used correctly, ClickHouse is a superb solution for online analytical processing (OLAP).
The goal of this talk is to help you get the most out of ClickHouse and avoid the pitfalls. We'll talk about OLAP and columnar databases. We'll touch on indexing, searching and disk storage. We'll look at the reasons behind the most puzzling concepts of ClickHouse, so that by the end of the talk you find them not only logical, but maybe even fascinating.
If your challenge is analysing terabytes of data - this talk is for you. If you're a data scientist looking for tools to work with big data - this talk is for you. And, of course, if you are just curious about what makes ClickHouse crazy fast - this talk is for you as well.
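To make the abstract's remark about the primary index more concrete, here is a minimal Python sketch of a sparse index: one entry per block of sorted rows rather than one per row, and no uniqueness check. It is an illustration under stated assumptions, not ClickHouse's actual implementation; the names build_sparse_index, candidate_granules, lookup and the tiny GRANULE_SIZE are all hypothetical.

```python
from bisect import bisect_left, bisect_right

GRANULE_SIZE = 4  # hypothetical block size, kept tiny for illustration


def build_sparse_index(sorted_keys, granule_size=GRANULE_SIZE):
    """Keep only the first key of every block ("granule") of sorted rows."""
    return [sorted_keys[i] for i in range(0, len(sorted_keys), granule_size)]


def candidate_granules(marks, key):
    """Return the granule numbers that may contain `key`.

    The index cannot say where a row is, only which blocks can be ruled out.
    """
    # The granule just before the first mark >= key may still end with `key`.
    lo = max(bisect_left(marks, key) - 1, 0)
    # Granules whose first key is greater than `key` cannot contain it.
    hi = bisect_right(marks, key)
    return range(lo, hi)


def lookup(sorted_keys, marks, key, granule_size=GRANULE_SIZE):
    """Scan only the candidate granules and return matching row positions."""
    hits = []
    for g in candidate_granules(marks, key):
        start = g * granule_size
        end = min(start + granule_size, len(sorted_keys))
        for pos in range(start, end):
            if sorted_keys[pos] == key:
                hits.append(pos)
    return hits


# Rows sorted by key; duplicate keys are allowed (no uniqueness guarantee).
keys = [1, 1, 2, 3, 3, 3, 3, 5, 8, 8, 9, 12]
marks = build_sparse_index(keys)
```

With twelve rows and a block size of four, the index holds three entries instead of twelve. A lookup first narrows the search to a few blocks and then scans them, which is why such an index stays small even for very large tables.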
Speaker: Olena Kutsenko
More: 2023.berlinbuzzwords.de/sessi...
Web: 2023.berlinbuzzwords.de/
Fediverse: floss.social/@berlinbuzzwords
Linkedin: / 13978964
Twitter: / berlinbuzzwords

Science

Published: 9 Jul 2024

Comments: 2
@maxmustermann4438, 1 year ago
Hey, awesome talk. Clickhouse seems really interesting. I was wondering, what is the difference between the open source version of Clickhouse and the managed cloud version? One of our customers needs an OLAP database, which has to be deployed on-premises. Would it lack any features (such as RBAC etc.) that are only available in the cloud version? Thanks and keep up the great work!