Тёмный
No video :(

Centralised Data Sharing using Analytics Hub 

PracticalGCP
Подписаться 3,2 тыс.
Просмотров 2,9 тыс.
50% 1

Sharing data in a medium - large organisation has always been a big challenge.
In today's talk I've described some of these data sharing challenges I've seen over the past years in different organisations, and how the new Google Cloud product Analytics Hub can potentially solve this in a much easier and user friendly way in the analytics community.
01:50 - Data Sharing challenges
04:59 - What is Analytics Hub
08:48 - a quick demo
16:25 - Centralise data sharing using Analytics Hub
21:41 - Data Clean Room
24:16 - The trend to remove ETL on data sharing
26:41 - Summary
- Link to the slide: docs.google.co...

Опубликовано:

 

5 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 16   
@nishantmiglani7021
@nishantmiglani7021 2 месяца назад
Thanks a lot, Richard He, for creating this insightful video on Analytics Hub.
@rudytrisaputra2301
@rudytrisaputra2301 9 месяцев назад
Thank you for sharing, Richard! I am truly interested in exploring the concept of a Data Clean Room, there is a desire to facilitate data sharing for transformation without the need for data movement processes. So the better ways to do Data Sharing with Analytics Hub is we need to create a new project to deploy the Analytics Hub, this project will centralize the sharing process?
@practicalgcp2780
@practicalgcp2780 9 месяцев назад
Thank you for the comment! In my opinion, it’s a good model to create a centralised project to create exchanges where you may want to centralise who owns them and who can publish, and consistent naming conventions. So it doesn’t become a mess.
@WiktorJurek
@WiktorJurek Год назад
This is a pretty cool breakdown - where do you see the analytics hub configuration sitting? In the data generator project, or in a project of it's own?
@practicalgcp2780
@practicalgcp2780 Год назад
Thank you! I am not sure what is the best design, but in my option it would be better to keep the all the exchanges in a single separate project that is managed by the data platform team. That way you can apply governance and privacy control must easily, if you keep them in the source projects, it could still end up with each team doing whatever they like problem and it’s more difficult to monitor as well
@mohdabbas7794
@mohdabbas7794 6 месяцев назад
Sir Please make video on same with VPCSC
@practicalgcp2780
@practicalgcp2780 5 месяцев назад
Can you give a bit more detail on what problem you try to solve with VPC SC?
@practicalgcp2780
@practicalgcp2780 5 месяцев назад
There is something published by our team a while back you might find useful medium.com/@vmo2techteam/how-we-secured-our-data-on-the-cloud-341d4ac394b9
@mohdabbas7794
@mohdabbas7794 5 месяцев назад
@@practicalgcp2780 The Problem statement is something like Let's say we have a VPCSC restricted environment. where Project A is a centralised data sharing project for Bigquery. In that case to establish the communication between centralised project A and other project that are consuming the sharing data and those project for them we are creating exchanges and listing to share the data. what should be the VPCSC Service Perimeter Policies. Example Ingress and egress policies.
@practicalgcp2780
@practicalgcp2780 5 месяцев назад
it really depends on how you set things up in your org. Typically you may not want to have too many perimeters in the same org, because the overhead maybe too much, one single perimeter for the whole org is also a valid setup, so you can prevent risks from outside of the org, but within the org no whitelisting is required. I haven’t done this for analytics hub, but I believe it’s the same, you need to whitelist both ingress and egress rules as you are trying to get access to data from outside your org.
@andrzejmaj3190
@andrzejmaj3190 11 месяцев назад
Thank you for that. One question - if I'm not mistaken, Analytics Hub won't assist when querying tables located across multiple regions, like the US and EU, without some form of replication. Is that correct?
@practicalgcp2780
@practicalgcp2780 11 месяцев назад
Hi there, no it won’t. But google just announced dataset replication in preview, check it out here cloud.google.com/bigquery/docs/data-replication
@practicalgcp2780
@practicalgcp2780 11 месяцев назад
Actually I think I may have misunderstood the purpose of data-replication. I think this is more created for a primary / replica disaster recovery sort of use case, or data migrations between regions. Not for the ability to query the data on a separate region which I think is what you are trying to achieve.
@harshchoudhary6069
@harshchoudhary6069 4 месяца назад
How we can share the authorized view using analytics hub?
@practicalgcp2780
@practicalgcp2780 4 месяца назад
It makes no difference using authorised views, as authorised view permissions are managed the same way as tables, different to normal views. However, using authorised views has some tradeoffs, a key one being losing metadata such as column descriptions which isn’t great for data consumers. But it does have the advantage if you don’t want to duplicate data models or increase latencies
Далее
BigQuery to Datastore via Remote Functions
22:20
Просмотров 1,5 тыс.
ЭТО мне КУПИЛИ ПОДПИСЧИКИ 📦
22:33
НЕДОВОЛЬНА УСЛУГОЙ #shorts
00:27
Просмотров 21 тыс.
GCP DataProc: Spark on cloud made easy!!
41:43
Serverless distributed processing with BigFrames
27:53
Просмотров 2,2 тыс.
Cloud Run with IAP
23:10
Просмотров 6 тыс.
Run Apache Spark jobs on serverless Dataproc
30:18
Просмотров 4,2 тыс.
ЭТО мне КУПИЛИ ПОДПИСЧИКИ 📦
22:33