AWS Tutorials - Data Ingestion Services in AWS 

AWS Tutorials
13K subscribers
6K views

Published: 11 Sep 2024

Comments: 27
@sandeepmodaliar8042 2 years ago
This channel deserves more than 1 lakh subscribers for sure.
@AWSTutorialsOnline 2 years ago
Thanks
@Skandawin78 3 months ago
Very good presentation.
@ranjanasingh7950 1 year ago
Thank you for this extremely informative session.
@SumitSharma-zp2sh 4 months ago
Can you comment on getting data from SaaS applications that only provide APIs, where pulling a large volume of data may take 5-6 hours? Is Glue the right approach?
@Shwetahosur1220 2 years ago
It's very informative. Thank you.
@AWSTutorialsOnline 2 years ago
You are welcome.
@anchalgupta5674 2 years ago
Thank you, the content is quite helpful. Could you please share a video with a step-by-step setup of a data ingestion pipeline using Glue/Lambda/DMS?
@AWSTutorialsOnline 2 years ago
I have already covered pipeline configuration using Glue Workflow and DMS. Please search my channel.
@bharathikannan5772 2 years ago
Very good explanation.
@AWSTutorialsOnline 2 years ago
Thanks, and welcome.
@imsanjaya 2 years ago
Hi, it was an excellent comparison. Can we also compare the licensing cost? Please tell us which ingestion tool has the higher licensing cost.
@AWSTutorialsOnline 2 years ago
There is no license cost. Being cloud services, they follow a pay-as-you-go model. You can see the pricing of each service online. For instance, here is the Lambda and Glue pricing: aws.amazon.com/lambda/pricing/ aws.amazon.com/glue/pricing/
@avishekghosh4879 2 years ago
Quite insightful...
@AWSTutorialsOnline 2 years ago
Thanks
@tracyding4906 2 years ago
AWS EMR is not mentioned. EMR can do some streaming data ingestion, am I right?
@AWSTutorialsOnline 2 years ago
EMR can process streaming data, but the ingestion itself is done by a service such as Kinesis. Kinesis ingests the data and then hands it over to EMR for processing, for example through Kinesis Firehose.
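A minimal sketch of the ingestion side of that split, assuming a Kinesis data stream already exists and that events are plain dictionaries. The stream name, the `user_id` partition key field, and both helper functions are hypothetical. The client is passed in so the logic can be exercised without AWS access; in practice it would be `boto3.client("kinesis")`. The hand-off to EMR is not shown.

```python
import json

# PutRecords accepts at most 500 records per call.
MAX_BATCH = 500


def to_kinesis_records(events, key_field):
    """Convert event dicts into the record shape expected by PutRecords."""
    return [
        {
            "Data": json.dumps(event).encode("utf-8"),
            "PartitionKey": str(event[key_field]),
        }
        for event in events
    ]


def send_events(client, stream_name, events, key_field="user_id"):
    """Send events in batches and return how many records Kinesis rejected."""
    records = to_kinesis_records(events, key_field)
    failed = 0
    for start in range(0, len(records), MAX_BATCH):
        batch = records[start:start + MAX_BATCH]
        response = client.put_records(StreamName=stream_name, Records=batch)
        # A non-zero count means some records in the batch need a retry.
        failed += response.get("FailedRecordCount", 0)
    return failed
```

With a real client this would be called as `send_events(boto3.client("kinesis"), "demo-stream", events)`. Retrying rejected records is left out to keep the sketch short.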
@hsz7338 2 years ago
Thank you, it is extremely informative, and it helps with data ingestion decision making. As an observation, the services suggested in the tutorial are AWS native services. Would you consider AWS MSK or AWS managed RabbitMQ if the data sources are social media and web traffic/front-end client event data?
@AWSTutorialsOnline 2 years ago
You are right. The scope of the tutorial was AWS native services with S3 as the data location. However, since the AWS Lake House architecture supports multiple databases registered with various types of data sources, it is perfectly OK to use MSK or RabbitMQ for the ingestion.
@vivekjacobalex 2 years ago
For the file system data source, we can also transfer files directly from the source location to S3, either programmatically (AWS CLI, Python boto3) or by a user logging in to the S3 console.
@AWSTutorialsOnline 2 years ago
Technically you can. Think about security, especially if you are using an AWS access key / secret key. Also think about scalability and performance when handling big files, especially when using code.
@vivekjacobalex 2 years ago
@AWSTutorialsOnline OK
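A minimal sketch of the programmatic transfer discussed in this thread, assuming the source is a local directory. The helper names, bucket, and prefix are hypothetical. The S3 client is passed in rather than built from an access key / secret key in code; in practice it could be `boto3.client("s3")`, which resolves credentials from its default chain (for example an IAM role), and whose `upload_file` call switches to multipart upload for large files.

```python
from pathlib import Path


def build_s3_key(local_path, root, prefix):
    """Map a local file path to an S3 key under the given prefix."""
    relative = Path(local_path).relative_to(root).as_posix()
    prefix = prefix.strip("/")
    return f"{prefix}/{relative}" if prefix else relative


def upload_directory(s3_client, root, bucket, prefix):
    """Upload every file under root to the bucket and return the keys written."""
    uploaded = []
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            key = build_s3_key(path, root, prefix)
            # upload_file streams from disk instead of loading the file in memory.
            s3_client.upload_file(Filename=str(path), Bucket=bucket, Key=key)
            uploaded.append(key)
    return uploaded
```

A call might look like `upload_directory(boto3.client("s3"), "/data/exports", "demo-bucket", "raw/files")`. The sketch uploads files one at a time, so the scalability concern raised in the reply still applies to very large file sets.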
@anasjemaa9138 2 years ago
Thank you, great content! I love your video and you are excellent at synthesizing the information. If I use AWS Glue, can I ingest multiple tables in one job?
@AWSTutorialsOnline 2 years ago
Yes, absolutely.
@adityamishra7438 2 years ago
@AWSTutorialsOnline Please help me with this: how can we automate it? For example, when the table names change, can we use a for loop over a list of tables?
@AWSTutorialsOnline 2 years ago
@adityamishra7438 You can pass comma-separated table names in a job parameter at the time of job execution. Then loop over the tables and ingest them one by one.
@adityamishra7438 2 years ago
@AWSTutorialsOnline Thanks, can you please show me the code?
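A minimal sketch of the approach described in the reply above, not the channel author's code. It assumes the tables are registered in the Glue Data Catalog and are written to S3 as Parquet. The parameter names `table_names`, `source_database`, and `target_path` are hypothetical, as are the helper functions. The Glue imports live inside `run_glue_job` because they are only available in the Glue job runtime.

```python
import sys


def parse_table_names(raw):
    """Split a comma-separated job parameter into clean table names."""
    return [name.strip() for name in raw.split(",") if name.strip()]


def ingest_tables(table_names, ingest_one):
    """Ingest tables one by one and collect failures instead of stopping early."""
    failed = {}
    for table in table_names:
        try:
            ingest_one(table)
        except Exception as exc:
            failed[table] = str(exc)
    return failed


def run_glue_job():
    # These modules exist only inside the AWS Glue job environment.
    from awsglue.context import GlueContext
    from awsglue.utils import getResolvedOptions
    from pyspark.context import SparkContext

    # Example job parameter: --table_names orders,customers,items
    args = getResolvedOptions(
        sys.argv, ["JOB_NAME", "table_names", "source_database", "target_path"]
    )
    glue_context = GlueContext(SparkContext.getOrCreate())

    def ingest_one(table):
        frame = glue_context.create_dynamic_frame.from_catalog(
            database=args["source_database"], table_name=table
        )
        glue_context.write_dynamic_frame.from_options(
            frame=frame,
            connection_type="s3",
            connection_options={"path": f"{args['target_path']}/{table}/"},
            format="parquet",
        )

    failed = ingest_tables(parse_table_names(args["table_names"]), ingest_one)
    if failed:
        raise RuntimeError(f"Ingestion failed for: {failed}")
```

In the job script, `run_glue_job()` would be called as the last line. Because the table list comes from a job parameter, a change in table names only requires a different parameter value at run time, not a code change.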