I really like and follow your sessions. I found these videos really helpful for building up concepts, but it would be even more helpful if you could provide a practical approach, vendor details, and their alternatives.
Appreciated!! Maybe you could have reduced the time by talking more to the point; it would have been more interesting (just a suggestion ;))
Good video. My 2 cents: the data should be replicated across different Availability Zones rather than just across different Data Centers. It is entirely possible that, due to a geographic catastrophe, two DCs in the same zone are equally susceptible to destruction. This is what most of the top cloud providers do.
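To make that concrete, here is a minimal sketch of zone-aware replica placement, assuming each node carries a zone label; all names here (StorageNode, pickReplicas, the zone strings) are hypothetical illustrations, not any provider's or HDFS's actual placement API:

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical node descriptor: each storage node knows which
// availability zone it lives in.
record StorageNode(String id, String zone) {}

class ZoneAwarePlacement {
    // Pick up to `replicas` nodes such that no two share a zone, so a
    // zone-wide catastrophe cannot destroy every copy at once.
    static List<StorageNode> pickReplicas(List<StorageNode> nodes, int replicas) {
        List<StorageNode> chosen = new ArrayList<>();
        Set<String> usedZones = new HashSet<>();
        for (StorageNode node : nodes) {
            if (chosen.size() == replicas) break;
            if (usedZones.add(node.zone())) {   // true only for a zone not used yet
                chosen.add(node);
            }
        }
        return chosen;
    }

    public static void main(String[] args) {
        List<StorageNode> cluster = List.of(
            new StorageNode("dc1-a", "us-east-1a"),
            new StorageNode("dc2-a", "us-east-1a"),  // same zone as dc1-a: skipped
            new StorageNode("dc1-b", "us-east-1b"),
            new StorageNode("dc1-c", "us-east-1c"));
        // Prints three replicas, each in a distinct zone.
        System.out.println(pickReplicas(cluster, 3));
    }
}
```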
Thanks for the video. For DFS you have multiple copies of a data file; I understand that when some machines break, it serves as a backup. Okay, that is one benefit, but in terms of efficiency, is it more efficient than having one file system without a backup? How does having multiple copies of a data file make things faster?
Hi, I'd like to ask how we can do the system design of an anti-virus scanner system, upgrading software on a fleet of machines, or a distributed botnet. Are all of these related to the distributed file system logic? How can I find the system design details of the above subjects? Are there any videos of yours that cover them? I looked into your video library but couldn't find a directly related one. Could you please help me? Thank you
In the video, you mentioned that the name node will tell the client which data node to use to upload the file. However, files are stored in chunks across the different data nodes in the cluster. Does the name node also pass along how many chunks need to be created for the given file and which data node to use for each chunk?
The default HDFS block size is 64 MB (128 MB in Hadoop 2.x and later). The block size affects the performance of filesystem operations; larger block sizes are generally more effective for large sequential workloads. You can configure the block size cluster-wide or per file, and the NameNode records the configured block size in each file's metadata.
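For anyone who wants to try this, a minimal sketch using Hadoop's Java client, assuming a reachable HDFS cluster; the path /data/big-file.bin and the sizes are just examples:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class BlockSizeExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Client-side default for files created through this configuration (128 MB).
        conf.setLong("dfs.blocksize", 128L * 1024 * 1024);

        FileSystem fs = FileSystem.get(conf);

        // Per-file override: this create() overload lets the client choose
        // the replication factor and block size for a single file.
        short replication = 3;
        long blockSize = 256L * 1024 * 1024; // 256 MB for this file only
        try (FSDataOutputStream out = fs.create(
                new Path("/data/big-file.bin"),
                true,      // overwrite if the file exists
                4096,      // I/O buffer size
                replication,
                blockSize)) {
            out.writeBytes("hello");
        }
    }
}
```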
@@TechDummiesNarendraL Thanks for the reply. I understand that the block size can be configured, but I am more interested in learning how the file gets split into chunks and how the name node knows about this. Is it the responsibility of the client (the HDFS library) to split the file into chunks and then talk to the data nodes to upload the file?
@@MrSauce714 But who is the source of truth in this case? I would imagine the name node is, but the video is not explicit about this. Not sure why you're talking about a cache here.
@@minostro What I mean is that at runtime the client application would have a chunk-to-node mapping, which could be stored in memory. I may be wrong.
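For what it's worth, in HDFS/GFS-style designs the client library does the splitting, the name node assigns (and remains the source of truth for) block locations, and the client just caches that mapping while it works. A rough sketch of the write path; every name below is hypothetical for illustration, not Hadoop's actual API:

```java
import java.util.List;

// Hypothetical interfaces standing in for the name node RPC and data node transfer.
interface NameNodeClient {
    // The name node picks the data nodes for the next block and records the
    // assignment in its metadata -- it stays the source of truth.
    List<String> allocateBlock(String path, int blockIndex);
}
interface DataNodeClient {
    void writeBlock(String dataNode, byte[] block);
}

class DfsWriter {
    private final NameNodeClient nameNode;
    private final DataNodeClient dataNodes;
    private final long blockSize;

    DfsWriter(NameNodeClient nn, DataNodeClient dn, long blockSize) {
        this.nameNode = nn;
        this.dataNodes = dn;
        this.blockSize = blockSize;
    }

    void upload(String path, byte[] file) {
        int numBlocks = (int) Math.ceil((double) file.length / blockSize);
        for (int i = 0; i < numBlocks; i++) {
            int from = (int) (i * blockSize);
            int to = (int) Math.min(file.length, from + blockSize);
            byte[] block = java.util.Arrays.copyOfRange(file, from, to);

            // 1. Client asks the name node where this block should go.
            List<String> pipeline = nameNode.allocateBlock(path, i);

            // 2. Client streams the block to the assigned data nodes. (In real
            //    HDFS the client writes only to the first node, which forwards
            //    the data down a replication pipeline; flattened here.)
            for (String node : pipeline) {
                dataNodes.writeBlock(node, block);
            }
        }
    }
}
```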
Sir, as you said, if we store our file on Node 1, its replica will be stored on Node 3, right? But if Node 3 crashes, the master/name node will ask Node 1 to replicate this file to Node 2 if space is available. So my question is: if space is full on Node 2, where will the replica of the file be stored?
First paper to read: static.googleusercontent.com/media/research.google.com/en//archive/gfs-sosp2003.pdf
Second paper to read: queue.acm.org/detail.cfm?id=1594206