[Hadoop] HDFS Architecture


What is HDFS Architecture?

The Hadoop Distributed File System (HDFS) follows a master-worker architecture. The master is the NameNode, and the workers are the DataNodes, which run on low-cost commodity hardware and store the actual data. A cluster in this architecture has a single NameNode and multiple DataNodes.
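To make the split between metadata and data concrete, here is a minimal toy model in Python. It is only a sketch: the class names, `write_file`, `read_file`, the tiny block size, and the round-robin placement are all assumptions made for illustration and are not how HDFS is implemented.

```python
from dataclasses import dataclass, field


@dataclass
class DataNode:
    """Worker: holds the actual block contents."""
    name: str
    blocks: dict = field(default_factory=dict)  # block_id -> bytes


@dataclass
class NameNode:
    """Master: holds only metadata, never file contents."""
    namespace: dict = field(default_factory=dict)        # path -> [block_id, ...]
    block_locations: dict = field(default_factory=dict)  # block_id -> [DataNode name, ...]


def write_file(nn, datanodes, path, data, block_size=4, replication=2):
    """Split data into blocks and store each block on `replication` DataNodes."""
    block_ids = []
    for offset in range(0, len(data), block_size):
        index = offset // block_size
        block_id = f"{path}#blk{index}"
        # Round-robin placement: a simplification, not HDFS's real placement policy.
        start = index % len(datanodes)
        targets = [datanodes[(start + k) % len(datanodes)] for k in range(replication)]
        for dn in targets:
            dn.blocks[block_id] = data[offset:offset + block_size]
        nn.block_locations[block_id] = [dn.name for dn in targets]
        block_ids.append(block_id)
    nn.namespace[path] = block_ids


def read_file(nn, datanodes, path):
    """Ask the NameNode where the blocks are, then read them from the DataNodes."""
    by_name = {dn.name: dn for dn in datanodes}
    chunks = []
    for block_id in nn.namespace[path]:
        first_replica = nn.block_locations[block_id][0]
        chunks.append(by_name[first_replica].blocks[block_id])
    return b"".join(chunks)
```

In this sketch the `NameNode` object never touches file bytes. It only records which blocks make up a path and which DataNodes hold each block, which is the division of labor described above.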

What is the task of NameNode?

The NameNode stores the metadata of the file system and other information about the DataNodes. The NameNode is also responsible for:
- Managing the file-system namespace.
- Controlling client access to the data blocks.
- Periodically checking the availability of the DataNodes.
- Maintaining the replication factor of the data blocks.
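The replication check in the list above can be sketched as a small function. This is an illustrative assumption of how such a check might look, not NameNode code: `find_under_replicated` and its arguments are hypothetical names, and the default of 3 mirrors the usual HDFS default replication factor.

```python
def find_under_replicated(block_locations, live_datanodes, replication_factor=3):
    """Return {block_id: missing_replica_count} for blocks below the target.

    block_locations: block_id -> list of DataNode names holding a replica
    live_datanodes:  set of DataNode names currently considered available
    """
    under = {}
    for block_id, holders in block_locations.items():
        # Replicas on unavailable DataNodes do not count toward the target.
        live_replicas = [dn for dn in holders if dn in live_datanodes]
        missing = replication_factor - len(live_replicas)
        if missing > 0:
            under[block_id] = missing
    return under


if __name__ == "__main__":
    locations = {
        "blk_1": ["dn1", "dn2", "dn3"],
        "blk_2": ["dn2", "dn3", "dn4"],
    }
    # dn4 is unavailable, so blk_2 is one replica short.
    print(find_under_replicated(locations, {"dn1", "dn2", "dn3"}))
```

A real NameNode would go on to schedule new copies of the affected blocks on other DataNodes. The sketch stops at detecting them.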

What is the task of DataNodes?

DataNodes are the main storage for data. Hadoop uses low-cost hardware to store data.
DataNodes are responsible for storing, replicating, creating, and deleting blocks according to the instructions of the NameNode.
Each DataNode periodically sends a health report (a heartbeat) to the NameNode. The default interval is 3 seconds, so every 3 seconds each DataNode reports to the NameNode.
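The periodic report can be pictured as the NameNode keeping a "last seen" timestamp per DataNode. The following is a minimal sketch under stated assumptions: `HeartbeatMonitor` is a hypothetical class, and the `dead_after_s` threshold is an arbitrary illustrative value rather than the timeout HDFS actually uses.

```python
HEARTBEAT_INTERVAL_S = 3  # the default reporting interval described above


class HeartbeatMonitor:
    """Tracks the last report time of each DataNode (toy model)."""

    def __init__(self, dead_after_s=30):
        # Illustrative threshold only; not HDFS's real dead-node timeout.
        self.dead_after_s = dead_after_s
        self.last_seen = {}  # DataNode name -> time of last report, in seconds

    def record_heartbeat(self, datanode, now):
        self.last_seen[datanode] = now

    def live_datanodes(self, now):
        return {dn for dn, t in self.last_seen.items()
                if now - t <= self.dead_after_s}

    def dead_datanodes(self, now):
        return set(self.last_seen) - self.live_datanodes(now)


if __name__ == "__main__":
    monitor = HeartbeatMonitor(dead_after_s=30)
    # dn1 reports every HEARTBEAT_INTERVAL_S seconds; dn2 reports once and goes silent.
    monitor.record_heartbeat("dn2", now=0)
    for t in range(0, 40, HEARTBEAT_INTERVAL_S):
        monitor.record_heartbeat("dn1", now=t)
    print(monitor.dead_datanodes(now=40))
```

Times are passed in explicitly instead of being read from a clock, which keeps the example deterministic and easy to test.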

What is the Secondary NameNode?

The Secondary NameNode is a separate, dedicated node that takes checkpoints of the file system. It is not a substitute for the primary NameNode: it helps the NameNode but does not replace it.
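In HDFS, a checkpoint merges the saved namespace image (the fsimage) with the log of changes made since that image was written (the edit log). The sketch below shows that idea only. The edit tuple format and the function names `apply_edit` and `checkpoint` are assumptions for illustration and do not reflect the real on-disk formats.

```python
def apply_edit(namespace, edit):
    """Apply one logged change to an in-memory namespace (path -> block list)."""
    op, path = edit[0], edit[1]
    if op == "create":
        namespace[path] = []
    elif op == "add_block":
        namespace[path].append(edit[2])
    elif op == "delete":
        namespace.pop(path, None)
    else:
        raise ValueError(f"unknown op: {op}")


def checkpoint(fsimage, edit_log):
    """Merge the edit log into a copy of the image.

    Returns (new_image, empty_edit_log). The inputs are left untouched.
    """
    new_image = {path: list(blocks) for path, blocks in fsimage.items()}
    for edit in edit_log:
        apply_edit(new_image, edit)
    return new_image, []


if __name__ == "__main__":
    image = {"/a": ["b1"]}
    log = [("create", "/b"), ("add_block", "/b", "b2"), ("delete", "/a")]
    new_image, new_log = checkpoint(image, log)
    print(new_image, new_log)
```

Merging on a separate node keeps the edit log short without putting that work on the NameNode, which is why the Secondary NameNode is a helper and not a standby.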
