In this blog post, I will explain different hadoop modes.
Table of Contents
Hadoop can run in 3 different modes
- Local/Standalone Mode
- Pseudo Distributed Mode
- Fully Distributed Model
- Standalone mode is the default mode in which Hadoop run.
- It is used to write MapReduce programs.
- Mainly used for debugging where you don’t really use HDFS.
- You can use input and output both as a local file system.
Pseudo Distributed Mode
- The pseudo-distribute mode is also known as a single-node cluster where both NameNode and DataNode will reside on the same machine.
- In Pseudo-distributed mode, all the Hadoop daemons will be running on a single node/machine.
- Each Hadoop daemon runs in a separate Java process
- Used for development and testing.
Fully Distributed Mode
- Fully Distributed Mode is called as production mode where multiple nodes will be running.
- Master and Slave services will be running on the separate nodes in fully-distributed Hadoop Mode.
- In fully-distributed mode, all daemons are executed in a separate nodes forming a multi node cluster.
Fully distributed mode is the one for which Hadoop is mainly known for but again there is no point in engaging the resource while in testing or debugging phase. So standalone and pseudo-distributed modes are also having their own significance.