Hadoop installation is supported in Standalone, Pseudo-Distributed, or Fully-Distributed Modes. This article is a guide to understand steps involved to install Hadoop on Mac OS X in Pseudo-Distributed Mode.
The pseudo-distributed mode of Hadoop lies between standalone mode and the fully distributed mode. It is useful to simulate an environment closer to production but on a smaller scale.
Similar to standalone mode Hadoop installed on pseudo-distributed mode runs on a single node. However, the difference is that all the Hadoop daemons run in separate java processes.
Installing java is one of the prerequisite for Hadoop installation also the latest Hadoop distributions would require minimum of Java 7 i.e., Java 1.7 to be installed.
The below command can be used to check what version of java is installed.
Output of the command must be similar to the below.
Step 1 - Download Hadoop
Download any latest stable binary distribution of Hadoop from the Apache Hadoop site. At the time of writing the blog the latest version of Hadoop is hadoop-3.2.1
Move the downloaded tar.gz file to a desired location (In this case Install/Hadoop) and unpack the same.