How to Install HDFS on Clear Linux Latest
HDFS stands for Hadoop Distributed File System. It is a distributed file system designed to run on commodity hardware. HDFS is the storage component of Apache Hadoop and provides a reliable and fault-tolerant way to store large amounts of data.
In this tutorial, we will learn how to install HDFS on Clear Linux Latest.
Prerequisites
- Clear Linux Latest
- Java Development Kit (OpenJDK) installed
Step 1: Download Hadoop
- Download Hadoop from the official website https://hadoop.apache.org/releases.html.
- Choose the latest stable release.
- Click on the link to download the compressed file in tar format.
Step 2: Extract the Compressed File
- Open a terminal window and navigate to the directory where the compressed file was downloaded.
- Use the following command to extract the contents of the compressed file:
tar -xvf hadoop-<version>.tar.gz
Replace <version> with the version number of the downloaded file.
Step 3: Move the Hadoop Folder
- Move the extracted Hadoop folder to a directory of your choice.
- In this tutorial, we will move the folder to
opt/:
sudo mv hadoop-<version> /opt/
Step 4: Set the Environment Variables
- Open the
.bashrcfile using a text editor:
nano ~/.bashrc
- Add the following lines to the end of the file:
export HADOOP_HOME=/opt/hadoop-<version>
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
- Replace
<version>with the version number of the downloaded file. - Save and close the file.
- Reload the
.bashrcfile:
source ~/.bashrc
Step 5: Format the Namenode
- Open a terminal window and navigate to the Hadoop bin directory:
cd /opt/hadoop-<version>/bin
- Use the following command to format the namenode:
hdfs namenode -format
Step 6: Start HDFS
- Open a terminal window and navigate to the Hadoop sbin directory:
cd /opt/hadoop-<version>/sbin
- Use the following command to start HDFS:
./start-dfs.sh
Step 7: Test HDFS
- Open a terminal window and navigate to the Hadoop bin directory:
cd /opt/hadoop-<version>/bin
- Use the following command to create a directory in HDFS:
hdfs dfs -mkdir /test
- Use the following command to list the contents of the directory:
hdfs dfs -ls /
- If the directory
/testis listed, then HDFS has been successfully installed.
Conclusion
In this tutorial, we learned how to install HDFS on Clear Linux Latest. We also learned how to configure the environment variables, format the namenode, start HDFS, and test the installation.