Hadoop Install

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 May 4 5:40
Editor
Edited
Edited
2024 May 8 13:11
Refs
Refs
wget https://dlcdn.apache.org/hadoop/common/hadoop-3.3.6/hadoop-3.3.6.tar.gz tar -xvzf hadoop-3.3.6.tar.gz rm hadoop-3.3.6.tar.gz mv hadoop-3.3.6 ~/hadoop export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64 export HADOOP_HOME=~/hadoop export HADOOP_INSTALL=$HADOOP_HOME export HADOOP_MAPRED_HOME=$HADOOP_HOME export HADOOP_COMMON_HOME=$HADOOP_HOME export HADOOP_HDFS_HOME=$HADOOP_HOME export HADOOP_YARN_HOME=$HADOOP_HOME export HADOOP_COMMON_LIB_NATIVE_DIR=$HADOOP_HOME/lib/native export PATH=$PATH:$HADOOP_HOME/sbin:$HADOOP_HOME/bin export HADOOP_OPTS="-Djava.library.path=$HADOOP_HOME/lib/native" mkdir -p ~/hadoopdata/hdfs/{namenode,datanode} # or datanode mkdir -p ~/hadoopdata/hdfs/datanode # Edit nano $HADOOP_HOME/etc/hadoop/core-site.xml # namenode host nano $HADOOP_HOME/etc/hadoop/mapred-site.xml # mapred binary nano $HADOOP_HOME/etc/hadoop/yarn-site.xml # yarn nano $HADOOP_HOME/etc/hadoop/hdfs-site.xml # hdfs folder # edit java_home for hadoop_env.sh hdfs namenode -format ./start-all.sh
Hadoop Install Methods
 
 
hadoop-env.sh
export HDFS_NAMENODE_USER="root" export HDFS_DATANODE_USER="root" export HDFS_SECONDARYNAMENODE_USER="root" export YARN_RESOURCEMANAGER_USER="root" export YARN_NODEMANAGER_USER="root" export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
hdfs-site.xml
<configuration> <property> <name>dfs.replication</name> <value>1</value> </property> <property> <name>dfs.namenode.name.dir</name> <value>file:///home/hadoop/hadoopdata/hdfs/namenode</value> </property> <property> <name>dfs.datanode.data.dir</name> <value>file:///home/hadoop/hadoopdata/hdfs/datanode</value> </property> </configuration>
core-site.xml
9000 is IPC(Inter-Process Communication) for Hadoop NameNode
<configuration> <property> <name>fs.defaultFS</name> <value>hdfs://0.0.0.0:9000</value> </property> </configuration>
yarn-site.xml
<configuration> <property> <name>yarn.resourcemanager.hostname</name> <value>172.31.15.227</value> </property> <property> <name>yarn.nodemanager.resource.memory-mb</name> <value>8192</value> </property> <property> <name>yarn.nodemanager.resource.cpu-vcores</name> <value>2</value> </property> <property> <name>yarn.nodemanager.aux-services</name> <value>mapreduce_shuffle</value> </property> <property> <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name> <value>org.apache.hadoop.mapred.ShuffleHandler</value> </property> </configuration>
workers
172.31.15.227 172.31.13.72
mapred-site.xml
<configuration> <property> <name>yarn.app.mapreduce.am.env</name> <value>HADOOP_MAPRED_HOME=$HADOOP_HOME/bin/hadoop</value> </property> <property> <name>mapreduce.map.env</name> <value>HADOOP_MAPRED_HOME=$HADOOP_HOME/bin/hadoop</value> </property> <property> <name>mapreduce.reduce.env</name> <value>HADOOP_MAPRED_HOME=$HADOOP_HOME/bin/hadoop</value> </property> </configuration>
 
 
 
 
Hadoop Error starting ResourceManager and NodeManager
I'm trying to setup Hadoop3-alpha3 with a Single Node Cluster (Psuedo-distributed) and using the apache guide to do so. I've tried running the example MapReduce job but every time the connection is
Hadoop Error starting ResourceManager and NodeManager
Apache Hadoop 3.3.6 Installation on Ubuntu 22.04
In the ever-expanding world of big data, managing and processing vast amounts of information efficiently has become paramount for…
Apache Hadoop 3.3.6 Installation on Ubuntu 22.04
 
 

Recommendations