騰訊云中偽分布式配置:
首先給主機(jī)定義一個名稱:注意這里需要配置本機(jī)的內(nèi)網(wǎng)機(jī)器蚜迅,其它機(jī)器的外網(wǎng)地址
10.104.222.163 hadoopmaster
127.0.0.1 VM_222_163_centos VM_222_163_centos
127.0.0.1 localhost.localdomain localhost
127.0.0.1 localhost4.localdomain4 localhost4
# The following lines are desirable for IPv6 capable hosts
::1 VM_222_163_centos VM_222_163_centos
::1 localhost.localdomain localhost
::1 localhost6.localdomain6 localhost6
hadoop安裝目錄假定為${HADOOOP_HOME}沃暗,當(dāng)前hadoop版本為2.9.1:
1 在${HADOOOP_HOME}/etc/hadoop目錄下驯妄,修改下面幾個文件:
core-site.xml
<configuration>
<!-- 指定HDFS namenode 的通信地址 -->
<property>
<name>fs.defaultFS</name>
<value>hdfs://hadoopmaster:9000</value>
</property>
<!-- 指定hadoop運(yùn)行時產(chǎn)生文件的存儲路徑 -->
<property>
<name>hadoop.tmp.dir</name>
<value>/usr/local/hadoop/hadoop-2.9.1/hadoop</value>
</property>
</configuration>
hdfs-site.xml
<configuration>
<property>
<name>dfs.name.dir</name>
<value>/usr/local/hadoop/hdfs/name</value>
<description>namenode上存儲hdfs名字空間元數(shù)據(jù) </description>
</property>
<property>
<name>dfs.data.dir</name>
<value>/usr/local/hadoop/hdfs/data</value>
<description>datanode上數(shù)據(jù)塊的物理存儲位置</description>
</property>
<!-- 設(shè)置hdfs副本數(shù)量 -->
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
通過拷貝生成mapred-site.xml
cp mapred-site.xml.template mapred-site.xml
內(nèi)容如下:
<configuration>
<!-- 通知框架MR使用YARN -->
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
yarn-site.xml
<configuration>
<!-- reducer取數(shù)據(jù)的方式是mapreduce_shuffle -->
<property>
<name>yarn.acl.enable</name>
<value>0</value>
</property>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
<property>
<name>yarn.resourcemanager.hostname</name>
<value>hadoopmaster</value>
</property>
</configuration>
啟動hdfs
${HADOOOP_HOME}/sbin/start-dfs.sh
啟動yarn
${HADOOOP_HOME}/sbin/start-yarn.sh
檢查hadoop相關(guān)進(jìn)程啟動情況:
如果想要關(guān)閉hadoop進(jìn)程,可以執(zhí)行:
${HADOOOP_HOME}/sbin/stop-dfs.sh
${HADOOOP_HOME}/sbin/stop-yarn.sh
web中查看hadoop狀態(tài):http://outerIP:50070
web中查看集群中應(yīng)用程序狀態(tài):http://outerIP:8088