1)如果系統(tǒng)已經(jīng)安裝java遏匆,通過以下方式設(shè)置java相關(guān)環(huán)境變量:
$ ls -lrt /usr/bin/java
lrwxrwxrwx. 1 root root 22 Mar 20 14:49 /usr/bin/java -> /etc/alternatives/java
$ ls -lrt /etc/alternatives/java
lrwxrwxrwx. 1 root root 73 Mar 20 14:49 /etc/alternatives/java -> /usr/lib/jvm/java-1.8.0-openjdk-1.8.0.161-0.b14.el7_4.x86_64/jre/bin/java
設(shè)置java相關(guān)環(huán)境變量:
export JAVA_HOME=/usr/lib/jvm/java-1.8.0-openjdk-1.8.0.161-0.b14.el7_4.x86_64
export JRE_HOME=$JAVA_HOME/jre
export PATH=$JAVA_HOME/bin:$PATH
如果系統(tǒng)沒有安裝java运嗜,下載相應(yīng)java版本進(jìn)行安裝,然后設(shè)置java相關(guān)環(huán)境變量即可除盏。
2)設(shè)置ssh免登錄
$ ssh-keygen -t rsa -P '' -f ~/.ssh/id_rsa
$ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
$ chmod 0600 ~/.ssh/authorized_keys
3)以偽分布式模式安裝hadoop-2.7.6
從官方網(wǎng)站下載hadoop 2.7.6:https://mirrors.tuna.tsinghua.edu.cn/apache/hadoop/common/hadoop-2.7.6/hadoop-2.7.6.tar.gz
將hadoop-2.7.6.tar.gz文件拷貝到/usr/local目錄并解壓,然后設(shè)置相關(guān)的環(huán)境變量:
export HADOOP_HOME=/usr/local/hadoop-2.7.6
export PATH=$HADOOP_HOME/bin:$PATH
進(jìn)入/usr/local/hadoop-2.7.6/etc目錄味混,依次修改以下配置文件:
- core-site.xml
<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>
- hdfs-site.xml
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
格式化hdfs文件系統(tǒng):
$ hdfs namenode -format
啟動(dòng)hdfs:
$ /usr/local/hadoop-2.7.6/sbin/start-dfs.sh
在瀏覽器輸入:http://localhost:50070/
,可以看到NameNode的相關(guān)信息诫惭。
4)運(yùn)行示例程序
在hdfs上創(chuàng)建相應(yīng)目錄翁锡,并上傳相應(yīng)的文件到該目錄:
$ cat log
hello hadoop
hello hdfs
hello mapreduce
$ hdfs dfs -mkdir -p /user/root/input
$ hdfs dfs -put log input
$ hdfs dfs -ls /user/root/input
Found 1 items
-rw-r--r-- 1 root supergroup 40 2018-07-23 06:33 /user/root/input/log
運(yùn)行示例程序:
$ hadoop jar /usr/local/hadoop-2.7.6/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.6.jar grep input output 'he[a-z.]+'
查看運(yùn)行結(jié)果:
$ hdfs dfs -cat output/*
3 hello