在Linux系统下配置Hadoop分布式文件系统(HDFS)是一个相对复杂的过程,涉及多个步骤。以下是详细的配置步骤:
apt-get
或yum
)或从Oracle官网下载安装。~/.bashrc
文件:添加Hadoop的bin
目录到系统的PATH
环境变量中,方便全局调用Hadoop命令。export HADOOP_HOME=/path/to/your/hadoop
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
source ~/.bashrc
$HADOOP_HOME/etc/hadoop/core-site.xml
:<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://namenode:9000</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/path/to/hadoop/tmp</value>
</property>
</configuration>
$HADOOP_HOME/etc/hadoop/hdfs-site.xml
:<configuration>
<property>
<name>dfs.namenode.name.dir</name>
<value>/path/to/namenode/dir</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>/path/to/datanode/dir</value>
</property>
<property>
<name>dfs.replication</name>
<value>3</value>
</property>
</configuration>
hdfs namenode -format
start-dfs.sh
jps
命令查看Java进程,或访问NameNode的Web界面(默认端口50070)查看集群状态。ssh-keygen -t rsa
ssh-copy-id slave1
ssh-copy-id slave2
ssh-copy-id slave3
ssh-copy-id slave4
ssh-copy-id slave5
ssh slave1
dfs.namenode.fgl.enable true
辰迅云「云服务器」,即开即用、新一代英特尔至强铂金CPU、三副本存储NVMe SSD云盘,价格低至29元/月。点击查看>>
推荐阅读: LINUX主机怎么远程登录