Hadoop 2.5.2缓慢的响应能力

时间:2022-01-27 14:56:25

I've setup Hadoop 2.5.2 is pseudo distributed mode according the the documentation on a MacBook pro w/16GB ram currently running OS X 10.9.5.

根据目前运行OS X 10.9.5的MacBook pro w/16GB ram文件,我已经安装了Hadoop 2.5.2是伪分布式模式。

Everything seems to be working, but the issue is that it's extremely slow.

一切似乎都在起作用,但问题是它太慢了。

  • start-dfs.sh takes a 3-5 minutes to start namenodes
  • start-dfs。sh需要3-5分钟来启动namenodes。
  • hdfs dfs -ls / takes about a minute or two to run.
  • hdfs dfs -ls /运行大约需要一到两分钟。

I've ingested no data, and haven't done anything outside the documentation. I've tried export HADOOP_ROOT_LOGGER=DEBUG, but I'm not getting any extra output, or am seeing anything unusual in the logs. Perhaps I'm not looking at the right place.

我没有摄入任何数据,也没有在文档之外做任何事情。我尝试过导出hadoop - root_logger =DEBUG,但是我没有得到任何额外的输出,或者看到日志中有任何异常。也许我看错地方了。

The only error I ever see, which I've always seen with older versions of hadoop is:

我所见过的唯一错误,我在旧版本的hadoop上经常看到的错误是:

"Unable to load realm info from SCDynamicStore"

Any ideas on why everything might be so slow? Also, according to my Activity Monitor, I'm barely using any resources.

你知道为什么每件事都这么慢吗?而且,根据我的活动监视器,我几乎没有使用任何资源。

I just noticed this in my namenode log when I run the dfs -ls

我只是在运行dfs -ls时在namenode日志中注意到了这一点

2014-11-25 11:47:24,064 INFO org.apache.hadoop.hdfs.server.blockmanagement.CacheReplicationMonitor: Scanned 0 directive(s) and 0 block(s) in 1 millisecond(s).
2014-11-25 11:47:54,063 INFO org.apache.hadoop.hdfs.server.blockmanagement.CacheReplicationMonitor: Rescanning after 30000 milliseconds
2014-11-25 11:47:54,064 INFO org.apache.hadoop.hdfs.server.blockmanagement.CacheReplicationMonitor: Scanned 0 directive(s) and 0 block(s) in 1 millisecond(s).
2014-11-25 11:48:24,064 INFO org.apache.hadoop.hdfs.server.blockmanagement.CacheReplicationMonitor: Rescanning after 30001 milliseconds
2014-11-25 11:48:24,064 INFO org.apache.hadoop.hdfs.server.blockmanagement.CacheReplicationMonitor: Scanned 0 directive(s) and 0 block(s) in 0 millisecond(s).
2014-11-25 11:48:54,065 INFO org.apache.hadoop.hdfs.server.blockmanagement.CacheReplicationMonitor: Rescanning after 30001 milliseconds

Configurations

配置

core-site:

核心位点:

<configuration>
  <property>
        <name>fs.defaultFS</name>
        <value>hdfs://localhost:9000</value>
    </property>
</configuration>

hdfs-site:

hdfs-site:

<configuration>
   <property>
      <name>dfs.replication</name>
      <value>1</value>
   </property>
</configuration>

NameNode

NameNode

Hadoop 2.5.2缓慢的响应能力

Hadoop 2.5.2缓慢的响应能力

1 个解决方案

#1


0  

With nothing else to try, I decided to try removing the "Unable to load realm info from SCDynamicStore" warning. The solution to that was to set HADOOP_OPTS.

没有别的办法,我决定尝试删除“无法从SCDynamicStore加载领域信息”警告。解决方法是设置hadoop op_opts。

Although doing the following did NOT remove the warning, it for some reason FIXED my speed issue.

虽然这样做并没有删除警告,但它出于某些原因修正了我的速度问题。

export HADOOP_OPTS="-Djava.security.krb5.realm=OX.AC.UK
-Djava.security.krb5.kdc=kdc0.ox.ac.uk:kdc1.ox.ac.uk"

If anyone can explain why, I'll definitely accept that answer.

如果有人能解释为什么,我一定会接受这个答案。

#1


0  

With nothing else to try, I decided to try removing the "Unable to load realm info from SCDynamicStore" warning. The solution to that was to set HADOOP_OPTS.

没有别的办法,我决定尝试删除“无法从SCDynamicStore加载领域信息”警告。解决方法是设置hadoop op_opts。

Although doing the following did NOT remove the warning, it for some reason FIXED my speed issue.

虽然这样做并没有删除警告,但它出于某些原因修正了我的速度问题。

export HADOOP_OPTS="-Djava.security.krb5.realm=OX.AC.UK
-Djava.security.krb5.kdc=kdc0.ox.ac.uk:kdc1.ox.ac.uk"

If anyone can explain why, I'll definitely accept that answer.

如果有人能解释为什么,我一定会接受这个答案。