各位好,请教一下,现在部署了三节点的hadoop ha集群(hadoop version 2.7.1, OS:Centos 7.3),一切测试正常,然后开始集成kerberos,为zookeeper做完配置后,重启hdfs,zkfc服务无法正常启动,报错如下:
2018-02-01 11:26:40,158 INFO org.apache.hadoop.ha.ZKFailoverController: Local service NameNode at node1/192.168.10.187:9000 entered state: SERVICE_HEALTHY
2018-02-01 11:26:40,198 FATAL org.apache.hadoop.ha.ActiveStandbyElector: Received create error from Zookeeper. code:NOAUTH for path /hadoop-ha/myhdfs/ActiveStandbyElectorLock
2018-02-01 11:26:40,220 INFO org.apache.zookeeper.ZooKeeper: Session: 0x2614b837f710000 closed
2018-02-01 11:26:40,220 FATAL org.apache.hadoop.ha.ZKFailoverController: Fatal error occurred:Received create error from Zookeeper. code:NOAUTH for path /hadoop-ha/myhdfs/ActiveStandbyElectorLock
2018-02-01 11:26:40,220 INFO org.apache.zookeeper.ClientCnxn: EventThread shut down
2018-02-01 11:26:40,220 INFO org.apache.hadoop.ipc.Server: Stopping server on 8019
2018-02-01 11:26:40,222 INFO org.apache.hadoop.ha.ActiveStandbyElector: Yielding from election
2018-02-01 11:26:40,222 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server listener on 8019
2018-02-01 11:26:40,222 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server Responder
2018-02-01 11:26:40,222 INFO org.apache.hadoop.ha.HealthMonitor: Stopping HealthMonitor thread
2018-02-01 11:26:40,224 FATAL org.apache.hadoop.hdfs.tools.DFSZKFailoverController: Got a fatal error, exiting now
java.lang.RuntimeException: ZK Failover Controller failed: Received create error from Zookeeper. code:NOAUTH for path /hadoop-ha/myhdfs/ActiveStandbyElectorLock
at org.apache.hadoop.ha.ZKFailoverController.mainLoop(ZKFailoverController.java:369)
at org.apache.hadoop.ha.ZKFailoverController.doRun(ZKFailoverController.java:238)
at org.apache.hadoop.ha.ZKFailoverController.access$000(ZKFailoverController.java:61)
at org.apache.hadoop.ha.ZKFailoverController$1.run(ZKFailoverController.java:172)
at org.apache.hadoop.ha.ZKFailoverController$1.run(ZKFailoverController.java:168)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:360)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1637)
at org.apache.hadoop.security.SecurityUtil.doAsLoginUserOrFatal(SecurityUtil.java:413)
at org.apache.hadoop.ha.ZKFailoverController.run(ZKFailoverController.java:168)
at org.apache.hadoop.hdfs.tools.DFSZKFailoverController.main(DFSZKFailoverController.java:181)
登录进入zookeeper看了一眼,报错中所说的节点并不存在:
[zk: node1:2181(CONNECTED) 10] ls /hadoop-ha/myhdfs
[]
[zk: node1:2181(CONNECTED) 11] getAcl /hadoop-ha/myhdfs
'sasl,'nn
: cdrwa
[zk: node1:2181(CONNECTED) 12]
目前两个namenode服务都是standby状态,无法提供服务,请问是什么原因呢?
|