分享

请教大家, impala catalog server老是连不上statestore server是怎么回事~~~~

bluebluebeebee 发表于 2016-8-5 09:16:08 [显示全部楼层] 回帖奖励 阅读模式 关闭右栏 4 31200
大家好,非常新手,搞了六台机器跑cloudera,边搞边做实验。现在impala遇到些问题, 我已经卡在这里一周了。请大家帮我看看啊~~~~先谢谢了!!

问题就是impala的所有服务起来后, 很快catalog server就有bad health的报错, 进去看,是连不上state store server。restart后很快就报同样的错。log也里看不出什么名堂来。。。然后想在impala shell里运行点语句,就会显示 ERROR: AnalysisException: This Impala daemon is not ready to accept user requests. Status: Waiting for catalog update from the StateStore.。


最开始我的impala catalog, statestore以及一个impala daemon是装在和resource manager同一个机器上,做了点研究发现这样会导致些问题,就把这几个impala的东东搬到其它机子上了,然后还是同样的报错。


更有趣的是,昨天凌晨两点和今天的凌晨七点日志里显示catalog server有一段时间连上过statestore。然后早上我去办公室又蜜汁down掉了。。要疯了啊~~~~


版本信息:
OS: centos 6.7,
cloudera: 5.7.1 express
impala:2.5.0
CM上已经装了hdfs, yarn, hive, zookeeper, impala。之前还跑过些sqoop。


报错几乎和这个链接里的差不多,只是CM版本不同。  好像也有其它人遇到同样的问题,但是木有解答。。。也不知道是不是known issue。
https://community.cloudera.com/t5/Cloudera-Manager-Installation/This-Catalog-Server-is-not-connected-to-its-StateStore/td-p/25668 (怕打不开,我贴下面了)
This Catalog Server is not connected to its StateStore.


Options









03-17-2015 03:54 PM

Hello,

One of our clients is running Cloudera Manager Express 5.3.2 and CDH 5.3.2.
There is a strange problem with the Impala Catalog Server.
It has Bad Health in Cloudera Manager which shows the following messages:

StateStore Connectivity Bad
This Catalog Server is not connected to its StateStore.

I can't find anything useful in the log files of the Catalog Server or the StateStore.
Also everything seems to be working properly and the metadata changes are relayed successfully (I performed several tests using 2 different Data Nodes).

In the StateStore Web UI I can see the Catalog Server in the list of the subscribers.

Id: catalog-server@hostname:26000
Address: hostname:23020
Subscribed topics: 1
Transient entries: 0

If I stop the Catalog Server this entry disappears so it is not a stale one.
I even tried to delete the whole Impala Service and install it again using different node for the StateStore and the Catalog Server. However nothing seems to be helping.

Kerberos and SSL are not used in the cluster.

I suspect that there is only a problem with the Cloudera Manager health check but I have no idea how to troubleshoot further
Please let me know if you have any suggestions.

Best Regards,
Cyril



已有(4)人评论

跳转到指定楼层
nextuser 发表于 2016-8-5 12:42:52
这个问题原因挺多的:可能原因如下:


1.在/etc/hosts中没有配置 127.0.0.1   localhost 导致impala启动statestore和catalog出现错误

2.版本问题
3.如果还解决不了的话,仔细定位下日志吧


回复

使用道具 举报

bluebluebeebee 发表于 2016-8-5 23:28:05
nextuser 发表于 2016-8-5 12:42
这个问题原因挺多的:可能原因如下:

谢谢你, /etc/hosts我的配置如下,第一行是不是有啥问题哇(努力学习linux中)?
127.0.0.1 localhost.localdomain localhost192.168.1.1 cluster-01.example.com cluster-01192.168.1.2 cluster-02.example.com cluster-02192.168.1.3 cluster-03.example.com cluster-03




然后您也提到可能是版本不对,我查了好像之前cloudera4.x和impala1.x会有些问题,但那些解决办法也都是升级到cloudera5.x呀。
log我会再仔细定位一下,偶也是几乎这一周都在和log死磕啊。。。





回复

使用道具 举报

nextuser 发表于 2016-8-9 13:33:12
bluebluebeebee 发表于 2016-8-5 23:28
谢谢你, /etc/hosts我的配置如下,第一行是不是有啥问题哇(努力学习linux中)?
127.0.0.1 localhost. ...

hosts最好上图,如果Linux也是这种不换行的话,出问题是肯定的
回复

使用道具 举报

bluebluebeebee 发表于 2016-8-11 10:13:56
nextuser 发表于 2016-8-9 13:33
hosts最好上图,如果Linux也是这种不换行的话,出问题是肯定的

谢谢,我已经解决啦,重新配置了一次hive,然后就OK了。 然后我并不明白这是为什么。。。偶也问了cloudera community那个贴里跟我有同样问题的朋友,然后大家都很懵,没有解决。。。太晕了。
回复

使用道具 举报

您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

关闭

推荐上一条 /2 下一条