【详述】当前sr集群1fe+3be,现在要部署fe的高可用,在新的fe节点hadoop2上执行/opt/service/starrocks/starrocks-3.0.3_fe/fe/bin/start_fe.sh --helper hadoop1:9010 --daemon启动hadoop2上的fe节点,启动后查看日志,报错信息如下:2023-08-17 15:38:44,875 INFO (main|1) [StarRocksFE.start():124] StarRocks FE starting, version: 3.0.3-fe5e3a1
2023-08-17 15:38:44,884 INFO (main|1) [FrontendOptions.analyzePriorityCidrs():299] configured prior_cidrs value: 192.168.188.0/24
2023-08-17 15:38:44,890 INFO (main|1) [FrontendOptions.initAddrUseIp():249] Use IP init local addr, IP: /hadoop2
2023-08-17 15:38:45,239 INFO (main|1) [Auth.grantRoleInternal():835] grant operator to ‘root’@’%’, isReplay = true
2023-08-17 15:38:45,271 INFO (main|1) [AuthorizationMgr.initBuiltinRoleUnlocked():277] create built-in role root[-1]
2023-08-17 15:38:45,277 INFO (main|1) [AuthorizationMgr.initBuiltinRoleUnlocked():277] create built-in role db_admin[-2]
2023-08-17 15:38:45,278 INFO (main|1) [AuthorizationMgr.initBuiltinRoleUnlocked():277] create built-in role cluster_admin[-3]
2023-08-17 15:38:45,278 INFO (main|1) [AuthorizationMgr.initBuiltinRoleUnlocked():277] create built-in role user_admin[-4]
2023-08-17 15:38:45,278 INFO (main|1) [AuthorizationMgr.initBuiltinRoleUnlocked():277] create built-in role public[-5]
2023-08-17 15:38:45,278 INFO (main|1) [GlobalStateMgr.initAuth():1043] using new privilege framework…
2023-08-17 15:38:45,490 INFO (main|1) [NodeMgr.getHelperNodes():656] get helper nodes: [hadoop1:9010]
2023-08-17 15:38:45,522 WARN (main|1) [NodeMgr.getFeNodeTypeAndNameFromHelpers():473] failed to get fe node type from helper node: hadoop1:9010. response code: 400
2023-08-17 15:38:45,523 WARN (main|1) [NodeMgr.getClusterIdAndRoleOnStartup():301] current node is not added to the group. please add it first. sleep 5 seconds and retry, current helper nodes: [hadoop1:9010]
2023-08-17 15:38:50,526 WARN (main|1) [NodeMgr.getFeNodeTypeAndNameFromHelpers():473] failed to get fe node type from helper node: hadoop1:9010. response code: 400
2023-08-17 15:38:50,527 WARN (main|1) [NodeMgr.getClusterIdAndRoleOnStartup():301] current node is not added to the group. please add it first. sleep 5 seconds and retry, current helper nodes: [hadoop1:9010]
2023-08-17 15:38:55,530 WARN (main|1) [NodeMgr.getFeNodeTypeAndNameFromHelpers():473] failed to get fe node type from helper node: hadoop1:9010. response code: 400
2023-08-17 15:38:55,531 WARN (main|1) [NodeMgr.getClusterIdAndRoleOnStartup():301] current node is not added to the group. please add it first. sleep 5 seconds and retry, current helper nodes: [hadoop1:9010]
2023-08-17 15:39:00,535 WARN (main|1) [NodeMgr.getFeNodeTypeAndNameFromHelpers():473] failed to get fe node type from helper node: hadoop1:9010. response code: 400
2023-08-17 15:39:00,535 WARN (main|1) [NodeMgr.getClusterIdAndRoleOnStartup():301] current node is not added to the group. please add it first. sleep 5 seconds and retry, current helper nodes: [hadoop1:9010]
2023-08-17 15:39:05,540 WARN (main|1) [NodeMgr.getFeNodeTypeAndNameFromHelpers():473] failed to get fe node type from helper node: hadoop1:9010. response code: 400
2023-08-17 15:39:05,540 WARN (main|1) [NodeMgr.getClusterIdAndRoleOnStartup():301] current node is not added to the group. please add it first. sleep 5 seconds and retry, current helper nodes: [hadoop1:9010]
2023-08-17 15:39:10,544 WARN (main|1) [NodeMgr.getFeNodeTypeAndNameFromHelpers():473] failed to get fe node type from helper node: hadoop1:9010. response code: 400
2023-08-17 15:39:10,545 WARN (main|1) [NodeMgr.getClusterIdAndRoleOnStartup():301] current node is not added to the group. please add it first. sleep 5 seconds and retry, current helper nodes: [hadoop1:9010]
已经看过https://forum.mirrorship.cn/t/topic/2838/8和https://forum.mirrorship.cn/t/topic/869/3帖子,排除新fe节点与leader fe节点版本不一致的问题,排除新fe节点和leader fe节点的端口不一致。此外,很好奇报错你说的current node is not added to the group是什么group
【背景】1.用leader fe节点的fe安装路径分发到hadoop2的服务器上,删掉log目录下的文件,手动创建元数据路径;也用解压出来的fe目录,然后修改fe.conf配置文件元数据路径
2.半个月前搭建1fe+3be,集群正常运行半个月后计划扩展为3fe+3be
【业务影响】测试集群补充fe高可用部署,暂无影响
【StarRocks版本】3.0.3
【集群规模】1fe+3be,计划扩展为3fe+3be,3台服务器,每台都部署fe和be
【机器信息】CPU虚拟核/内存/网卡,16C/32G/万兆
【联系方式】flinkxabc@yeah.net,或者社区群【StarRocks社区群12】满天星辰
【附件】
我自己稀里糊涂解决了
2赞
这个看上述的信息怀疑是没有加入集群吧 解决了就好