【版本】2.5.6企业版
【现象】两个FOLLOWER FE突然挂掉,且无法拉起
【fe.log】FOLLOWER日志:
2023-10-09 13:43:24,291 WARN (replayer|86) [GlobalStateMgr.replayJournalInner():1929] catch exception when replaying 157304691,
com.starrocks.journal.JournalInconsistentException: failed to load journal type 45
at com.starrocks.persist.EditLog.loadJournal(EditLog.java:948) ~[starrocks-fe.jar:?]
at com.starrocks.server.GlobalStateMgr.replayJournalInner(GlobalStateMgr.java:1918) [starrocks-fe.jar:?]
at com.starrocks.server.GlobalStateMgr$5.runOneCycle(GlobalStateMgr.java:1775) [starrocks-fe.jar:?]
at com.starrocks.common.util.Daemon.run(Daemon.java:115) [starrocks-fe.jar:?]
at com.starrocks.server.GlobalStateMgr$5.run(GlobalStateMgr.java:1840) [starrocks-fe.jar:?]
Caused by: java.lang.NullPointerException: table id: 292906 partition id: 8020272 index id: 292907 index id: 292907 tablet id: 8020279 backend id: 10009 replica id: 8020280 version: 83 vers
ion hash: 0 schema hash: 1233819161 data size: 0 row count: 0 last failed version: -1 last failed version hash: 0 last success version: 83 last success version hash: 0 min readable version:
-1
at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:899) ~[spark-dpp-1.0.0.jar:?]
at com.starrocks.server.LocalMetastore.unprotectUpdateReplica(LocalMetastore.java:2934) ~[starrocks-fe.jar:?]
at com.starrocks.server.LocalMetastore.replayUpdateReplica(LocalMetastore.java:2953) ~[starrocks-fe.jar:?]
at com.starrocks.server.GlobalStateMgr.replayUpdateReplica(GlobalStateMgr.java:2600) ~[starrocks-fe.jar:?]
at com.starrocks.persist.EditLog.loadJournal(EditLog.java:375) ~[starrocks-fe.jar:?]
… 4 more
2023-10-09 13:43:24,298 WARN (replayer|86) [GlobalStateMgr$5.runOneCycle():1778] got interrupt exception or inconsistent exception when replay journal 157304691, will exit,
com.starrocks.journal.JournalInconsistentException: failed to load journal type 45
at com.starrocks.persist.EditLog.loadJournal(EditLog.java:948) ~[starrocks-fe.jar:?]
at com.starrocks.server.GlobalStateMgr.replayJournalInner(GlobalStateMgr.java:1918) ~[starrocks-fe.jar:?]
at com.starrocks.server.GlobalStateMgr$5.runOneCycle(GlobalStateMgr.java:1775) [starrocks-fe.jar:?]
at com.starrocks.common.util.Daemon.run(Daemon.java:115) [starrocks-fe.jar:?]
at com.starrocks.server.GlobalStateMgr$5.run(GlobalStateMgr.java:1840) [starrocks-fe.jar:?]
Caused by: java.lang.NullPointerException: table id: 292906 partition id: 8020272 index id: 292907 index id: 292907 tablet id: 8020279 backend id: 10009 replica id: 8020280 version: 83 vers
ion hash: 0 schema hash: 1233819161 data size: 0 row count: 0 last failed version: -1 last failed version hash: 0 last success version: 83 last success version hash: 0 min readable version:
-1
at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:899) ~[spark-dpp-1.0.0.jar:?]
at com.starrocks.server.LocalMetastore.unprotectUpdateReplica(LocalMetastore.java:2934) ~[starrocks-fe.jar:?]
at com.starrocks.server.LocalMetastore.replayUpdateReplica(LocalMetastore.java:2953) ~[starrocks-fe.jar:?]
at com.starrocks.server.GlobalStateMgr.replayUpdateReplica(GlobalStateMgr.java:2600) ~[starrocks-fe.jar:?]
at com.starrocks.persist.EditLog.loadJournal(EditLog.java:375) ~[starrocks-fe.jar:?]
… 4 more
【详细日志】
fe.log.txt (789.7 KB)