3.1.17: after a BE crash, a single table shows version_error and queries fail with "Build Exec OlapScanNode fail, scan info is invalid"

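To help locate your problem faster, please provide the following information. Thanks.
【Details】
1. A BE node crashed unexpectedly while its CPU and memory monitoring metrics were normal. be.out contains the following log: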
*** Aborted at 1760244562 (unix time) try "date -d @1760244562" if you are using GNU date ***
PC: @ 0x32ac300 starrocks::PUniqueId::~PUniqueId()
*** SIGSEGV (@0x0) received by PID 165029 (TID 0x7f4d253fc700) from PID 0; stack trace: ***
@ 0x688b642 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7f5069f205f0 (unknown)
@ 0x32ac300 starrocks::PUniqueId::~PUniqueId()
@ 0x3368cef starrocks::RowsetMetaPB::~RowsetMetaPB()
@ 0x3368db2 starrocks::RowsetMetaPB::~RowsetMetaPB()
@ 0x2c1ae6a std::_Sp_counted_base<>::_M_release()
@ 0x598a82c starrocks::Rowset::~Rowset()
@ 0x5207a6a starrocks::TabletUpdates::_remove_unused_rowsets()
@ 0x51c19b0 starrocks::TabletManager::start_trash_sweep()
@ 0x517054c starrocks::StorageEngine::_start_trash_sweep()
@ 0x5100130 starrocks::StorageEngine::_garbage_sweeper_thread_callback()
@ 0x8cca900 execute_native_thread_routine
@ 0x7f5069f18e65 start_thread
@ 0x7f506931a88d __clone
@ 0x0 (unknown)
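
To line the crash up with the be.info entries, the epoch value in the "Aborted at" line can be converted to wall-clock time. A minimal Python sketch, equivalent to the `date -d` hint in the log; the UTC+8 offset is an assumption about the server's time zone, not something stated in the post:

```python
from datetime import datetime, timedelta, timezone

# Epoch seconds copied from the "*** Aborted at ..." line in be.out.
ABORT_EPOCH = 1760244562

# Interpret the epoch as UTC first, then shift to a local zone.
utc_time = datetime.fromtimestamp(ABORT_EPOCH, tz=timezone.utc)

# Assumption: the cluster logs in UTC+8; adjust to the real server zone.
local_time = utc_time.astimezone(timezone(timedelta(hours=8)))

print(utc_time.isoformat())    # 2025-10-12T04:49:22+00:00
print(local_time.isoformat())  # 2025-10-12T12:49:22+08:00
```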

2. The last entry in be.info before the crash:
task_worker_pool.cpp:581] batch submit 1 finish publish version task txn publish task(s). #dir:0 flush:3ms

3. After the BE node was restarted, queries on a single table fail with:
Build Exec OlapScanNode fail, scan info is invalid

4. Tried using ADMIN SET REPLICA STATUS to mark the abnormal tablet as bad; this did not resolve the problem.
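
For reference, the statement in step 4 takes a tablet ID, a backend ID, and a status. A small Python sketch that only formats the statement text is below. The helper name `build_set_replica_status` is hypothetical, and the IDs in the usage line are taken from the error message in this post purely for illustration; which replica was actually marked is not stated here.

```python
def build_set_replica_status(tablet_id: int, backend_id: int, status: str = "bad") -> str:
    """Format an ADMIN SET REPLICA STATUS statement (hypothetical helper).

    It only builds the SQL text; it does not connect to or modify a cluster.
    """
    if status not in ("bad", "ok"):
        raise ValueError("status must be 'bad' or 'ok'")
    return (
        "ADMIN SET REPLICA STATUS PROPERTIES "
        f'("tablet_id" = "{tablet_id}", "backend_id" = "{backend_id}", '
        f'"status" = "{status}");'
    )


# Illustrative IDs only: tablet 465744497 and backend 10004 both appear
# in the "Caused by" line of the query error.
statement = build_set_replica_status(465744497, 10004)
print(statement)
```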

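【Background】After the BE failure, the BE node was started manually.
【Business impact】
【Shared-data (storage-compute separation)】No
【StarRocks version】3.1.17
【Cluster size】3 FE (2 followers) + 6 BE (FE and BE co-located)
【Machine info】64C/380G/10 GbE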

  • Screenshot of BE node CPU and memory usage
  • Query error:
    com.starrocks.sql.common.StarRocksPlannerException: Build Exec OlapScanNode fail, scan info is invalid
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.visitPhysicalOlapScan(PlanFragmentBuilder.java:815)
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.visitPhysicalOlapScan(PlanFragmentBuilder.java:412)
    at com.starrocks.sql.optimizer.operator.physical.PhysicalOlapScanOperator.accept(PhysicalOlapScanOperator.java:169)
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.visit(PlanFragmentBuilder.java:456)
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.visitPhysicalDistribution(PlanFragmentBuilder.java:1998)
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.visitPhysicalDistribution(PlanFragmentBuilder.java:412)
    at com.starrocks.sql.optimizer.operator.physical.PhysicalDistributionOperator.accept(PhysicalDistributionOperator.java:63)
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.visit(PlanFragmentBuilder.java:456)
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.visitPhysicalLimit(PlanFragmentBuilder.java:2801)
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.visitPhysicalLimit(PlanFragmentBuilder.java:412)
    at com.starrocks.sql.optimizer.operator.physical.PhysicalLimitOperator.accept(PhysicalLimitOperator.java:54)
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.visit(PlanFragmentBuilder.java:456)
    at com.starrocks.sql.plan.PlanFragmentBuilder$PhysicalPlanTranslator.translate(PlanFragmentBuilder.java:423)
    at com.starrocks.sql.plan.PlanFragmentBuilder.createPhysicalPlan(PlanFragmentBuilder.java:219)
    at com.starrocks.sql.StatementPlanner.createQueryPlan(StatementPlanner.java:178)
    at com.starrocks.sql.StatementPlanner.plan(StatementPlanner.java:100)
    at com.starrocks.sql.StatementPlanner.plan(StatementPlanner.java:63)
    at com.starrocks.qe.StmtExecutor.execute(StmtExecutor.java:484)
    at com.starrocks.qe.ConnectProcessor.handleQuery(ConnectProcessor.java:391)
    at com.starrocks.qe.ConnectProcessor.dispatch(ConnectProcessor.java:497)
    at com.starrocks.qe.ConnectProcessor.processOnce(ConnectProcessor.java:798)
    at com.starrocks.mysql.nio.ReadListener.lambda$handleEvent$0(ReadListener.java:69)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)
    Caused by: com.starrocks.common.UserException: Failed to get scan range, no queryable replica found in tablet=465744497 replica=12160:1399/-1/1399/1263:NORMAL:ALIVE,12159:1397/-1/1397/1281:NORMAL:ALIVE,12161:1398/-1/1398/1295:NORMAL:ALIVE,10004:1362/1400/1362/1362:NORMAL:ALIVE, schema_hash=13585366 version=1129
    at com.starrocks.planner.OlapScanNode.addScanRangeLocations(OlapScanNode.java:488)
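
The replica list in the "Caused by" line is dense, so here is a minimal Python sketch that splits it into named fields. The field names are an assumption and are not given by the message itself: each entry is read as backend_id:version/last_failed_version/last_success_version/min_readable_version:state:backend_state. The flags it prints are plain comparisons, not StarRocks's actual replica-selection logic.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ReplicaInfo:
    # Field names are assumed; the error message does not label them.
    backend_id: int
    version: int
    last_failed_version: int
    last_success_version: int
    min_readable_version: int
    state: str
    backend_state: str


def parse_replica_infos(text: str) -> List[ReplicaInfo]:
    """Split 'id:v/lfv/lsv/mrv:STATE:ALIVE,...' into ReplicaInfo records."""
    replicas = []
    for item in text.split(","):
        item = item.strip()
        if not item:  # tolerate the trailing comma in the message
            continue
        backend_id, versions, state, backend_state = item.split(":")
        version, last_failed, last_success, min_readable = (
            int(v) for v in versions.split("/")
        )
        replicas.append(ReplicaInfo(int(backend_id), version, last_failed,
                                    last_success, min_readable, state,
                                    backend_state))
    return replicas


# Copied from the "Caused by" line above.
REPLICA_INFOS = (
    "12160:1399/-1/1399/1263:NORMAL:ALIVE,"
    "12159:1397/-1/1397/1281:NORMAL:ALIVE,"
    "12161:1398/-1/1398/1295:NORMAL:ALIVE,"
    "10004:1362/1400/1362/1362:NORMAL:ALIVE,"
)
QUERY_VERSION = 1129  # the "version=1129" at the end of the message

replicas = parse_replica_infos(REPLICA_INFOS)
for r in replicas:
    print(
        f"backend={r.backend_id} version={r.version} "
        f"min_readable={r.min_readable_version} "
        f"min_readable>query_version={r.min_readable_version > QUERY_VERSION} "
        f"has_failed_version={r.last_failed_version >= 0}"
    )
```

Under that reading of the fields, the fourth number of every replica is larger than version=1129, and only backend 10004 carries a non-negative second number (1400).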