BE节点不能正常启动

BE.OUT日志报错如下,请问有人知道是什么原因吗 :pray: :pray: :pray:

query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
*** Aborted at 1686294170 (unix time) try “date -d @1686294170” if you are using GNU date ***
PC: @ 0x24d2f7b starrocks::Rowset::do_load()
*** SIGSEGV (@0x8) received by PID 244327 (TID 0x7facab9fe700) from PID 8; stack trace: ***
@ 0x481e332 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7facdbc62630 (unknown)
@ 0x24d2f7b starrocks::Rowset::do_load()
@ 0x24d35cf starrocks::Rowset::load()
@ 0x24d3966 starrocks::Rowset::get_segment_iterators2()
@ 0x20334ec starrocks::RowsetUpdateState::_do_load()
@ 0x2034f78 _ZZSt9call_onceIZN9starrocks17RowsetUpdateState4loadEPNS0_6TabletEPNS0_6RowsetEEUlvE_JEEvRSt9once_flagOT_DpOT0_ENUlvE0_4_FUNEv
@ 0x7facdbc5920b __pthread_once_slow
@ 0x202fd63 starrocks::RowsetUpdateState::load()
@ 0x1e6da98 starrocks::TabletUpdates::_apply_rowset_commit()
@ 0x1e73bb3 starrocks::TabletUpdates::do_apply()
@ 0x2680af5 starrocks::ThreadPool::dispatch_thread()
@ 0x267bf2a starrocks::thread::supervise_thread()
@ 0x7facdbc5aea5 start_thread
@ 0x7facdb27596d __clone
@ 0x0 (unknown)

哪个版本?另外看下be.INFO没启动的最新日志

版本是2.1.2。以下是be.INFO的最新启动日志。三个节点,两个正常,一个不能启动。

I0612 09:23:26.856721 254225 daemon.cpp:280] version 2.4.0 RELEASE (build c0fa2bb)
Built on 2022-10-20 15:08:05 by StarRocks@docker
I0612 09:23:26.860005 254225 mem_info.cpp:74] Physical Memory: 1510.27 GB
I0612 09:23:26.860016 254225 daemon.cpp:286] Cpu Info:
Model: Intel® Xeon® Gold 5218 CPU @ 2.30GHz
Cores: 64
Max Possible Cores: 64
L1 Cache: 32.00 KB (Line: 64.00 B)
L2 Cache: 1.00 MB (Line: 64.00 B)
L3 Cache: 22.00 MB (Line: 64.00 B)
Hardware Supports:
ssse3
sse4_1
sse4_2
popcnt
avx
avx2
Numa Nodes: 2
Numa Nodes of Cores: 0->0 | 1->1 | 2->0 | 3->1 | 4->0 | 5->1 | 6->0 | 7->1 | 8->0 | 9->1 | 10->0 | 11->1 | 12->0 | 13->1 | 14->0 | 15->1 | 16->0 | 17->1 | 18->0 | 19->1 | 20->0 | 21->1 | 22->0 | 23->1 | 24->0 | 25->1 | 26->0 | 27->1 | 28->0 | 29->1 | 30->0 | 31->1 | 32->0 | 33->1 | 34->0 | 35->1 | 36->0 | 37->1 | 38->0 | 39->1 | 40->0 | 41->1 | 42->0 | 43->1 | 44->0 | 45->1 | 46->0 | 47->1 | 48->0 | 49->1 | 50->0 | 51->1 | 52->0 | 53->1 | 54->0 | 55->1 | 56->0 | 57->1 | 58->0 | 59->1 | 60->0 | 61->1 | 62->0 | 63->1 |
I0612 09:23:26.860038 254225 daemon.cpp:287] Disk Info:
Num disks 5: sda, sr, sdb, sdc, dm-
I0612 09:23:26.860042 254225 daemon.cpp:288] Mem Info: 1510.27 GB
W0612 09:23:26.860589 254225 user_function_cache.cpp:178] load a library failed, dir=/opt/starRocks/StarRocks-2.1.2/be/lib/udf/84, file=-86612.eae0866e4c06f372b1c695aeb787c718.jar.tmp
I0612 09:23:26.957834 254225 daemon.cpp:263] Minidump is disabled
I0612 09:23:26.957866 254225 backend_options.cpp:100] priority cidrs in conf: 172.16.4.0/24
I0612 09:23:26.958282 254225 backend_options.cpp:77] local host172.16.4.2
I0612 09:23:26.961477 254271 daemon.cpp:191] Current memory statistics: process(28648880), query_pool(0), load(0), metadata(0), compaction(0), schema_change(0), column_pool(0), page_cache(0), update(0), chunk_allocator(0), clone(0), consistency(0)
I0612 09:23:26.964529 254274 data_dir.cpp:121] path: /data/starrocks/storage, hash: 5548113219158243971
I0612 09:23:27.039988 254341 data_dir.cpp:230] start to load tablets from /data/starrocks/storage
I0612 09:23:27.040016 254341 data_dir.cpp:236] begin loading rowset from meta
I0612 09:23:27.251251 254341 data_dir.cpp:254] load rowset from meta finished, data dir: /data/starrocks/storage
I0612 09:23:27.251267 254341 data_dir.cpp:259] begin loading tablet from meta

在不能启动的BE上执行ulimit -c unlimited,然后再启动一次,生成个core文件看下

core_uses_pid设为了1,core_pattern里设置了路径。然后启动了一次发现没有生成core文件,怎么才能把启动的这个生成core文件。以前没有操作过core文件的生成。

你执行ulimit -c看下结果,如果不是unlimited,执行下ulimit -c unlimited

执行过了,没生成core文件。昨天晚上运维的给集群升级了,好了。具体啥问题好像没诊断出来