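StarRocks 2.5.2, three-node cluster, with BE memory configured to 40 GB.
The cluster was just deployed and each node holds only a few hundred MB of data, yet one node keeps crashing. The error output is as follows:
Error in be.out: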
terminate called after throwing an instance of 'std::runtime_error'
what(): random_device: rdrand failed
2.5.2 RELEASE (build )
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 2394372288
tracker:query_pool consumption: 350000
tracker:load consumption: 0
tracker:metadata consumption: 14333501
tracker:tablet_metadata consumption: 944906
tracker:rowset_metadata consumption: 7305336
tracker:segment_metadata consumption: 799832
tracker:column_metadata consumption: 5283427
tracker:tablet_schema consumption: 224330
tracker:segment_zonemap consumption: 416630
tracker:short_key_index consumption: 1977
tracker:column_zonemap_index consumption: 723539
tracker:ordinal_index consumption: 1788464
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 738275820
tracker:page_cache consumption: 33144320
tracker:update consumption: 109396
tracker:chunk_allocator consumption: 1139663656
tracker:clone consumption: 0
tracker:consistency consumption: 0
*** Aborted at 1698141744 (unix time) try "date -d @1698141744" if you are using GNU date ***
PC: @ 0x7fe80237adff (unknown)
*** SIGABRT (@0x1314f) received by PID 78159 (TID 0x7fe5e33c7640) from PID 78159; stack trace: ***
@ 0x583b0e2 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7fe80232eed0 (unknown)
@ 0x7fe80237adff (unknown)
@ 0x7fe80232ee26 raise
@ 0x7fe80231a457 abort
@ 0x2a8c04e _ZN9__gnu_cxx27__verbose_terminate_handlerEv.cold
@ 0x7cf12f6 __cxxabiv1::__terminate()
@ 0x7d993e9 __cxa_call_terminate
@ 0x7cf0d11 __gxx_personality_v0
@ 0x7d9ff8e _Unwind_RaiseException_Phase2
@ 0x7da0a86 _Unwind_Resume
@ 0x298745a _ZN4brpc6policy17ProcessRpcRequestEPNS_16InputMessageBaseE.cold
@ 0x59c6147 brpc::ProcessInputMessage()
@ 0x59c701b brpc::InputMessenger::OnNewMessages()
@ 0x59b65ae brpc::Socket::ProcessEvent()
@ 0x59892ff bthread::TaskGroup::task_runner()
@ 0x5ad10b1 bthread_make_fcontext
Errors in be.WARNING:
W1026 17:15:17.050866 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:15:25.097569 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:15:27.074007 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:15:35.448155 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:15:43.574595 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:16:02.080888 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:16:11.612802 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:16:12.782581 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:16:19.719478 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:16:38.041903 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:16:46.132165 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:16:56.642777 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:16:56.646562 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:17:12.931808 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:17:21.208366 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:17:37.702744 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:17:42.296146 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:17:55.108198 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:18:03.203058 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:18:03.374619 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:18:11.374922 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:18:27.614571 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:18:35.774806 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:18:39.932246 17127 exec_state_reporter.cpp:129] Retrying ReportExecStatus: No more data to read.
W1026 17:18:43.655020 18136 stack_util.cpp:128] 2023-10-26 17:18:43.654961, query_id=00000000-0000-0000-0000-000000000000, fragment_instance_id=00000000-0000-0000-0000-000000000000 throws exception: std::runtime_error, trace:
@ 0x2a8de29 std::__throw_runtime_error()
@ 0x7d5d31e std::(anonymous namespace)::__x86_rdrand()
@ 0x7d5d45e std::(anonymous namespace)::__x86_rdseed_rdrand()
@ 0x4ee12d2 starrocks::pipeline::ExchangeSinkOperator::prepare()
@ 0x2cb10a4 starrocks::pipeline::PipelineDriver::prepare()
@ 0x4eb6c37 starrocks::pipeline::FragmentExecutor::execute()
@ 0x5153bf0 starrocks::PInternalServiceImplBase<>::_exec_plan_fragment_by_pipeline()
@ 0x515599d starrocks::PInternalServiceImplBase<>::_exec_batch_plan_fragments()
@ 0x51597b9 starrocks::PInternalServiceImplBase<>::exec_batch_plan_fragments()
@ 0x5a8399d brpc::policy::ProcessRpcRequest()
@ 0x59c6147 brpc::ProcessInputMessage()
@ 0x59c701b brpc::InputMessenger::OnNewMessages()
@ 0x59b65ae brpc::Socket::ProcessEvent()
@ 0x59892ff bthread::TaskGroup::task_runner()
@ 0x5ad10b1 bthread_make_fcontext
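The trace above shows the std::runtime_error coming out of libstdc++'s rdrand path while ExchangeSinkOperator::prepare() runs. To check whether std::random_device also fails on this host outside of StarRocks, a minimal sketch like the following could be compiled and run on the affected node. The helper name probe_random_device and the ProbeResult struct are made up for illustration and are not part of StarRocks. A clean run would not fully rule out the hardware, since the BE binary may be built against a different libstdc++ than the host compiler.

```cpp
#include <exception>
#include <random>
#include <string>

// Result of probing std::random_device on the current host.
struct ProbeResult {
    bool ok;            // true if every draw succeeded
    int drawn;          // number of values drawn before any failure
    std::string error;  // what() of the caught exception, empty on success
};

// Hypothetical helper (not a StarRocks API): constructs a default
// std::random_device and draws `samples` values from it, catching any
// exception instead of letting it terminate the process.
ProbeResult probe_random_device(int samples) {
    ProbeResult result{true, 0, ""};
    try {
        std::random_device rd;  // construction itself may throw
        for (int i = 0; i < samples; ++i) {
            (void)rd();         // each draw may throw std::runtime_error
            ++result.drawn;
        }
    } catch (const std::exception& e) {
        result.ok = false;
        result.error = e.what();
    }
    return result;
}
```

Calling probe_random_device with a large sample count from a small main() and printing the error field when ok is false should show whether the same "random_device: rdrand failed" message can be reproduced on that node.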
Could anyone help figure out what the problem is?