-
BE stream load 导入死锁
admin execute on 10004 ‘System.print(ExecEnv.get_stack_trace_for_all_threads())’; 看有下面典型的堆栈: (10004是BE_ID, show backends 可以看到)
48 tids: 495840,495841,495842,495843,495844,495845,495846,495847,495848,495849,495850,495851,495852,495853,495854,495855,495856,495857,495858,495859,495860,495861,495862,495863,495864,495865,495866,495867,495868,495869,495870,495871,495872,495873,495874,495875,495876,495877,495878,495879,495880,495881,495882,495883,495884,495885,495886,495887
0x7fddf41edaf7 syscall
0x91db983 std::__atomic_futex_unsigned_base::_M_futex_wait_until()
0x531efa0 starrocks::StreamLoadAction::_handle()
0x531f4c1 starrocks::StreamLoadAction::handle()
0x5ef5277 evhttp_handle_request
0x5ef5f23 bufferevent_readcb
0x5ee2662 event_process_active_single_queue
0x5ee2d9f event_base_loop
0x53058c4 _ZZN9starrocks12EvHttpServer5startEvENKUlvE_clEv
0x920b9d0 execute_native_thread_routine
0x7fddf4455fa3 start_thread
0x7fddf41f306f clone
(nil) (unknown)
48 tids: 495214,495215,495216,495217,495218,495219,495220,495221,495222,495223,495224,495225,495226,495227,495228,495229,495230,495231,495232,495233,495234,495235,495236,495237,495238,495239,495240,495241,495242,495243,495244,495245,495246,495247,495248,495249,495250,495251,495252,495253,495254,495255,495256,495257,495258,495259,495260,495261
0x7fddf445c00a __pthread_cond_wait
0x91a063c std::condition_variable::wait()
0x4cf9cd6 starrocks::StreamLoadPipe::read()
0x335e25d starrocks::vectorized::JsonReader::_read_and_parse_json()
0x3362447 starrocks::vectorized::JsonScanner::_open_next_reader()
0x3363cda starrocks::vectorized::JsonScanner::get_next()
0x53c76d1 starrocks::connector::FileDataSource::get_next()
0x3460c45 starrocks::vectorized::ConnectorScanNode::_scanner_thread()
0x4c7c2f0 starrocks::PriorityThreadPool::work_thread()
0x5e29b67 thread_proxy
0x7fddf4455fa3 start_thread
0x7fddf41f306f clone
(nil) (unknown)
120 tids: 495264,495265,495266,495267,495268,495269,495270,495271,495272,495273,495274,495275,495276,495277,495278,495279,495280,495281,495282,495283,495284,495285,495286,495287,495288,495289,495290,495291,495292,495293,495294,495295,495296,495297,495298,495299,495300,495301,495302,495303,495304,495305,495306,495307,495308,495309,495310,495311,495312,495313,495314,495315,495316,495317,495318,495319,495320,495321,495322,495323,495324,495325,495326,495327,2154605,2154606,2154607,2154608,2154610,2154671,2154964,2154965,2154966,2154967,2154969,2154984,2155017,2155039,2155069,2155070,2155080,2155081,2155082,2155083,2155084,2155085,2155086,2155087,2155088,2155113,2155125,2155128,2155130,2155131,2155132,2155133,2155148,2155153,2155154,2155335,2155336,2155349,2155354,2155365,2155404,2155417,2155432,2155451,2155461,2155462,2155485,2155486,2155503,2155509,2155521,2155576,2155639,2155712,2155713,2155733
0x7fddf445c00a __pthread_cond_wait
0x91a063c std::condition_variable::wait()
0x3460520 starrocks::vectorized::ConnectorScanNode::get_next()
0x4d4ad53 starrocks::PlanFragmentExecutor::_get_next_internal_vectorized()
0x4d4b140 starrocks::PlanFragmentExecutor::_open_internal_vectorized()
0x4d4d2dd starrocks::PlanFragmentExecutor::open()
0x4c9e71b starrocks::FragmentExecState::execute()
0x4ca4993 starrocks::FragmentMgr::exec_actual()
0x4e43062 starrocks::ThreadPool::dispatch_thread()
0x4e3db5a starrocks::Thread::supervise_thread()
0x7fddf4455fa3 start_thread
0x7fddf41f306f clone
(nil) (unknown)
-
Github Issue:
-
Github Fix PR:
-
Jira
-
问题版本:
- 2.5.0 ~ 2.5.12
- 3.0.0 ~ 3.0.6
- 3.1.0 ~ 3.1.3
-
修复版本:
-
问题原因:
-
临时解决办法:
webserver_num_workers=128 (默认48)
scanner_thread_pool_thread_num=128 (默认48)