常见 Crash / BUG / 优化 查询

Analyze too many versions

SHOW ANALYZE STATUS

有 Too many versions 错误

RoaringBitmap Crash

*** Aborted at 1666613216 (unix time) try “date -d @1666613216” if you are using GNU date ***
PC: @ 0x4a4b44d roaring_read_uint32_iterator
*** SIGILL (@0x4a4b44d) received by PID 45161 (TID 0x7fac92d3c700) from PID 77902925; stack trace: ***
@ 0x45fa272 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7fad6111a630 (unknown)
@ 0x4a4b44d roaring_read_uint32_iterator
@ 0x1e95f5a starrocks::vectorized::roaring2range()
@ 0x1e9617d starrocks::vectorized::SegmentIterator::_apply_del_vector()
@ 0x1e9dcc2 starrocks::vectorized::SegmentIterator::_init()
@ 0x1e9e519 starrocks::vectorized::SegmentIterator::do_get_next()
@ 0x1ea8b02 starrocks::vectorized::SegmentIterator::do_get_next()
@ 0x1f44d12 starrocks::vectorized::MergeEntry<>::next()
@ 0x1f4819a starrocks::vectorized::RowsetMergerImpl<>::_do_merge_horizontally()
@ 0x1f49a6c starrocks::vectorized::RowsetMergerImpl<>::_do_merge_vertically()
@ 0x1f4b813 starrocks::vectorized::RowsetMergerImpl<>::do_merge()
@ 0x1f3f811 starrocks::vectorized::compaction_merge_rowsets()
@ 0x1e30d93 starrocks::TabletUpdates::_do_compaction()
@ 0x1e31d4c starrocks::TabletUpdates::compaction()
@ 0x1dab646 starrocks::StorageEngine::_perform_update_compaction()
@ 0x1f9fc4e starrocks::StorageEngine::_update_compaction_thread_callback()
@ 0x60f3a40 execute_native_thread_routine
@ 0x7fad61112ea5 start_thread
@ 0x7fad6072d8dd __clone
@ 0x0 (unknown)
  • 临时规避方法:
    • 换支持 AVX2 的机器
    • 自已修改编译选项,编译一个不需要AVX2指令集的包
  • 问题原因: 机器不支持 avx2
cat /proc/cpuinfo | grep avx2
1赞

save meta crash

PC: @          0x203e811 starrocks::RowsetMetaPB::MergeFrom()
*** SIGSEGV (@0x8) received by PID 131926 (TID 0x7f3b3e417700) from PID 8; stack trace: ***
    @          0x354a282 google::(anonymous namespace)::FailureSignalHandler()
    @     0x7f3c3451e630 (unknown)
    @          0x203e811 starrocks::RowsetMetaPB::MergeFrom()
    @          0x17c027f starrocks::TabletMeta::to_meta_pb()
    @          0x17c06f8 starrocks::TabletMeta::_save_meta()
    @          0x17c0cbd starrocks::TabletMeta::save_meta()
    @          0x179a7fb starrocks::Tablet::save_meta()
    @          0x17b0c3a starrocks::TabletManager::_drop_tablet_directly_unlocked()
    @          0x17b10f4 starrocks::TabletManager::_drop_tablet_unlocked()
    @          0x17b22a4 starrocks::TabletManager::_add_tablet_unlocked()
    @          0x17b3097 starrocks::TabletManager::load_tablet_from_meta()
    @          0x17b4100 starrocks::TabletManager::load_tablet_from_dir()
    @          0x2e86752 starrocks::EngineStorageMigrationTask::_storage_migrate()
    @          0x2e870c4 starrocks::EngineStorageMigrationTask::execute()
    @          0x1786cce starrocks::StorageEngine::execute_task()
    @          0x20e5cbf starrocks::TaskWorkerPool::_clone_worker_thread_callback()
    @          0x5027870 execute_native_thread_routine
    @     0x7f3c34516ea5 start_thread
    @     0x7f3c3391bb0d __clone
    @                0x0 (unknown)

Sort Crash

PC: @          0x2520e16 starrocks::vectorized::HeapChunkSorter::_do_filter_data_for_type<>()
*** SIGSEGV (@0xc39) received by PID 300010 (TID 0x7fe795eba700) from PID 3129; stack trace: ***
    @          0x3cb75d2 google::(anonymous namespace)::FailureSignalHandler()
    @     0x7fe83a5c5630 (unknown)
    @          0x2520e16 starrocks::vectorized::HeapChunkSorter::_do_filter_data_for_type<>()
    @          0x251792f starrocks::vectorized::HeapChunkSorter::_filter_data()
    @          0x2518e5d starrocks::vectorized::HeapChunkSorter::update()
    @          0x25138ec starrocks::vectorized::TopNNode::_consume_chunks()
    @          0x2514b6f starrocks::vectorized::TopNNode::open()
    @          0x1e948f4 starrocks::PlanFragmentExecutor::_open_internal_vectorized()
    @          0x1e955f7 starrocks::PlanFragmentExecutor::open()
    @          0x1e294bb starrocks::FragmentExecState::execute()
    @          0x1e2db9c starrocks::FragmentMgr::exec_actual()
    @          0x1e2e401 _ZNSt17_Function_handlerIFvvEZN9starrocks11FragmentMgr18exec_plan_fragmentERKNS1_23TExecPlanFragmentParamsERKSt8functionIFvPNS1_20PlanFragmentExecutorEEESC_EUlvE_E9_M_invokeERKSt9_Any_data
    @          0x1f723d9 starrocks::ThreadPool::dispatch_thread()
    @          0x1f6df8a starrocks::Thread::supervise_thread()
    @     0x7fe83a5bdea5 start_thread
    @     0x7fe839bd896d __clone
    @                0x0 (unknown)

Hudi 外表部分字段查询出来为Null

实际不是Null,但是查询出的结果中,部分字段为 null

  • Github Issue:
  • Github Fix PR: https://github.com/StarRocks/starrocks/pull/13058
  • 问题版本:
    • 2.3.0 ~ 2.3.4
    • 2.4.0
  • 修复版本:
    • 2.3.5+
    • 2.4.1+
  • 临时规避方法:
    • 修改be.conf: parquet_late_materialization_enable = false
  • 问题原因:
    • Parquet 延迟物化处理有问题

主键模型表使用 persistent index 导致磁盘空间膨胀

主键模型表使用 persistent index, 导致磁盘空间不断增长,无法回收

Ubuntu启动BE报错

be.out 报错

undefined symbol: _dl_sym, version GLIBC_PRIVATE

Not Found string in global dict

Unique表,查询报错

Not Found string in global dict

build hash map crash

0x7f83e010a6d0(unknown)
0x256c340 starrocks::Agguegator::build_hash_map<>()
0x28ac22b starrocks::pipeline::Aggregatestreamingsinkoperator::_push_chunk_by_auto()
0x28b2635 starrocks::pipeline::Aggregatestreamingsinkoperator::push_chunk()
0x289716c starrocks::pipeline::PipelineDriver::process()
0x288c564 starrocks::pipeline::GlobalDriverExecutor::_worker_thread()
0x2132549 starrocks::ThreadPool::dispatch_thread()
0x212e0fa starrocks::Thread::supervise_thread()
0x7f83e0102e25 start_thread
0x7f83df71ebad__clone
0x0(unknown)

Schema Change+Add Partition+Balance之后,内存中的元数据出错

*** SIGABRT (@0x3e900000b7a) received by PID 2938 (TID 0x7efe4114a700) from PID 2938; stack trace: ***
    @          0x33a1be2 google::(anonymous namespace)::FailureSignalHandler()
    @     0x7efe9dc36630 (unknown)
    @     0x7efe9cf6b3d7 __GI_raise
    @     0x7efe9cf6cac8 __GI_abort
    @          0x157860d _ZN9__gnu_cxx27__verbose_terminate_handlerEv.cold
    @          0x4d3dfe6 __cxxabiv1::__terminate()
    @          0x4d3e051 std::terminate()
    @          0x4d3e1a5 __cxa_throw
    @          0x157a1f1 std::__throw_length_error()
    @          0x16ef0d0 std::vector<>::_M_fill_insert()
    @          0x16fa218 starrocks::vectorized::NullableColumn::append_numbers()
    @          0x1abe001 starrocks::segment_v2::BitShufflePageDecoder<>::next_batch()
    @          0x1ab43b7 starrocks::segment_v2::ParsedPageV2::read()
    @          0x2cae998 starrocks::segment_v2::ScalarColumnIterator::next_batch()
    @          0x2c173e1 starrocks::vectorized::SegmentIterator::_do_get_next()
    @          0x2c1ace1 starrocks::vectorized::SegmentIterator::do_get_next()
    @          0x1b0a40a starrocks::SegmentIteratorWrapper::do_get_next()
    @          0x189116b starrocks::vectorized::TimedChunkIterator::do_get_next()
    @          0x190cf3d starrocks::vectorized::HeapMergeIterator::do_get_next()
    @          0x189116b starrocks::vectorized::TimedChunkIterator::do_get_next()
    @          0x19041f1 starrocks::vectorized::AggregateIterator::do_get_next()
    @          0x189116b starrocks::vectorized::TimedChunkIterator::do_get_next()
    @          0x18c701a starrocks::vectorized::TabletReader::do_get_next()
    @          0x1822aa7 starrocks::vectorized::Compaction::merge_rowsets()
    @          0x18249b2 starrocks::vectorized::Compaction::do_compaction_impl()
    @          0x182679b starrocks::vectorized::Compaction::do_compaction()
    @          0x1820904 starrocks::vectorized::CumulativeCompaction::compact()
    @          0x1745439 starrocks::StorageEngine::_perform_cumulative_compaction()
    @          0x17326cf starrocks::StorageEngine::_cumulative_compaction_thread_callback()
    @          0x4db83a0 execute_native_thread_routine
    @     0x7efe9dc2eea5 start_thread
    @     0x7efe9d0339fd __clone
admin set frontend config ("max_scheduling_tablets"="0");
admin set frontend config ("disable_balance"="true");
admin set frontend config ("disable_colocate_balance"="true");
admin set frontend config ("max_scheduling_tablets"="2000");
admin set frontend config ("disable_balance"="false");
admin set frontend config ("disable_colocate_balance"="false");
  • 问题原因:
    • 存储层复用SchemaID,在上述操作之下出现错乱。

OlapScanNode use-after-free

*** Aborted at 1666617333 (unix time) try "date -d @1666617333" if you are using GNU date ***
PC: @          0x2b5243b starrocks::ExprContext::close()
*** SIGSEGV (@0x60) received by PID 302338 (TID 0x7f58702a0700) from PID 96; stack trace: ***
    @          0x3cf55d2 google::(anonymous namespace)::FailureSignalHandler()
    @     0x7f5921984630 (unknown)
    @          0x2b5243b starrocks::ExprContext::close()
    @          0x2b535c0 starrocks::Expr::close()
    @          0x27467e6 starrocks::vectorized::TabletScanner::close()
    @          0x2746e78 starrocks::vectorized::TabletScanner::~TabletScanner()
    @          0x246bbd7 _ZZN9starrocks10ObjectPool3addINS_10vectorized13TabletScannerEEEPT_S5_ENUlPvE_4_FUNES6_
    @          0x246b3ff starrocks::vectorized::OlapScanNode::~OlapScanNode()
    @          0x246ba22 starrocks::vectorized::OlapScanNode::~OlapScanNode()
    @          0x1ed7787 std::_Sp_counted_ptr<>::_M_dispose()
    @          0x18e7fda std::_Sp_counted_base<>::_M_release()
    @          0x1ed3e42 starrocks::RuntimeState::~RuntimeState()
    @          0x1e65a22 starrocks::FragmentExecState::~FragmentExecState()
    @          0x1e6edab std::_Sp_counted_ptr<>::_M_dispose()
    @          0x18e7fda std::_Sp_counted_base<>::_M_release()
    @          0x1e66db5

或堆栈打不全

*** SIGSEGV (@0x0) received by PID 3104 (TID 0x7f51b752b700) from PID 0; stack trace: ***
    @          0x3ff7972 google::(anonymous namespace)::FailureSignalHandler()
    @     0x7f522347c630 (unknown)
    @                0x0 (unknown)

array_agg crash

*** Aborted at 1670508783 (unix time) try "date -d @1670508783" if you are using GNU date ***
PC: @          0x28f91b5 starrocks::vectorized::NullableAggregateFunctionUnary<>::update_batch_selectively()
*** SIGSEGV (@0xa0) received by PID 4440 (TID 0x7f9a6a737700) from PID 160; stack trace: ***
    @          0x3ca37d2 google::(anonymous namespace)::FailureSignalHandler()
    @     0x7f9ad08a3630 (unknown)
    @          0x28f91b5 starrocks::vectorized::NullableAggregateFunctionUnary<>::update_batch_selectively()
    @          0x26c9cd9 starrocks::Aggregator::compute_batch_agg_states_with_selection()
    @          0x263df4f starrocks::pipeline::AggregateStreamingSinkOperator::_push_chunk_by_auto()
    @          0x26446cd starrocks::pipeline::AggregateStreamingSinkOperator::push_chunk()
    @          0x2627d5d starrocks::pipeline::PipelineDriver::process()
    @          0x261da11 starrocks::pipeline::GlobalDriverExecutor::_worker_thread()
    @          0x1f5a169 starrocks::ThreadPool::dispatch_thread()
    @          0x1f55d1a starrocks::Thread::supervise_thread()
    @     0x7f9ad089bea5 start_thread
    @     0x7f9acfeb6b0d __clone
    @                0x0 (unknown)

主键模型使用 Persistent index 导入 crash

也有可能启动占用大量内存

W0912 16:42:51.123860 2268472 mem_hook.cpp:254] large memory alloc: 103079215105 bytes, stack:
    @          0x4a360eb  malloc
    @          0x7faf765  operator new()
    @          0x41ed62d  std::__cxx11::basic_string<>::_M_mutate()
    @          0x4458b11  std::__cxx11::basic_string<>::resize()
    @          0x44f3f97  starrocks::SliceMutableIndex::load_snapshot()
    @          0x4448f76  starrocks::ShardByLengthMutableIndex::load_snapshot()
    @          0x4459795  starrocks::ShardByLengthMutableIndex::load()
    @          0x44669dd  starrocks::PersistentIndex::_load()
    @          0x4467a3f  starrocks::PersistentIndex::load()
    @          0x446e6c3  starrocks::PersistentIndex::load_from_tablet()
    @          0x4180f92  starrocks::PrimaryIndex::_do_load()
    @          0x41824bf  starrocks::PrimaryIndex::load()
    @          0x426265e  starrocks::	TabletUpdates::_apply_rowset_commit()
    @          0x4266353  starrocks::TabletUpdates::do_apply()
    @          0x4b17465  starrocks::ThreadPool::dispatch_thread()
    @          0x4b11e4a  starrocks::Thread::supervise_thread()
    @     0x7f759126c609  start_thread
    @     0x7f7591030133  clone
    @              (nil)  (unknown)

start time:2022年11月08日屋期二07:23:17csT
terminate called after throwing an instance of 'std:bad_alloc'
what(): std::bad_alloc
query_id:00000000-0000-0000-0000-000000000000,fragment_instance:00000000-0000-0000-0000-000000000000
**Aborted at 1667863398 (unix time)try "date -d @1667863398"if you are using GNU date **
PC:
0x7fd3c5943207 GI_raise
**SIGABRT (@Ox5abc)received by PID 23228 (TID 0x7fd3515fd700)from PID 23228;stack trace:**
0x481e332 google::(anonymous namespace)::FailuresignalHandler()
0x7fd3c63f75d0 (unknown)
0x7fd3c5943207 GI_raise
0x7fd3c59448f8 GI_abort
0x1c4acef_ZN9_gnu_cxx27_verbose_terminate_handlerEv.cold
0x62a0af6 _cxxabivl:_terminate()
0x62a0b61 std::terminate()
0x62a0cb4 __cxa_throw
0x1c4abf6 _Znwm.cold
0x22bab2b starrocks::FixedMutableIndex::load_snapshot()
0x229e9e6 starrocks::shardByLengthMutableIndex::load()
0x22aa9bc starrocks::PersistentIndex::_load()
0x22abe77 starrocks:PersistentIndex::load()
0x22ad821 starrocks:PersistentIndex::load_from_tablet()
0x1ff713c starrocks:PrimaryIndex:_do_load()
0x1ff7edf starrocks::PrimaryIndex::load()
0x1e6df30 starrocks:Tabletupdates::_apply_rowset_commit()
0x1e73bb3 starrocks::Tabletupdates::do_apply()
0x2680af5 starrocks:ThreadPool:dispatch_thread()
0x267bf2a starrocks:Thread:supervise_thread()
0x7fd3c63efdd5 start_thread
0x7fd3c5a0aead clone

我看这个pr cherrypick到2.3的分支失败了,然后就close了,是有什么特殊原因吗?

机器升级规格后,FE记录的 Cpu Cores 信息不对

Show backens; CpuCores还是记录的原来的32核,实际是64核

          Version: 2.2.8-3edb7b7
           Status: {"lastSuccessReportTabletsTime":"2022-12-11 12:00:45"}
DataTotalCapacity: 4.584 TB
      DataUsedPct: 20.59 %
         CpuCores: 32

Join + bitmap crash

terminate called after throwing an instance of 'std::runtime_error'
  what():  failed memory alloc in constructor
*** Aborted at 1670985104 (unix time) try "date -d @1670985104" if you are using GNU date ***
PC: @     0x7fae92edf387 __GI_raise
*** SIGABRT (@0x3dc0001022a) received by PID 66090 (TID 0x7fae04ae5700) from PID 66090; stack trace: ***
    @          0x3fa3ad2 google::(anonymous namespace)::FailureSignalHandler()
    @     0x7fae93994630 (unknown)
    @     0x7fae92edf387 __GI_raise
    @     0x7fae92ee0a78 __GI_abort
    @          0x188857d _ZN9__gnu_cxx27__verbose_terminate_handlerEv.cold
    @          0x59a72a6 __cxxabiv1::__terminate()
    @          0x59a7311 std::terminate()
    @          0x59a74b6 __cxa_rethrow
    @          0x16067b0 _ZNSt8_Rb_treeIjSt4pairIKj7RoaringESt10_Select1stIS3_ESt4lessIjESaIS3_EE7_M_copyINS9_11_Alloc_nodeEEEPSt13_Rb_tree_nodeIS3_EPKSD_PSt18_Rb_tree_node_baseRT_.isra.0.cold
    @          0x209078b starrocks::BitmapValue::BitmapValue()
    @          0x24e859b starrocks::vectorized::ObjectColumn<>::append()
    @          0x24e88b5 starrocks::vectorized::ObjectColumn<>::append_selective()
    @          0x26918f8 starrocks::vectorized::JoinHashMap<>::_copy_build_column()
    @          0x2692099 starrocks::vectorized::JoinHashMap<>::_build_output()
    @          0x26e6289 starrocks::vectorized::JoinHashMap<>::probe()
    @          0x2673478 starrocks::vectorized::JoinHashTable::probe()
    @          0x26646a8 starrocks::vectorized::HashJoinNode::_probe()
    @          0x2665839 starrocks::vectorized::HashJoinNode::get_next()
    @          0x274bd91 starrocks::vectorized::ProjectNode::get_next()
    @          0x2051f23 starrocks::PlanFragmentExecutor::_get_next_internal_vectorized()
    @          0x205375e starrocks::PlanFragmentExecutor::_open_internal_vectorized()
    @          0x2054347 starrocks::PlanFragmentExecutor::open()
    @          0x1fd8fab starrocks::FragmentExecState::execute()
    @          0x1fdd5dc starrocks::FragmentMgr::exec_actual()
    @          0x1fdde81 _ZNSt17_Function_handlerIFvvEZN9starrocks11FragmentMgr18exec_plan_fragmentERKNS1_23TExecPlanFragmentParamsERKSt8functionIFvPNS1_20PlanFragmentExecutorEEESC_EUlvE_E9_M_invokeERKSt9_Any_data
    @          0x2132549 starrocks::ThreadPool::dispatch_thread()
    @          0x212e0fa starrocks::Thread::supervise_thread()
    @     0x7fae9398cea5 start_thread
    @     0x7fae92fa796d __clone
    @                0x0 (unknown)

读 Parquet crash

*** Aborted at 1670504382 (unix time) try "date -d @1670504382" if you are using GNU date ***
PC: @          0xcd14fa9 starrocks::parquet::DictDecoder<>::get_dict_values()
*** SIGSEGV (@0x604313bec270) received by PID 155105 (TID 0x7f7e6ae2b700) from PID 331268720; stack trace: ***
    @          0xdb15d42 google::(anonymous namespace)::FailureSignalHandler()
    @     0x7f7f4792f3ab os::Linux::chained_handler()
    @     0x7f7f47933efc JVM_handle_linux_signal
    @     0x7f7f47926d48 signalHandler()
    @     0x7f7f46e04630 (unknown)
    @          0xcd14fa9 starrocks::parquet::DictDecoder<>::get_dict_values()
    @          0xccfe8b3 starrocks::parquet::ColumnChunkReader::get_dict_values()
    @          0xccfee2c starrocks::parquet::StoredColumnReader::get_dict_values()
    @          0xccf0520 starrocks::parquet::ScalarColumnReader::get_dict_values()
    @          0xcce43d0 starrocks::parquet::GroupReader::_dict_decode()
    @          0xccdd403 starrocks::parquet::GroupReader::get_next()
    @          0xcca4e6a starrocks::parquet::FileReader::get_next()
    @          0xc96db97 starrocks::vectorized::HdfsParquetScanner::do_get_next()
    @          0xc9414b9 starrocks::vectorized::HdfsScanner::get_next()
    @          0xc825ce6 starrocks::connector::HiveDataSource::get_next()
    @          0x749bd98 starrocks::pipeline::ConnectorChunkSource::_read_chunk()
    @          0x7449923 starrocks::pipeline::ChunkSource::buffer_next_batch_chunks_blocking()
    @          0x63a7ae8 _ZZN9starrocks8pipeline12ScanOperator18_trigger_next_scanEPNS_12RuntimeStateEiENKUlvE_clEv
    @          0x63ac2fe _ZSt13__invoke_implIvRZN9starrocks8pipeline12ScanOperator18_trigger_next_scanEPNS0_12RuntimeStateEiEUlvE_JEET_St14__invoke_otherOT0_DpOT1_
    @          0x63ac1ac _ZSt10__invoke_rIvRZN9starrocks8pipeline12ScanOperator18_trigger_next_scanEPNS0_12RuntimeStateEiEUlvE_JEENSt9enable_ifIX16is_invocable_r_vIT_T0_DpT1_EES8_E4typeEOS9_DpOSA_
    @          0x63ac021 _ZNSt17_Function_handlerIFvvEZN9starrocks8pipeline12ScanOperator18_trigger_next_scanEPNS1_12RuntimeStateEiEUlvE_E9_M_invokeERKSt9_Any_data
    @          0x6022428 std::function<>::operator()()
    @          0x63bf8be starrocks::workgroup::ScanExecutor::worker_thread()
    @          0x63bf108 _ZZN9starrocks9workgroup12ScanExecutor10initializeEiENKUlvE_clEv
    @          0x63c0ada _ZSt13__invoke_implIvRZN9starrocks9workgroup12ScanExecutor10initializeEiEUlvE_JEET_St14__invoke_otherOT0_DpOT1_
    @          0x63c07a2 _ZSt10__invoke_rIvRZN9starrocks9workgroup12ScanExecutor10initializeEiEUlvE_JEENSt9enable_ifIX16is_invocable_r_vIT_T0_DpT1_EES6_E4typeEOS7_DpOS8_
    @          0x63c0317 _ZNSt17_Function_handlerIFvvEZN9starrocks9workgroup12ScanExecutor10initializeEiEUlvE_E9_M_invokeERKSt9_Any_data
    @          0x6022428 std::function<>::operator()()
    @          0xbbe3054 starrocks::FunctionRunnable::run()
    @          0xbbdfe94 starrocks::ThreadPool::dispatch_thread()
    @          0xbbfd420 std::__invoke_impl<>()
    @          0xbbfcd79 std::__invoke<>()
  • Github Issue: 无
  • Github Fix PR: https://github.com/StarRocks/starrocks/pull/15189
  • 问题版本:
    • 2.3.0 ~ 2.3.5
    • 2.4.0 ~ 2.4.2
  • 修复版本:
    • 2.3.6+
    • 2.4.3+
  • 临时规避方法:
  • 问题原因:
    • 一个scan range跨越多个row group, 一个row group有字典, 一个row group没有字典编码

TopN crash

query_id:09e1a166-803f-11ed-b2ae-c4b8b44f4875, fragment_instance:09e1a166-803f-11ed-b2ae-c4b8b44f4875
*** Aborted at 1671524376 (unix time) try "date -d @1671524376" if you are using GNU date ***
PC: @     0x2af0ba23c676 __memcmp_sse4_1
*** SIGSEGV (@0xe4d00000019) received by PID 20674 (TID 0x2af0f4fcd700) from PID 25; stack trace: ***
    @          0x3ff4972 google::(anonymous namespace)::FailureSignalHandler()
    @     0x2af0b97b6630 (unknown)
    @     0x2af0ba23c676 __memcmp_sse4_1
    @          0x27e54d8 _ZSt13__adjust_heapIN9__gnu_cxx17__normal_iteratorIPN9starrocks10vectorized20VerticalColumnSorter16CompactChunkItemINS2_5SliceEEESt6vectorIS7_SaIS7_EEEElS7_NS0_5__ops15_Iter_comp_iterIZNS3_L19sort_and_tie_helperIZNS4_8do_visitIjEENS2_6StatusERKNS3_16BinaryColumnBaseIT_EEEUlRKS7_SO_E_SB_EESH_RKbPKNS3_6ColumnEbRT0_RS9_IhSaIhEESJ_St4pairIiiEbmPmEUlSJ_SV_E_EEEvSJ_SV_SV_T1_T2_
    @          0x2884c07 starrocks::vectorized::VerticalColumnSorter::do_visit<>()
    @          0x2885a76 starrocks::ColumnVisitorAdapter<>::visit()
    @          0x19d1edc starrocks::vectorized::ColumnFactory<>::accept()
    @          0x27dcc5b starrocks::vectorized::sort_vertical_columns()
    @          0x2859c4b starrocks::vectorized::VerticalColumnSorter::do_visit()
    @          0x2859e26 starrocks::ColumnVisitorAdapter<>::visit()
    @          0x252298c starrocks::vectorized::ColumnFactory<>::accept()
    @          0x27dcc5b starrocks::vectorized::sort_vertical_columns()
    @          0x27de21a starrocks::vectorized::sort_vertical_chunks()
    @          0x27529e5 starrocks::vectorized::ChunksSorterTopn::_partial_sort_col_wise()
    @          0x2752e9c starrocks::vectorized::ChunksSorterTopn::_filter_and_sort_data()
    @          0x2756434 starrocks::vectorized::ChunksSorterTopn::_sort_chunks()
    @          0x2756b10 starrocks::vectorized::ChunksSorterTopn::done()
    @          0x27419e5 starrocks::vectorized::ChunksSorter::finish()
    @          0x28ba860 starrocks::pipeline::PartitionSortSinkOperator::set_finishing()
    @          0x28def07 starrocks::pipeline::PipelineDriver::_mark_operator_finishing()
    @          0x28dff3b starrocks::pipeline::PipelineDriver::process()
    @          0x28d67dc starrocks::pipeline::GlobalDriverExecutor::_worker_thread()
    @          0x21772c9 starrocks::ThreadPool::dispatch_thread()
    @          0x2172e7a starrocks::Thread::supervise_thread()
    @     0x2af0b97aeea5 start_thread
    @     0x2af0ba1cfb0d __clone
    @                0x0 (unknown)
  1. CrossJoin Crash

*** SIGABRT (@0x9219) received by PID 37401 (TID 0x2aaeee9bf700) from PID 37401; stack trace: ***
102
103    @          0x3d0a5d2 google::(anonymous namespace)::FailureSignalHandler()
105    @     0x2aaeb4ebe630 (unknown)
107    @     0x2aaeb580f387 __GI_raise
109    @     0x2aaeb5810a78 __GI_abort
111    @          0x17f69ed _ZN9__gnu_cxx27__verbose_terminate_handlerEv.cold
113    @          0x56c8086 __cxxabiv1::__terminate()
115    @          0x56c80f1 std::terminate()
117    @          0x56c8244 __cxa_throw
119    @          0x17f85d1 std::__throw_length_error()
121    @          0x196c682 std::vector<>::_M_range_insert<>()
123    @          0x1966ce4 starrocks::vectorized::BinaryColumn::append()
125    @          0x232fd96 starrocks::vectorized::NullableColumn::append()
127    @          0x2677b80 starrocks::pipeline::CrossJoinLeftOperator::_copy_joined_rows_with_index_base_probe()
129    @          0x2678041 starrocks::pipeline::CrossJoinLeftOperator::pull_chunk()
131    @          0x268f05a starrocks::pipeline::PipelineDriver::process()
133    @          0x268525e starrocks::pipeline::GlobalDriverExecutor::_worker_thread()
135    @          0x1fb37d9 starrocks::ThreadPool::dispatch_thread()
137    @          0x1faf38a starrocks::Thread::supervise_thread()
139    @     0x2aaeb4eb6ea5 start_thread
141    @     0x2aaeb58d7b0d __clone
143    @                0x0 (unknown)
  • Github Issue: 无
  • Github Fix PR: https://github.com/StarRocks/starrocks/pull/15497
  • 问题版本:
    • 2.3.0 ~ 2.3.6
    • 2.4.0 ~ 2.4.2
  • 修复版本:
    • 2.3.7+
    • 2.4.3+
  • 临时规避方法:
  • 问题原因:
    • 基数估计有问题,导致执行计划生成不合理。