背景:
从1.18.4升级至2.1.3后,base compaction会导致be down掉,在1.18.4版本没有这个问题,1.19.7版本也有这个问题,但是发生频率没有2.1.3版本高。
be.out:
tcmalloc: large alloc 2006474752 bytes == 0x1da6c2000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 2844852224 bytes == 0x1da6c2000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4348510208 bytes == 0x238bd6000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4388855808 bytes == 0x238bba000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4429209600 bytes == 0x238bba000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4429209600 bytes == 0x238bba000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4469563392 bytes == 0x238bba000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4469563392 bytes == 0x238bb4000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4509917184 bytes == 0x238bb0000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4550270976 bytes == 0x238ba8000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4590624768 bytes == 0x238ba8000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4630978560 bytes == 0x238ba8000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
tcmalloc: large alloc 4630978560 bytes == 0x238ba4000 @ 0x4e98709 0x50163dc 0x1ceab18 0x4f66c75 0x20de5d0 0x17e5761 0x19ab5c7 0x184ad59 0x1c1a159 0x18ed52f 0x18f1c4d 0x18f308b 0x18e84ee 0x176e009 0x175a0af 0x4fe0870
terminate called after throwing an instance of ‘std::bad_alloc’
what(): std::bad_alloc
*** Aborted at 1650236616 (unix time) try “date -d @1650236616” if you are using GNU date ***
PC: @ 0x7f9279c48207 __GI_raise
*** SIGABRT (@0x3ea00017286) received by PID 94854 (TID 0x7f91f512d700) from PID 94854; stack trace: ***
@ 0x3503022 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7f927a9125d0 (unknown)
@ 0x7f9279c48207 __GI_raise
@ 0x7f9279c498f8 __GI_abort
@ 0x15de049 _ZN9__gnu_cxx27__verbose_terminate_handlerEv.cold
@ 0x4f66726 __cxxabiv1::__terminate()
@ 0x4f66791 std::terminate()
@ 0x4f668e4 __cxa_throw
@ 0x15ddf50 _Znwm.cold
@ 0x1d5b8c9 starrocks::faststring::GrowArray()
@ 0x19f9533 starrocks::PageIO::compress_page_body()
@ 0x19a6e43 starrocks::ScalarColumnWriter::finish_current_page()
@ 0x19a8b8c starrocks::ScalarColumnWriter::append()
@ 0x19ab612 starrocks::ScalarColumnWriter::append()
@ 0x184ad59 starrocks::SegmentWriter::append_chunk()
@ 0x1c1a159 starrocks::HorizontalBetaRowsetWriter::add_chunk()
@ 0x18ed52f starrocks::vectorized::Compaction::_merge_rowsets_horizontally()
@ 0x18f1c4d starrocks::vectorized::Compaction::do_compaction_impl()
@ 0x18f308b starrocks::vectorized::Compaction::do_compaction()
@ 0x18e84ee starrocks::vectorized::BaseCompaction::compact()
@ 0x176e009 starrocks::StorageEngine::_perform_base_compaction()
@ 0x175a0af starrocks::StorageEngine::_base_compaction_thread_callback()
@ 0x4fe0870 execute_native_thread_routine
@ 0x7f927a90add5 start_thread
@ 0x7f9279d0fead __clone
@ 0x0 (unknown)
主机配置: 16c31g
base compaction 的 tablet大小为8G