【详述】
由于环境原因,需要在不支持AVX2指令集的机器上,通过容器的形式部署StarRocks。
参考以下内容,进行镜像的制作:
2023年中旬,已制作过2.2.13版本的不支持AVX2指令集的镜像,并可以正常运行。
近期,尝试制作2.5.13版本的镜像,发现如下情况:
- 启动服务初期,运行正常。在客户端中执行
SHOW BACKENDS\G
,BE节点状态正常 - 运行一段时间后,服务崩溃,
be.out
中日志内容如下
start time: Mon Dec 18 04:00:14 UTC 2023
2.5.13 RELEASE (build )
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 154748672
tracker:query_pool consumption: 0
tracker:load consumption: 0
tracker:metadata consumption: 996870
tracker:tablet_metadata consumption: 382340
tracker:rowset_metadata consumption: 497277
tracker:segment_metadata consumption: 28501
tracker:column_metadata consumption: 88752
tracker:tablet_schema consumption: 28724
tracker:segment_zonemap consumption: 19426
tracker:short_key_index consumption: 0
tracker:column_zonemap_index consumption: 34720
tracker:ordinal_index consumption: 19184
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 1738354
tracker:page_cache consumption: 832
tracker:update consumption: 458
tracker:chunk_allocator consumption: 1458600
tracker:clone consumption: 0
tracker:consistency consumption: 0
*** Aborted at 1702872917 (unix time) try "date -d @1702872917" if you are using GNU date ***
PC: @ 0x6018f0d ra_init
*** SIGILL (@0x6018f0d) received by PID 860 (TID 0x7fb0125f5700) from PID 100765453; stack trace: ***
@ 0x592ed42 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7fb156473630 (unknown)
@ 0x6018f0d ra_init
@ 0x42fafd0 starrocks::DelVector::_add_dels()
@ 0x42fb78c starrocks::DelVector::add_dels_as_new_version()
@ 0x414562b starrocks::TabletUpdates::_apply_rowset_commit()
@ 0x4147512 starrocks::TabletUpdates::do_apply()
@ 0x49b43d5 starrocks::ThreadPool::dispatch_thread()
@ 0x49aee7a starrocks::Thread::supervise_thread()
@ 0x7fb15646bea5 start_thread
@ 0x7fb155a8696d __clone
@ 0x0 (unknown)
- BE服务的stdout如下:
ERROR 1064 (HY000) at line 1: Unexpected exception: Same backend already exists[127.0.0.1:9050]
对比2.2.13和2.5.13,制作镜像过程中的差异点如下
- 源文件:分别为2.2.13与2.5.13的tar.gz文件
- 编译环境
- 2.2.13(运行正常):starrocks/dev-env:branch-2.2(该镜像线上已不存在)
- 2.5.13(运行不正常):starrocks/dev-en-centos7:2.5-latest
想咨询社区老师,问题的原因是什么?在现有环境下,可以如何解决?
【StarRocks版本】:2.5.13
【集群规模】例如:1fe(1 follower)+1be(fe与be混部)
【机器信息】
CPU某核的信息如下
processor : 7
vendor_id : GenuineIntel
cpu family : 6
model : 45
model name : Intel(R) Xeon(R) CPU E5-2665 0 @ 2.40GHz
stepping : 7
microcode : 0x5003604
cpu MHz : 2393.998
cache size : 20480 KB
physical id : 0
siblings : 8
core id : 7
cpu cores : 8
apicid : 14
initial apicid : 14
fpu : yes
fpu_exception : yes
cpuid level : 13
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat clflush mmx fxsr sse sse2 ht syscall nx lm constant_tsc rep_good nopl cpuid tsc_known_freq pni ssse3 cx16 sse4_1 sse4_2 popcnt hypervisor lahf_lm pti
bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_unknown
bogomips : 4787.99
clflush size : 64
cache_alignment : 64
address sizes : 46 bits physical, 48 bits virtual
power management:
【联系方式】社区群12 - srrookie