BE nodes drop offline unexpectedly

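To help us locate your problem faster, please provide the following information. Thank you.
[Details] A scheduled BI job runs at midnight (00:00). Partway through, at a certain step, the connection suddenly drops and the job fails. Only 4 tasks were running concurrently when the job started. The error message is as follows: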
W0716 00:32:02.403928 9003 sink_buffer.cpp:362] transmit chunk rpc failed [dest_instance_id=3a444a9c-6199-11f0-93c0-a0369fd8cee2] [dest=x.x.x.x:8060] detail:brpc failed, error=Host is down, error_text=[E110]Fail to read from Socket{id=453 fd=9163 addr=x.x.x.x:8060:64110} (0x0x7f5ddab60800): Connection timed out [R1][E112]Not connected to x.x.x.x:8060 yet, server_id=453 [R2][E112]Not connected to x.x.x.x:8060 yet, server_id=453 [R3][E112]Not connected to x.x.x.x:8060 yet, server_id=453
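This is from one of the 3 BE nodes; the other two report the same error. x.x.x.x is the IP of the other BE that the reporting node was connecting to. In effect, the 3 BE nodes cannot reach each other. The BE processes did not restart. These errors last 3 to 4 minutes at the longest and about 1 minute at the shortest.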

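[Background] None
[Business impact] After the connection drops, the application's retries also fail, so the report jobs fail outright and have to be rerun manually during the day.
[Shared-data (storage-compute separation) mode] No
[StarRocks version] 3.1.15
[Cluster size] 3 FE + 3 BE (FE and BE deployed on separate machines)
[Machine specs] 40 cores / 256 GB RAM / Gigabit Ethernet
[Contact] Community group 24-Hanson, thanks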


Our cluster runs into this problem often too. Did you ever get it resolved?

No, it is not resolved. We don't know the cause, and it is still happening.

Are there any external tables in your environment?

Try setting brpc_connection_type=pooled in be.conf.

No external tables.

Is network traffic very heavy when the problem occurs?

I checked the BE configuration page in the official docs (https://docs.starrocks.io/zh/docs/3.1/administration/management/BE_configuration/) and did not see a brpc_connection_type parameter. Does version 3.1.15 have this parameter?


This is the network traffic for the past week; there is a peak around 00:40.


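Also, a new problem appeared yesterday: BE nodes go down at irregular intervals, and the business can no longer use the cluster normally. Simple queries work, but any complex query fails immediately with an error like this:
transmit chunk rpc failed [dest_instance_id=3a444a9c-6199-11f0-93c0-a0369fd8cee2] [dest=x.x.x.x:8060] detail:brpc failed, error=Host is down, error_text=[E110]Fail to read from Socket{id=453 fd=9163 addr=x.x.x.x:8060:64110} (0x0x7f5ddab60800): Connection timed out [R1][E112]Not connected to x.x.x.x:8060 yet, server_id=453 [R2][E112]Not connected to x.x.x.x:8060 yet, server_id=453 [R3][E112]Not connected to x.x.x.x:8060 yet, server_id=453
The 3 BE nodes do not go down at the same time. Each time, one BE node goes down, and it is not always the same one. All 3 BE nodes go down frequently.
In addition, checking the starrocks_be process on the servers shows that the process has not restarted.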

Please post be.out so we can see what the crash stack looks like.

So it looks like the nodes appear hung because of a burst of network traffic?
In that case, try switching to pooled, or lower pipeline_dop.

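I'll try tuning these parameters once the cluster is usable again.
How do I configure pipeline_dop? I couldn't find that parameter in the docs either.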

be.out.1 (4.9 KB) be.out.2 (522.6 KB) be.out.3 (913.7 KB)

In the logs, yesterday's restart was a manual one.

Try pooled first. If that doesn't help, manually run set global pipeline_dop=4; (the default should be 0).

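After adding the pooled parameter, the Host is down error above has not appeared again, and the cluster is running again.
But monitoring still shows the BE nodes going offline fairly often: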

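The cluster logs still contain a large number of Connection timed out errors, and SQL jobs on the business side still fail, though less often.
Here are some of the Connection timed out log entries: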

W1121 13:41:12.110149 17255 input_messenger.cpp:226] Fail to read from Socket{id=609885357217 fd=13848 addr=x.x.x.151:8060:9356} (0x14dd8de38600): Connection timed out [110]
W1121 13:41:12.134232 17200 input_messenger.cpp:226] Fail to read from Socket{id=1864015840948 fd=3249 addr=x.x.x.151:8060:9360} (0x14b7461dea00): Connection timed out [110]
W1121 13:41:12.830236 17196 input_messenger.cpp:226] Fail to read from Socket{id=867583407856 fd=19648 addr=x.x.x.151:8060:9374} (0x14b81646eb80): Connection timed out [110]
W1121 13:41:12.854214 17279 input_messenger.cpp:226] Fail to read from Socket{id=1271310320905 fd=10762 addr=x.x.x.151:8060:9366} (0x14ddabed40c0): Connection timed out [110]
W1121 13:41:13.550213 17200 input_messenger.cpp:226] Fail to read from Socket{id=1915555423035 fd=8958 addr=x.x.x.151:8060:9380} (0x14dc5b64bc80): Connection timed out [110]
W1121 13:41:13.574215 17196 input_messenger.cpp:226] Fail to read from Socket{id=1675037261632 fd=6701 addr=x.x.x.151:8060:9378} (0x14bc6e9df800): Connection timed out [110]
W1121 13:41:22.854239 17254 input_messenger.cpp:226] Fail to read from Socket{id=2439541425549 fd=20999 addr=x.x.x.218:8060:6012} (0x14ddb3131b40): Connection timed out [110]
W1121 13:41:27.262200 17200 input_messenger.cpp:226] Fail to read from Socket{id=1348619742524 fd=20688 addr=x.x.x.151:8060:9748} (0x14cd56faf580): Connection timed out [110]
W1121 13:41:27.270154 17268 input_messenger.cpp:226] Fail to read from Socket{id=773094116619 fd=6879 addr=x.x.x.151:8060:9522} (0x14dc80fd5300): Connection timed out [110]
W1121 13:41:27.270228 17262 input_messenger.cpp:226] Fail to read from Socket{id=2207613209709 fd=20865 addr=x.x.x.151:8060:9956} (0x14cf00752bc0): Connection timed out [110]
W1121 13:41:27.326195 17257 input_messenger.cpp:226] Fail to read from Socket{id=2164663518881 fd=13769 addr=x.x.x.151:8060:10694} (0x14ddb5bbd7c0): Connection timed out [110]
W1121 13:41:27.334146 17270 input_messenger.cpp:226] Fail to read from Socket{id=1649267456474 fd=13902 addr=x.x.x.151:8060:10696} (0x14b50dc32540): Connection timed out [110]
W1121 13:41:27.342242 17250 input_messenger.cpp:226] Fail to read from Socket{id=1537598312804 fd=14201 addr=x.x.x.151:8060:10698} (0x149f3d706f40): Connection timed out [110]
W1121 13:41:42.230180 17255 input_messenger.cpp:226] Fail to read from Socket{id=1211180796300 fd=20632 addr=x.x.x.151:8060:11094} (0x14d235a52e80): Connection timed out [110]
W1121 13:41:42.262171 17200 input_messenger.cpp:226] Fail to read from Socket{id=1932735318799 fd=20562 addr=x.x.x.151:8060:11026} (0x14b312d34000): Connection timed out [110]
W1121 13:41:42.326187 17262 input_messenger.cpp:226] Fail to read from Socket{id=1589137909809 fd=20799 addr=x.x.x.151:8060:11250} (0x14db99cf6f40): Connection timed out [110]
W1121 13:41:42.390226 17250 input_messenger.cpp:226] Fail to read from Socket{id=1975684964451 fd=21270 addr=x.x.x.151:8060:11648} (0x14d8bc8bfe00): Connection timed out [110]
W1121 13:41:42.390272 17275 input_messenger.cpp:226] Fail to read from Socket{id=2370821956304 fd=20993 addr=x.x.x.151:8060:11432} (0x14dcba42d780): Connection timed out [110]
W1121 13:41:52.222151 17246 input_messenger.cpp:226] Fail to read from Socket{id=1340029831052 fd=20928 addr=x.x.x.218:8060:7268} (0x14b8e97c92c0): Connection timed out [110]
W1121 13:41:52.230146 17200 input_messenger.cpp:226] Fail to read from Socket{id=2010044726339 fd=21631 addr=x.x.x.218:8060:7812} (0x14b70caace00): Connection timed out [110]
W1121 13:41:52.262146 17260 input_messenger.cpp:226] Fail to read from Socket{id=1408749289245 fd=21551 addr=x.x.x.218:8060:7758} (0x14ca5b00fa40): Connection timed out [110]
W1121 13:41:52.302124 17260 input_messenger.cpp:226] Fail to read from Socket{id=1821066154644 fd=21618 addr=x.x.x.218:8060:7800} (0x14ca82574e80): Connection timed out [110]
W1121 13:41:52.342149 17270 input_messenger.cpp:226] Fail to read from Socket{id=1391569420987 fd=21170 addr=x.x.x.218:8060:7482} (0x14ada6cbe540): Connection timed out [110]
W1121 13:41:52.342167 17257 input_messenger.cpp:226] Fail to read from Socket{id=1992864845874 fd=20612 addr=x.x.x.218:8060:8092} (0x14d7b4191ac0): Connection timed out [110]
W1121 13:41:52.350219 17281 input_messenger.cpp:226] Fail to read from Socket{id=1717986932280 fd=21386 addr=x.x.x.218:8060:7618} (0x14c1d76c1140): Connection timed out [110]
W1121 13:41:52.382145 17248 input_messenger.cpp:226] Fail to read from Socket{id=1468878844030 fd=21701 addr=x.x.x.218:8060:7904} (0x14b638280680): Connection timed out [110]
W1121 13:41:52.383127 17202 input_messenger.cpp:226] Fail to read from Socket{id=1082331771512 fd=18953 addr=x.x.x.218:8060:7944} (0x14d6e605d680): Connection timed out [110]
W1121 13:41:52.471302 17262 input_messenger.cpp:226] Fail to read from Socket{id=1348619754066 fd=17502 addr=x.x.x.218:8060:8336} (0x14cd64140940): Connection timed out [110]
W1121 13:41:57.870185 17270 input_messenger.cpp:226] Fail to read from Socket{id=996432436515 fd=21518 addr=x.x.x.151:8060:11850} (0x14cbe94ce040): Connection timed out [110]
W1121 13:41:57.950248 17268 input_messenger.cpp:226] Fail to read from Socket{id=721554534207 fd=21693 addr=x.x.x.151:8060:11996} (0x14c5c606d340): Connection timed out [110]
W1121 13:41:57.958185 17262 input_messenger.cpp:226] Fail to read from Socket{id=2259152819600 fd=20535 addr=x.x.x.151:8060:10998} (0x14d039bb0980): Connection timed out [110]
W1121 13:41:57.975122 17260 input_messenger.cpp:226] Fail to read from Socket{id=1374389549988 fd=19076 addr=x.x.x.151:8060:12104} (0x14bc6c18ce40): Connection timed out [110]
W1121 13:41:57.998289 17279 input_messenger.cpp:226] Fail to read from Socket{id=1477468778778 fd=21140 addr=x.x.x.151:8060:11580} (0x14b90818be80): Connection timed out [110]
W1121 13:41:58.070186 17262 input_messenger.cpp:226] Fail to read from Socket{id=2121713847245 fd=21078 addr=x.x.x.151:8060:11512} (0x14dca9bf57c0): Connection timed out [110]
W1121 13:41:58.150630 17279 input_messenger.cpp:226] Fail to read from Socket{id=1700807057536 fd=19154 addr=x.x.x.151:8060:12768} (0x14d8bc8c3f40): Connection timed out [110]
W1121 13:41:58.158485 17270 input_messenger.cpp:226] Fail to read from Socket{id=1717986921283 fd=1986 addr=x.x.x.151:8060:12780} (0x14dc7f448040): Connection timed out [110]
W1121 13:41:58.159125 17201 input_messenger.cpp:226] Fail to read from Socket{id=1511828489827 fd=19396 addr=x.x.x.151:8060:12778} (0x14dd27ae72c0): Connection timed out [110]
W1121 13:41:58.190168 17201 input_messenger.cpp:226] Fail to read from Socket{id=1322849962474 fd=21334 addr=x.x.x.151:8060:11686} (0x14b5d6538c80): Connection timed out [110]
W1121 13:41:58.270113 17196 input_messenger.cpp:226] Fail to read from Socket{id=1606317776045 fd=19613 addr=x.x.x.151:8060:12788} (0x14dbccfe3680): Connection timed out [110]
W1121 13:41:58.870364 17270 input_messenger.cpp:226] Fail to read from Socket{id=1486058713370 fd=20208 addr=x.x.x.151:8060:12854} (0x14b90818be80): Connection timed out [110]
W1121 13:42:12.695161 17201 input_messenger.cpp:226] Fail to read from Socket{id=1546188248985 fd=19154 addr=x.x.x.151:8060:12856} (0x14bfd26fd040): Connection timed out [110]
W1121 13:42:13.415155 17257 input_messenger.cpp:226] Fail to read from Socket{id=1211180794175 fd=21630 addr=x.x.x.151:8060:11918} (0x14b3c6616040): Connection timed out [110]
W1121 13:42:14.135141 17246 input_messenger.cpp:226] Fail to read from Socket{id=1468878818474 fd=2234 addr=x.x.x.151:8060:12920} (0x14dbb6b3f140): Connection timed out [110]
W1121 13:42:22.870136 17246 input_messenger.cpp:226] Fail to read from Socket{id=2405181706249 fd=21510 addr=x.x.x.218:8060:7730} (0x14d7b418be80): Connection timed out [110]
W1121 13:43:07.854173 17257 input_messenger.cpp:226] Fail to read from Socket{id=2087354139360 fd=21135 addr=x.x.x.218:8060:6136} (0x14bd326343c0): Connection timed out [110]
W1121 13:43:22.710209 17249 input_messenger.cpp:226] Fail to read from Socket{id=1992864841483 fd=20409 addr=x.x.x.218:8060:6776} (0x14ca5b00d1c0): Connection timed out [110]
W1121 13:43:22.718180 17272 input_messenger.cpp:226] Fail to read from Socket{id=919123002749 fd=17754 addr=x.x.x.218:8060:8920} (0x14ddb312f740): Connection timed out [110]
W1121 13:43:22.718238 17261 input_messenger.cpp:226] Fail to read from Socket{id=1769526558506 fd=17453 addr=x.x.x.218:8060:8918} (0x14b4e9bb8140): Connection timed out [110]
W1121 13:43:23.438181 17275 input_messenger.cpp:226] Fail to read from Socket{id=1657857378831 fd=17754 addr=x.x.x.218:8060:8938} (0x14dcb7a5b4c0): Connection timed out [110]
W1121 13:43:24.158257 17255 input_messenger.cpp:226] Fail to read from Socket{id=1357209688057 fd=1190 addr=x.x.x.218:8060:8940} (0x14ce7750e8c0): Connection timed out [110]
W1121 13:44:08.294281 17257 input_messenger.cpp:226] Fail to read from Socket{id=2027224566469 fd=20419 addr=x.x.x.218:8060:6782} (0x14dbfa2e9240): Connection timed out [110]
W1121 13:44:08.342268 17275 input_messenger.cpp:226] Fail to read from Socket{id=1537598307202 fd=20568 addr=x.x.x.218:8060:6926} (0x14afcdba9440): Connection timed out [110]
W1121 13:44:09.014256 17275 input_messenger.cpp:226] Fail to read from Socket{id=1692217130556 fd=15893 addr=x.x.x.218:8060:9006} (0x14afcfa5ac00): Connection timed out [110]
W1121 13:44:09.062295 17196 input_messenger.cpp:226] Fail to read from Socket{id=841813596804 fd=9963 addr=x.x.x.218:8060:9008} (0x14dc7bdaa740): Connection timed out [110]
W1121 13:44:09.734273 17279 input_messenger.cpp:226] Fail to read from Socket{id=1546188241794 fd=929 addr=x.x.x.218:8060:9010} (0x14afcdba9440): Connection timed out [110]
W1121 13:44:09.782255 17200 input_messenger.cpp:226] Fail to read from Socket{id=1511828490993 fd=17481 addr=x.x.x.218:8060:9012} (0x14dbfa2ef540): Connection timed out [110]
W1121 13:44:10.454221 17275 input_messenger.cpp:226] Fail to read from Socket{id=1262720407018 fd=384 addr=x.x.x.218:8060:9014} (0x14d1e5fcea00): Connection timed out [110]
W1121 13:44:10.502274 17255 input_messenger.cpp:226] Fail to read from Socket{id=1357209681585 fd=929 addr=x.x.x.218:8060:9016} (0x14ca5b000740): Connection timed out [110]
W1121 13:44:13.226372 16635 tablet_sink.cpp:1807] close channel failed. channel_name=NodeChannel[3397792], load_info=load_id=ad41fad2-c69e-11f0-9c15-a0369fd8ced8, txn_id: 72566406, parallel=1, compress_type=2, error_msg=[E110]Fail to read from Socket{id=2027224566469 fd=20419 addr=x.x.x.21
8:8060:6782} (0x0x14dbfa2e9240): Connection timed out [R1][E110]Fail to read from Socket{id=1692217130556 fd=15893 addr=x.x.x.218:8060:9006} (0x0x14afcfa5ac00): Connection timed out [R2][E110]Fail to read from Socket{id=1546188241794 fd=929 addr=x.x.x.218:8060:9010} (0x0x14afcdba9440):
Connection timed out [R3][E110]Fail to read from Socket{id=1262720407018 fd=384 addr=x.x.x.218:8060:9014} (0x0x14d1e5fcea00): Connection timed out
W1121 13:44:13.227124 16635 pipeline_driver.cpp:730] fragment_id ad41fad2-c69e-11f0-9c15-a0369fd8cef2 driver query_id=ad41fad2-c69e-11f0-9c15-a0369fd8ced8 fragment_id=ad41fad2-c69e-11f0-9c15-a0369fd8cef2 driver=driver_19_873, status=INPUT_EMPTY, operator-chain: [exchange_source_19_0x14c1786edd
10(X) -> spillable_hash_join_probe_23_0x14c177f84810(X)(HashJoiner=0x14c4aa63be10) -> project_24_0x14c1786ef110(X) -> spillable_hash_join_probe_95_0x14c177f84d10(X)(HashJoiner=0x14c4aa63a010) -> project_96_0x14c1786f0510(O) -> spillable_hash_join_probe_188_0x14c177f85210(X)(HashJoiner=0x14d0f6
db7010) -> project_189_0x14c1786f1910(O) -> spillable_hash_join_probe_261_0x14c177f85710(X)(HashJoiner=0x14d72309b010) -> project_262_0x14c1786f2d10(O) -> spillable_hash_join_probe_331_0x14c177f85c10(X)(HashJoiner=0x14d84cf6a010) -> project_332_0x14c1786f4110(O) -> spillable_hash_join_probe_39
7_0x14c177f86110(X)(HashJoiner=0x14d84b98d810) -> project_398_0x14c179c9c510(O) -> olap_table_sink_-1_0x14c179c9cf10(X)] cancels operator olap_table_sink_-1_0x14c179c9cf10(X) with finished error [E110]Fail to read from Socket{id=2027224566469 fd=20419 addr=x.x.x.218:8060:6782} (0x0x14dbfa2
e9240): Connection timed out [R1][E110]Fail to read from Socket{id=1692217130556 fd=15893 addr=x.x.x.218:8060:9006} (0x0x14afcfa5ac00): Connection timed out [R2][E110]Fail to read from Socket{id=1546188241794 fd=929 addr=x.x.x.218:8060:9010} (0x0x14afcdba9440): Connection timed out [R3
][E110]Fail to read from Socket{id=1262720407018 fd=384 addr=x.x.x.218:8060:9014} (0x0x14d1e5fcea00): Connection timed out
W1121 13:44:13.227509 16635 tablet_sink.cpp:1807] close channel failed. channel_name=NodeChannel[3397792], load_info=load_id=ad41fad2-c69e-11f0-9c15-a0369fd8ced8, txn_id: 72566406, parallel=1, compress_type=2, error_msg=[E110]Fail to read from Socket{id=1537598307202 fd=20568 addr=x.x.x.21
8:8060:6926} (0x0x14afcdba9440): Connection timed out [R1][E110]Fail to read from Socket{id=841813596804 fd=9963 addr=x.x.x.218:8060:9008} (0x0x14dc7bdaa740): Connection timed out [R2][E110]Fail to read from Socket{id=1511828490993 fd=17481 addr=x.x.x.218:8060:9012} (0x0x14dbfa2ef540):
Connection timed out [R3][E110]Fail to read from Socket{id=1357209681585 fd=929 addr=x.x.x.218:8060:9016} (0x0x14ca5b000740): Connection timed out
W1121 13:44:13.234254 16635 pipeline_driver.cpp:730] fragment_id ad41fad2-c69e-11f0-9c15-a0369fd8cef2 driver query_id=ad41fad2-c69e-11f0-9c15-a0369fd8ced8 fragment_id=ad41fad2-c69e-11f0-9c15-a0369fd8cef2 driver=driver_19_875, status=INPUT_EMPTY, operator-chain: [exchange_source_19_0x14c179ca6f
10(X) -> spillable_hash_join_probe_23_0x14c179c98410(X)(HashJoiner=0x14c4aa63ca10) -> project_24_0x14c179ca8310(X) -> spillable_hash_join_probe_95_0x14c179c98910(X)(HashJoiner=0x14c4aa63b210) -> project_96_0x14c179ca9710(X) -> spillable_hash_join_probe_188_0x14c179c98e10(X)(HashJoiner=0x14d0f6
db8210) -> project_189_0x14c179caab10(X) -> spillable_hash_join_probe_261_0x14c179c99310(X)(HashJoiner=0x14d72309c210) -> project_262_0x14c179cabf10(X) -> spillable_hash_join_probe_331_0x14c179c99810(X)(HashJoiner=0x14d84cf6b210) -> project_332_0x14c179cad310(O) -> spillable_hash_join_probe_39
7_0x14c179c99d10(X)(HashJoiner=0x14d84b98ea10) -> project_398_0x14c179cae710(O) -> olap_table_sink_-1_0x14c179caf110(X)] cancels operator olap_table_sink_-1_0x14c179caf110(X) with finished error [E110]Fail to read from Socket{id=1537598307202 fd=20568 addr=x.x.x.218:8060:6926} (0x0x14afcdb
a9440): Connection timed out [R1][E110]Fail to read from Socket{id=841813596804 fd=9963 addr=x.x.x.218:8060:9008} (0x0x14dc7bdaa740): Connection timed out [R2][E110]Fail to read from Socket{id=1511828490993 fd=17481 addr=x.x.x.218:8060:9012} (0x0x14dbfa2ef540): Connection timed out [R3
][E110]Fail to read from Socket{id=1357209681585 fd=929 addr=x.x.x.218:8060:9016} (0x0x14ca5b000740): Connection timed out
W1121 13:44:13.239461 16635 tablet_sink.cpp:1984] close channel failed. channel_name=NodeChannel[3397792], load_info=load_id=ad41fad2-c69e-11f0-9c15-a0369fd8ced8, txn_id: 72566406, parallel=1, compress_type=2, error_msg=[E110]Fail to read from Socket{id=2027224566469 fd=20419 addr=x.x.x.21
8:8060:6782} (0x0x14dbfa2e9240): Connection timed out [R1][E110]Fail to read from Socket{id=1692217130556 fd=15893 addr=x.x.x.218:8060:9006} (0x0x14afcfa5ac00): Connection timed out [R2][E110]Fail to read from Socket{id=1546188241794 fd=929 addr=x.x.x.218:8060:9010} (0x0x14afcdba9440):
Connection timed out [R3][E110]Fail to read from Socket{id=1262720407018 fd=384 addr=x.x.x.218:8060:9014} (0x0x14d1e5fcea00): Connection timed out
W1121 13:44:13.244699 16653 tablet_sink.cpp:1984] close channel failed. channel_name=NodeChannel[3397792], load_info=load_id=ad41fad2-c69e-11f0-9c15-a0369fd8ced8, txn_id: 72566406, parallel=1, compress_type=2, error_msg=[E110]Fail to read from Socket{id=2027224566469 fd=20419 addr=x.x.x.21
8:8060:6782} (0x0x14dbfa2e9240): Connection timed out [R1][E110]Fail to read from Socket{id=1692217130556 fd=15893 addr=x.x.x.218:8060:9006} (0x0x14afcfa5ac00): Connection timed out [R2][E110]Fail to read from Socket{id=1546188241794 fd=929 addr=x.x.x.218:8060:9010} (0x0x14afcdba9440):
Connection timed out [R3][E110]Fail to read from Socket{id=1262720407018 fd=384 addr=x.x.x.218:8060:9014} (0x0x14d1e5fcea00): Connection timed out
W1121 13:44:13.248042 16635 tablet_sink.cpp:1984] close channel failed. channel_name=NodeChannel[3397792], load_info=load_id=ad41fad2-c69e-11f0-9c15-a0369fd8ced8, txn_id: 72566406, parallel=1, compress_type=2, error_msg=[E110]Fail to read from Socket{id=1537598307202 fd=20568 addr=x.x.x.21
8:8060:6926} (0x0x14afcdba9440): Connection timed out [R1][E110]Fail to read from Socket{id=841813596804 fd=9963 addr=x.x.x.218:8060:9008} (0x0x14dc7bdaa740): Connection timed out [R2][E110]Fail to read from Socket{id=1511828490993 fd=17481 addr=x.x.x.218:8060:9012} (0x0x14dbfa2ef540):
Connection timed out [R3][E110]Fail to read from Socket{id=1357209681585 fd=929 addr=x.x.x.218:8060:9016} (0x0x14ca5b000740): Connection timed out
W1121 13:44:13.253327 16664 tablet_sink.cpp:1984] close channel failed. channel_name=NodeChannel[3397792], load_info=load_id=ad41fad2-c69e-11f0-9c15-a0369fd8ced8, txn_id: 72566406, parallel=1, compress_type=2, error_msg=[E110]Fail to read from Socket{id=1537598307202 fd=20568 addr=x.x.x.21
8:8060:6926} (0x0x14afcdba9440): Connection timed out [R1][E110]Fail to read from Socket{id=841813596804 fd=9963 addr=x.x.x.218:8060:9008} (0x0x14dc7bdaa740): Connection timed out [R2][E110]Fail to read from Socket{id=1511828490993 fd=17481 addr=x.x.x.218:8060:9012} (0x0x14dbfa2ef540):
Connection timed out [R3][E110]Fail to read from Socket{id=1357209681585 fd=929 addr=x.x.x.218:8060:9016} (0x0x14ca5b000740): Connection timed out
W1121 13:44:23.222164 17260 input_messenger.cpp:226] Fail to read from Socket{id=146028891016 fd=21398 addr=x.x.x.218:8060:10138} (0x14dca9bebc80): Connection timed out [110]
W1121 13:44:23.238196 17255 input_messenger.cpp:226] Fail to read from Socket{id=1133871385030 fd=21387 addr=x.x.x.218:8060:10124} (0x14dd738abd40): Connection timed out [110]
W1121 13:44:23.239144 17262 input_messenger.cpp:226] Fail to read from Socket{id=1546188247418 fd=21264 addr=x.x.x.218:8060:10052} (0x149f3d70a0c0): Connection timed out [110]
W1121 13:44:23.246235 17196 input_messenger.cpp:226] Fail to read from Socket{id=2430951491040 fd=20842 addr=x.x.x.218:8060:9702} (0x14dd145ea300): Connection timed out [110]
W1121 13:44:23.254235 17268 input_messenger.cpp:226] Fail to read from Socket{id=1795296357461 fd=21201 addr=x.x.x.218:8060:10010} (0x14ae01fdf540): Connection timed out [110]
W1121 13:44:25.934250 17255 input_messenger.cpp:226] Fail to read from Socket{id=317827599582 fd=21440 addr=x.x.x.218:8060:10174} (0x14c9a3c6e280): Connection timed out [110]
W1121 13:44:37.455154 17200 input_messenger.cpp:226] Fail to read from Socket{id=1305670083460 fd=20590 addr=x.x.x.218:8060:9466} (0x14b977ab8e40): Connection timed out [110]
W1121 13:44:37.463146 17255 input_messenger.cpp:226] Fail to read from Socket{id=2379411902777 fd=21029 addr=x.x.x.218:8060:9840} (0x149f3d700e80): Connection timed out [110]
W1121 13:44:37.574131 17275 input_messenger.cpp:226] Fail to read from Socket{id=644245114271 fd=21467 addr=x.x.x.218:8060:10196} (0x14c6609e7880): Connection timed out [110]
W1121 13:44:37.574133 17246 input_messenger.cpp:226] Fail to read from Socket{id=1382979484667 fd=20462 addr=x.x.x.218:8060:9346} (0x14bc6c199200): Connection timed out [110]
W1121 13:44:37.599143 17262 input_messenger.cpp:226] Fail to read from Socket{id=395137020004 fd=21098 addr=x.x.x.218:8060:9900} (0x14b63827cc00): Connection timed out [110]
W1121 13:44:37.623162 17199 input_messenger.cpp:226] Fail to read from Socket{id=1717986936848 fd=21288 addr=x.x.x.218:8060:10078} (0x14b09095bf80): Connection timed out [110]
W1121 13:44:37.630146 17250 input_messenger.cpp:226] Fail to read from Socket{id=1717986923995 fd=20515 addr=x.x.x.218:8060:9398} (0x14dd8c8a1440): Connection timed out [110]
W1121 13:44:38.174129 17270 input_messenger.cpp:226] Fail to read from Socket{id=2078764196826 fd=20704 addr=x.x.x.218:8060:9564} (0x14ab2f7c8400): Connection timed out [110]
W1121 13:44:44.014230 17279 input_messenger.cpp:226] Fail to read from Socket{id=2602750198013 fd=20591 addr=x.x.x.151:8060:13576} (0x14b3c660cbc0): Connection timed out [110]
W1121 13:46:00.142176 17262 input_messenger.cpp:226] Fail to read from Socket{id=2027224567870 fd=7689 addr=x.x.x.151:8060:15694} (0x14dd8e104c00): Connection timed out [110]
W1121 13:46:00.150123 17260 input_messenger.cpp:226] Fail to read from Socket{id=2645699861797 fd=20875 addr=x.x.x.151:8060:13836} (0x14dc7b004280): Connection timed out [110]
W1121 13:46:08.071174 17284 input_messenger.cpp:226] Fail to read from Socket{id=2516850840932 fd=20368 addr=x.x.x.218:8060:9266} (0x14dcba9f3dc0): Connection timed out [110]
W1121 13:46:08.079150 17277 input_messenger.cpp:226] Fail to read from Socket{id=1142461301750 fd=16470 addr=x.x.x.218:8060:11610} (0x14ddac73a000): Connection timed out [110]
W1121 13:46:08.110219 17197 input_messenger.cpp:226] Fail to read from Socket{id=2070174240478 fd=19813 addr=x.x.x.218:8060:11624} (0x14dca4280480): Connection timed out [110]
W1121 13:46:08.126163 17278 input_messenger.cpp:226] Fail to read from Socket{id=807453885125 fd=19854 addr=x.x.x.218:8060:11626} (0x14bd32630700): Connection timed out [110]
W1121 13:46:08.134171 17202 input_messenger.cpp:226] Fail to read from Socket{id=2001454780466 fd=19865 addr=x.x.x.218:8060:11632} (0x14d7b4191ac0): Connection timed out [110]
W1121 13:46:08.190167 17195 input_messenger.cpp:226] Fail to read from Socket{id=2095944065637 fd=19888 addr=x.x.x.218:8060:11648} (0x14be58cef340): Connection timed out [110]
W1121 13:46:08.790216 17276 input_messenger.cpp:226] Fail to read from Socket{id=987842485640 fd=12045 addr=x.x.x.218:8060:11656} (0x14dc7b012140): Connection timed out [110]
W1121 13:46:08.798141 17283 input_messenger.cpp:226] Fail to read from Socket{id=1700807052208 fd=14987 addr=x.x.x.218:8060:11658} (0x14dca9bf1680): Connection timed out [110]
W1121 13:46:09.510210 17281 input_messenger.cpp:226] Fail to read from Socket{id=1958505101275 fd=9032 addr=x.x.x.218:8060:11668} (0x14ad684ed140): Connection timed out [110]
W1121 13:46:09.518165 17278 input_messenger.cpp:226] Fail to read from Socket{id=2877628095001 fd=12045 addr=x.x.x.218:8060:11670} (0x14dc66c47580): Connection timed out [110]
W1121 13:46:13.950410 17262 input_messenger.cpp:226] Fail to read from Socket{id=1288490189667 fd=20731 addr=x.x.x.151:8060:13696} (0x14dcc9df8b40): Connection timed out [110]
W1121 13:46:24.294323 17260 input_messenger.cpp:226] Fail to read from Socket{id=2224793087390 fd=19854 addr=x.x.x.218:8060:11664} (0x14b7ee5a51c0): Connection timed out [110]
W1121 13:46:37.910400 17270 socket.cpp:1648] Fail to keep-write into Socket{id=798863939836 fd=20121 addr=x.x.x.218:8060:12168} (0x14d06d09d480): Connection timed out [110]
W1121 13:47:23.342270 17262 input_messenger.cpp:226] Fail to read from Socket{id=962072693074 fd=18489 addr=x.x.x.218:8060:12892} (0x14d235a4ac00): Connection timed out [110]
W1121 13:47:23.358295 17255 input_messenger.cpp:226] Fail to read from Socket{id=1176821060129 fd=20566 addr=x.x.x.218:8060:13310} (0x14d0a97eafc0): Connection timed out [110]
W1121 13:47:23.358299 17257 input_messenger.cpp:226] Fail to read from Socket{id=1786706427005 fd=7022 addr=x.x.x.218:8060:12602} (0x14bddeb96c40): Connection timed out [110]
W1121 13:47:24.063194 17275 input_messenger.cpp:226] Fail to read from Socket{id=2259152829731 fd=20473 addr=x.x.x.218:8060:13264} (0x14ac0cf3aa40): Connection timed out [110]

I also applied set global pipeline_dop=4, but it doesn't seem to have any effect. The logs still contain a large number of Connection timed out errors.

This looks like a bottleneck in the network framework or in the network itself. Try changing this in be.conf: brpc_max_connections_per_server=4
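Before restarting each BE, it can be worth confirming that be.conf on every node really carries the two values suggested in this thread (brpc_connection_type=pooled and brpc_max_connections_per_server=4). The sketch below assumes be.conf uses plain `key = value` lines with `#` comments; the helper names `parse_be_conf` and `diff_against_suggested` are made up for illustration, and the "suggested" values are the ones proposed in the replies here, not official defaults.

```python
def parse_be_conf(text):
    """Parse 'key = value' lines, skipping blank lines and '#' comments."""
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip()
    return settings


# Values proposed in this thread (assumption: they suit this cluster).
SUGGESTED = {
    "brpc_connection_type": "pooled",
    "brpc_max_connections_per_server": "4",
}


def diff_against_suggested(settings, suggested=SUGGESTED):
    """Return {key: (current_value_or_None, suggested_value)} for mismatches."""
    return {
        key: (settings.get(key), wanted)
        for key, wanted in suggested.items()
        if settings.get(key) != wanted
    }


if __name__ == "__main__":
    # Hypothetical file content; paste the real be.conf text instead.
    sample_conf = """
    # BE configuration (sample)
    brpc_connection_type = pooled
    """
    for key, (current, wanted) in diff_against_suggested(parse_be_conf(sample_conf)).items():
        print(f"{key}: current={current!r}, suggested={wanted!r}")
```

An empty result means both keys are already set as suggested; the BE still has to be restarted for be.conf changes to take effect.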

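In the last two or three days there has been a new development: the 3 BE nodes crash in turn, fairly frequently.
be.out contains the following (the errors are similar on all 3 BEs):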

3.1.15 RELEASE (build 5625961)
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 110951443864
tracker:query_pool consumption: 352440133
tracker:load consumption: 524159204
tracker:metadata consumption: 2146928089
tracker:tablet_metadata consumption: 101372955
tracker:rowset_metadata consumption: 118975606
tracker:segment_metadata consumption: 220503174
tracker:column_metadata consumption: 1706076354
tracker:tablet_schema consumption: 1489531
tracker:segment_zonemap consumption: 46318051
tracker:short_key_index consumption: 163422229
tracker:column_zonemap_index consumption: 201321554
tracker:ordinal_index consumption: 1366010264
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 4400534682
tracker:page_cache consumption: 87774398384
tracker:update consumption: 10930583206
tracker:chunk_allocator consumption: 2148140024
tracker:clone consumption: 0
tracker:consistency consumption: 0
tracker:datacache consumption: 0
tracker:replication consumption: 0
*** Aborted at 1763760994 (unix time) try "date -d @1763760994" if you are using GNU date ***
PC: @ 0x5906354 starrocks::RowsetWriter::build()
*** SIGSEGV (@0xb0) received by PID 18441 (TID 0x14c692dfe700) from PID 176; stack trace: ***
@ 0x67e5682 google::(anonymous namespace)::FailureSignalHandler()
@ 0x14c6a9069630 (unknown)
@ 0x5906354 starrocks::RowsetWriter::build()
@ 0x5910b05 starrocks::HorizontalRowsetWriter::build()
@ 0x52bc7ec starrocks::DeltaWriter::commit()
@ 0x50419a5 starrocks::SegmentFlushTask::run()
@ 0x2e90add starrocks::ThreadPool::dispatch_thread()
@ 0x2e89eda starrocks::thread::supervise_thread()
@ 0x14c6a9061ea5 start_thread
@ 0x14c6a8462b0d __clone
@ 0x0 (unknown)
start time: Sat Nov 22 07:10:41 CST 2025, server uptime: 07:10:41 up 2 days, 9:01, 1 user, load average: 0.04, 0.03, 0.05
3.1.15 RELEASE (build 5625961)
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 106518130984
tracker:query_pool consumption: 1038786005
tracker:load consumption: 683160000
tracker:metadata consumption: 2003910930
tracker:tablet_metadata consumption: 101284096
tracker:rowset_metadata consumption: 124211519
tracker:segment_metadata consumption: 218517531
tracker:column_metadata consumption: 1559897784
tracker:tablet_schema consumption: 1452768
tracker:segment_zonemap consumption: 39864028
tracker:short_key_index consumption: 169197762
tracker:column_zonemap_index consumption: 110359608
tracker:ordinal_index consumption: 1330689832
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 4621706848
tracker:page_cache consumption: 87782607552
tracker:update consumption: 6791024316
tracker:chunk_allocator consumption: 2147938504
tracker:clone consumption: 0
tracker:consistency consumption: 0
tracker:datacache consumption: 0
tracker:replication consumption: 0
*** Aborted at 1763840089 (unix time) try "date -d @1763840089" if you are using GNU date ***
PC: @ 0x5906354 starrocks::RowsetWriter::build()
*** SIGSEGV (@0xb0) received by PID 2947 (TID 0x14a50237e700) from PID 176; stack trace: ***
@ 0x67e5682 google::(anonymous namespace)::FailureSignalHandler()
@ 0x14a51a178630 (unknown)
@ 0x5906354 starrocks::RowsetWriter::build()
@ 0x5910b05 starrocks::HorizontalRowsetWriter::build()
@ 0x52bc7ec starrocks::DeltaWriter::commit()
@ 0x50419a5 starrocks::SegmentFlushTask::run()
@ 0x2e90add starrocks::ThreadPool::dispatch_thread()
@ 0x2e89eda starrocks::thread::supervise_thread()
@ 0x14a51a170ea5 start_thread
@ 0x14a519571b0d __clone
@ 0x0 (unknown)
start time: Sun Nov 23 07:36:52 CST 2025, server uptime: 07:36:52 up 3 days, 9:27, 1 user, load average: 0.38, 0.19, 0.12
3.1.15 RELEASE (build 5625961)
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 177274511632
tracker:query_pool consumption: 51198282069
tracker:load consumption: 4873032
tracker:metadata consumption: 1207189201
tracker:tablet_metadata consumption: 102914004
tracker:rowset_metadata consumption: 159412825
tracker:segment_metadata consumption: 99778820
tracker:column_metadata consumption: 845083552
tracker:tablet_schema consumption: 1454676
tracker:segment_zonemap consumption: 13497748
tracker:short_key_index consumption: 81882989
tracker:column_zonemap_index consumption: 32946736
tracker:ordinal_index consumption: 775483840
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 2915275092
tracker:page_cache consumption: 87772776208
tracker:update consumption: 28453176479
tracker:chunk_allocator consumption: 851685216
tracker:clone consumption: 0
tracker:consistency consumption: 0
tracker:datacache consumption: 0
tracker:replication consumption: 0
*** Aborted at 1763858734 (unix time) try "date -d @1763858734" if you are using GNU date ***
PC: @ 0x5906354 starrocks::RowsetWriter::build()
*** SIGSEGV (@0xb0) received by PID 8236 (TID 0x14b0839fe700) from PID 176; stack trace: ***
@ 0x67e5682 google::(anonymous namespace)::FailureSignalHandler()
@ 0x14b09ab44630 (unknown)
@ 0x5906354 starrocks::RowsetWriter::build()
@ 0x5910b05 starrocks::HorizontalRowsetWriter::build()
@ 0x52bc7ec starrocks::DeltaWriter::commit()
@ 0x50419a5 starrocks::SegmentFlushTask::run()
@ 0x2e90add starrocks::ThreadPool::dispatch_thread()
@ 0x2e89eda starrocks::thread::supervise_thread()
@ 0x14b09ab3cea5 start_thread
@ 0x14b099f3db0d __clone
@ 0x0 (unknown)
start time: Sun Nov 23 09:50:14 CST 2025, server uptime: 09:50:14 up 3 days, 11:40, 1 user, load average: 0.00, 0.17, 0.47
3.1.15 RELEASE (build 5625961)
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 110900331376
tracker:query_pool consumption: 1780518804
tracker:load consumption: 1019201956
tracker:metadata consumption: 1927151639
tracker:tablet_metadata consumption: 101224468
tracker:rowset_metadata consumption: 115132931
tracker:segment_metadata consumption: 191845698
tracker:column_metadata consumption: 1518948542
tracker:tablet_schema consumption: 1451156
tracker:segment_zonemap consumption: 36158511
tracker:short_key_index consumption: 146572870
tracker:column_zonemap_index consumption: 112475374
tracker:ordinal_index consumption: 1302177000
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 4421361545
tracker:page_cache consumption: 87776171024
tracker:update consumption: 10412731344
tracker:chunk_allocator consumption: 2148037136
tracker:clone consumption: 0
tracker:consistency consumption: 0
tracker:datacache consumption: 0
tracker:replication consumption: 0
*** Aborted at 1763925499 (unix time) try "date -d @1763925499" if you are using GNU date ***
PC: @ 0x5906354 starrocks::RowsetWriter::build()
*** SIGSEGV (@0xb0) received by PID 19243 (TID 0x1532ffdff700) from PID 176; stack trace: ***
@ 0x67e5682 google::(anonymous namespace)::FailureSignalHandler()
@ 0x153317f1a630 (unknown)
@ 0x5906354 starrocks::RowsetWriter::build()
@ 0x5910b05 starrocks::HorizontalRowsetWriter::build()
@ 0x52bc7ec starrocks::DeltaWriter::commit()
@ 0x50419a5 starrocks::SegmentFlushTask::run()
@ 0x2e90add starrocks::ThreadPool::dispatch_thread()
@ 0x2e89eda starrocks::thread::supervise_thread()
@ 0x153317f12ea5 start_thread
@ 0x153317313b0d __clone
@ 0x0 (unknown)
start time: Mon Nov 24 03:20:02 CST 2025, server uptime: 03:20:02 up 4 days, 5:10, 0 users, load average: 4.31, 6.22, 8.35
3.1.15 RELEASE (build 5625961)
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 102853695800
tracker:query_pool consumption: 65600
tracker:load consumption: 379056
tracker:metadata consumption: 1135406708
tracker:tablet_metadata consumption: 101671094
tracker:rowset_metadata consumption: 125125388
tracker:segment_metadata consumption: 97546816
tracker:column_metadata consumption: 811063410
tracker:tablet_schema consumption: 1450230
tracker:segment_zonemap consumption: 6591270
tracker:short_key_index consumption: 88413108
tracker:column_zonemap_index consumption: 15569370
tracker:ordinal_index consumption: 771785840
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 2665561611
tracker:page_cache consumption: 87772347072
tracker:update consumption: 6945720215
tracker:chunk_allocator consumption: 2147632832
tracker:clone consumption: 0
tracker:consistency consumption: 0
tracker:datacache consumption: 0
tracker:replication consumption: 0
*** Aborted at 1763927658 (unix time) try "date -d @1763927658" if you are using GNU date ***
PC: @ 0x5906354 starrocks::RowsetWriter::build()
*** SIGSEGV (@0xb0) received by PID 11265 (TID 0x14d91b7fe700) from PID 176; stack trace: ***
@ 0x67e5682 google::(anonymous namespace)::FailureSignalHandler()
@ 0x14d933ed3630 (unknown)
@ 0x5906354 starrocks::RowsetWriter::build()
@ 0x5910b05 starrocks::HorizontalRowsetWriter::build()
@ 0x52bc7ec starrocks::DeltaWriter::commit()
@ 0x50419a5 starrocks::SegmentFlushTask::run()
@ 0x2e90add starrocks::ThreadPool::dispatch_thread()
@ 0x2e89eda starrocks::thread::supervise_thread()
@ 0x14d933ecbea5 start_thread
@ 0x14d9332ccb0d __clone
@ 0x0 (unknown)
start time: Mon Nov 24 03:55:01 CST 2025, server uptime: 03:55:01 up 4 days, 5:45, 0 users, load average: 2.65, 6.05, 6.92
3.1.15 RELEASE (build 5625961)
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 104617834560
tracker:query_pool consumption: 973246175
tracker:load consumption: 527961162
tracker:metadata consumption: 1555470151
tracker:tablet_metadata consumption: 101112964
tracker:rowset_metadata consumption: 108697525
tracker:segment_metadata consumption: 139116962
tracker:column_metadata consumption: 1206542700
tracker:tablet_schema consumption: 1452132
tracker:segment_zonemap consumption: 9199454
tracker:short_key_index consumption: 126222384
tracker:column_zonemap_index consumption: 27319052
tracker:ordinal_index consumption: 1146274000
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 2837922089
tracker:page_cache consumption: 87772708960
tracker:update consumption: 7449995885
tracker:chunk_allocator consumption: 2147713160
tracker:clone consumption: 0
tracker:consistency consumption: 0
tracker:datacache consumption: 0
tracker:replication consumption: 0
*** Aborted at 1763932458 (unix time) try "date -d @1763932458" if you are using GNU date ***
PC: @ 0x5906354 starrocks::RowsetWriter::build()
*** SIGSEGV (@0xb0) received by PID 31717 (TID 0x148759bfe700) from PID 176; stack trace: ***
@ 0x67e5682 google::(anonymous namespace)::FailureSignalHandler()
@ 0x14877194d630 (unknown)
@ 0x5906354 starrocks::RowsetWriter::build()
@ 0x5910b05 starrocks::HorizontalRowsetWriter::build()
@ 0x52bc7ec starrocks::DeltaWriter::commit()
@ 0x50419a5 starrocks::SegmentFlushTask::run()
@ 0x2e90add starrocks::ThreadPool::dispatch_thread()
@ 0x2e89eda starrocks::thread::supervise_thread()
@ 0x148771945ea5 start_thread
@ 0x148770d46b0d __clone
@ 0x0 (unknown)
start time: Mon Nov 24 05:15:01 CST 2025, server uptime: 05:15:01 up 4 days, 7:05, 0 users, load average: 1.10, 6.02, 8.33
3.1.15 RELEASE (build 5625961)
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 99271910720
tracker:query_pool consumption: 568556208
tracker:load consumption: 275839020
tracker:metadata consumption: 652445505
tracker:tablet_metadata consumption: 100959146
tracker:rowset_metadata consumption: 108322282
tracker:segment_metadata consumption: 47347637
tracker:column_metadata consumption: 395816440
tracker:tablet_schema consumption: 1448682
tracker:segment_zonemap consumption: 1685436
tracker:short_key_index consumption: 44624136
tracker:column_zonemap_index consumption: 2934376
tracker:ordinal_index consumption: 383819520
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 1866177950
tracker:page_cache consumption: 87776769632
tracker:update consumption: 6149587653
tracker:chunk_allocator consumption: 579433904
tracker:clone consumption: 0
tracker:consistency consumption: 0
tracker:datacache consumption: 0
tracker:replication consumption: 0
*** Aborted at 1763933058 (unix time) try "date -d @1763933058" if you are using GNU date ***
PC: @ 0x5906354 starrocks::RowsetWriter::build()
*** SIGSEGV (@0xb0) received by PID 29184 (TID 0x1486b61fd700) from PID 176; stack trace: ***
@ 0x67e5682 google::(anonymous namespace)::FailureSignalHandler()
@ 0x1486cf149630 (unknown)
@ 0x5906354 starrocks::RowsetWriter::build()
@ 0x5910b05 starrocks::HorizontalRowsetWriter::build()
@ 0x52bc7ec starrocks::DeltaWriter::commit()
@ 0x50419a5 starrocks::SegmentFlushTask::run()
@ 0x2e90add starrocks::ThreadPool::dispatch_thread()
@ 0x2e89eda starrocks::thread::supervise_thread()
@ 0x1486cf141ea5 start_thread
@ 0x1486ce542b0d __clone
@ 0x0 (unknown)
start time: Mon Nov 24 05:25:01 CST 2025, server uptime: 05:25:01 up 4 days, 7:15, 0 users, load average: 0.68, 2.23, 5.27

This issue has already been fixed: https://github.com/StarRocks/starrocks/pull/64360. You can apply that patch on top of 3.1, or upgrade to 3.3.20.
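
For anyone comparing their own be.out against the dumps above: all five crashes show the same SIGSEGV with the top frame in starrocks::RowsetWriter::build(), reached from DeltaWriter::commit() and SegmentFlushTask::run(). Below is a rough Python sketch for pulling the crash time, signal, top frame, and largest memory trackers out of one dump. It is not a StarRocks tool. The function name `summarize_crash` and the sample text are made up for illustration, and it assumes that "CST" in the log means China Standard Time (UTC+8) and that the tracker values are bytes.

```python
import re
from datetime import datetime, timedelta, timezone

# Assumption: "CST" in be.out is China Standard Time (UTC+8).
CST = timezone(timedelta(hours=8))

TRACKER_RE = re.compile(r"tracker:(\w+) consumption: (\d+)")
ABORT_RE = re.compile(r"\*\*\* Aborted at (\d+) \(unix time\)")
SIGNAL_RE = re.compile(r"\*\*\* (SIG\w+) ")
PC_RE = re.compile(r"^PC: @ \S+ (.+)$", re.MULTILINE)

# Excerpt of the first dump in this thread, used only as sample input.
SAMPLE = """\
tracker:process consumption: 177274511632
tracker:query_pool consumption: 51198282069
tracker:page_cache consumption: 87772776208
tracker:update consumption: 28453176479
*** Aborted at 1763858734 (unix time) try "date -d @1763858734" if you are using GNU date ***
PC: @ 0x5906354 starrocks::RowsetWriter::build()
*** SIGSEGV (@0xb0) received by PID 8236 (TID 0x14b0839fe700) from PID 176; stack trace: ***
@ 0x5906354 starrocks::RowsetWriter::build()
"""


def summarize_crash(dump: str, top_n: int = 3) -> dict:
    """Summarize one crash dump (hypothetical helper, not part of StarRocks)."""
    # Tracker values are assumed to be bytes; convert to GiB for readability.
    trackers = {name: int(value) for name, value in TRACKER_RE.findall(dump)}
    ranked = sorted(trackers.items(), key=lambda kv: kv[1], reverse=True)
    top = [(name, round(value / 2**30, 2)) for name, value in ranked[:top_n]]

    abort = ABORT_RE.search(dump)
    crashed_at = None
    if abort:
        # Same conversion as `date -d @<epoch>`, but pinned to UTC+8.
        ts = datetime.fromtimestamp(int(abort.group(1)), tz=CST)
        crashed_at = ts.strftime("%Y-%m-%d %H:%M:%S")

    signal = SIGNAL_RE.search(dump)
    pc = PC_RE.search(dump)
    return {
        "crashed_at": crashed_at,
        "signal": signal.group(1) if signal else None,
        "pc": pc.group(1) if pc else None,
        "top_trackers": top,
    }


if __name__ == "__main__":
    for key, value in summarize_crash(SAMPLE).items():
        print(f"{key}: {value}")
```

To use it on a real file, split be.out at each "RELEASE (build" line and pass each chunk to `summarize_crash`. If every chunk reports the same signal and top frame as above, you are probably hitting the same crash that the linked PR addresses, but check the full stack trace before patching.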