Background: Routine Load jobs or Flink SQL jobs occasionally fail with the following errors.
Job failure 2023-06-10 07:09:16, instance 71
Job changed from RUNNING to RESTARTING. Exception message: com.starrocks.data.load.stream.exception.StreamLoadFailException: { "TxnId": -1, "Label": "40b89596-6f6c-45fb-8a7a-a8d1183f7265", "Status": "Fail", "Message": "Couldn't open transport for 10.128.8.75:9020 (open() timed out)", "NumberTotalRows": 0, "NumberLoadedRows": 0, "…
Job failure 2023-06-10 02:05:41, instance 71
Job changed from RUNNING to RESTARTING. Exception message: com.starrocks.data.load.stream.exception.StreamLoadFailException: { "TxnId": 16295231, "Label": "296cd17b-83df-49d9-b5f2-4ec564060a0a", "Status": "Fail", "Message": "[E1008]Reached timeout=60000ms @10.128.8.96:8060", "NumberTotalRows": 0, "NumberLoadedRows": 0, "NumberFil…
Job failure 2023-06-10 00:24:30, instance 112
Job changed from RUNNING to RESTARTING. Exception message: com.starrocks.data.load.stream.exception.StreamLoadFailException: { "TxnId": 16238147, "Label": "b6c7c906-7b94-4401-9d93-4fbd2b5925be", "Status": "Fail", "Message": "Commit transaction fail cause call frontend service failed, address=TNetworkAddress(hostname=10.128.8.75, port=9020), …
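After checking from several angles, I concluded that this is very likely triggered by the CBO's automatic statistics collection.
ADMIN SET FRONTEND CONFIG ("enable_collect_full_statistic" = "false");
After turning it off, the errors were effectively avoided.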
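
For anyone trying the same workaround, here is a minimal sketch of how the change might be checked and applied from a MySQL client connected to an FE. It assumes a StarRocks version that supports `ADMIN SHOW FRONTEND CONFIG` and `SHOW ANALYZE STATUS`. Using `SHOW ANALYZE STATUS` to line up collection tasks with the failure timestamps is my own suggestion, not something verified in the original investigation.

```sql
-- Check the current value on the FE you are connected to.
ADMIN SHOW FRONTEND CONFIG LIKE "enable_collect_full_statistic";

-- Assumption: listing recent statistics collection tasks may help confirm
-- whether they overlap with the load failures (for example around 00:24,
-- 02:05 and 07:09 on 2023-06-10 in the alerts above).
SHOW ANALYZE STATUS;

-- Disable automatic full statistics collection at runtime.
ADMIN SET FRONTEND CONFIG ("enable_collect_full_statistic" = "false");

-- Confirm that the new value took effect.
ADMIN SHOW FRONTEND CONFIG LIKE "enable_collect_full_statistic";
```

As far as I know, a value changed with `ADMIN SET FRONTEND CONFIG` is not kept after an FE restart, so to make the workaround permanent the same setting would also need to go into `fe.conf` (`enable_collect_full_statistic = false`). Whether the runtime change reaches every FE or only the one you are connected to may depend on the version, so it is worth running the check on each FE.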