Flink write to a table fails with "Cancelled because of runtime state is cancelled"

Error message
Caused by: com.starrocks.data.load.stream.exception.StreamLoadFailException: Stream load failed because of error, db: nedw_ods, table: t_ods_data, label: flink-df70c65e-189f-4b54-9c1a-d1554394d72c,
responseBody: {
"Status": "CANCELLED",
"Message": "Cancelled because of runtime state is cancelled"
}

For this kind of error, take the label and the txn id and search for them in the FE leader node's log and in the be.INFO log.
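The search described above can be sketched in Python roughly as follows. This is only a minimal illustration: the helper name `find_load_lines` is made up for this post, and the log file names in the usage note are assumptions about a typical deployment, not something confirmed in this thread.

```python
import re
from typing import Iterable, List


def find_load_lines(lines: Iterable[str], label: str, txn_id: int) -> List[str]:
    """Return the log lines that mention the load label or the txn id.

    Hypothetical helper, not part of StarRocks or the Flink connector.
    """
    # Match the txn id only as a standalone number, so 63149 does not
    # also match 631490 or the tail of a longer id.
    txn_pattern = re.compile(rf"\b{txn_id}\b")
    return [
        line.rstrip("\n")
        for line in lines
        if label in line or txn_pattern.search(line)
    ]


if __name__ == "__main__":
    sample = [
        "W0305 03:29:48.501678 574 stream_load_executor.cpp:111] fragment execute failed, "
        "txn_id: 63149, label=flink-df70c65e-189f-4b54-9c1a-d1554394d72c, db=nedw_ods",
        "I0305 03:29:48.000000 574 other.cpp:1] txn_id: 631490 belongs to another load",
    ]
    for hit in find_load_lines(sample, "flink-df70c65e-189f-4b54-9c1a-d1554394d72c", 63149):
        print(hit)
```

To run it against real files, pass an open file object instead of the sample list, for example `find_load_lines(open("be.INFO", errors="replace"), label, txn_id)`, and repeat for the FE leader's log (the exact file names and paths depend on your installation).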

FE log:

I don't see this log entry on the CN nodes.

Search for 63149. Is this a shared-data (storage-compute separated) cluster?

Yes, it is a shared-data cluster.

I searched both the FE and CN logs and did not find 63149.

W0305 03:29:48.501678 574 stream_load_executor.cpp:111] fragment execute failed, query_id=8648381633ad711e-b743dc0f865446bb, err_msg=Cancelled because of runtime state is cancelled, id=8648381633ad711e-b743dc0f865446bb, job_id=-1, txn_id: 63149, label=flink-df70c65e-189f-4b54-9c1a-d1554394d72c, db=nedw_ods
W0305 03:29:48.506722 837 transaction_stream_load.cpp:542] append body content failed. errmsg=Cancelled: Cancelled because of runtime state is cancelled
/build/starrocks/be/src/exec/connector_scan_node.cpp:191 _data_source->get_next(state, chunk)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:331 _plan->get_next(_runtime_state, &_chunk, &_done)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:231 _get_next_internal_vectorized(&chunk) context=id=8648381633ad711e-b743dc0f865446bb, job_id=-1, txn_id: 63149, label=flink-df70c65e-189f-4b54-9c1a-d1554394d72c, db=nedw_ods
W0305 03:29:48.506959 837 transaction_stream_load.cpp:542] append body content failed. errmsg=Cancelled: Cancelled because of runtime state is cancelled
/build/starrocks/be/src/exec/connector_scan_node.cpp:191 _data_source->get_next(state, chunk)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:331 _plan->get_next(_runtime_state, &_chunk, &_done)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:231 _get_next_internal_vectorized(&chunk) context=id=8648381633ad711e-b743dc0f865446bb, job_id=-1, txn_id: 63149, label=flink-df70c65e-189f-4b54-9c1a-d1554394d72c, db=nedw_ods
W0305 03:29:48.507161 837 transaction_stream_load.cpp:542] append body content failed. errmsg=Cancelled: Cancelled because of runtime state is cancelled
/build/starrocks/be/src/exec/connector_scan_node.cpp:191 _data_source->get_next(state, chunk)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:331 _plan->get_next(_runtime_state, &_chunk, &_done)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:231 _get_next_internal_vectorized(&chunk) context=id=8648381633ad711e-b743dc0f865446bb, job_id=-1, txn_id: 63149, label=flink-df70c65e-189f-4b54-9c1a-d1554394d72c, db=nedw_ods
W0305 03:29:48.507385 837 transaction_stream_load.cpp:542] append body content failed. errmsg=Cancelled: Cancelled because of runtime state is cancelled
/build/starrocks/be/src/exec/connector_scan_node.cpp:191 _data_source->get_next(state, chunk)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:331 _plan->get_next(_runtime_state, &_chunk, &_done)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:231 _get_next_internal_vectorized(&chunk) context=id=8648381633ad711e-b743dc0f865446bb, job_id=-1, txn_id: 63149, label=flink-df70c65e-189f-4b54-9c1a-d1554394d72c, db=nedw_ods
W0305 03:29:48.507593 837 transaction_stream_load.cpp:542] append body content failed. errmsg=Cancelled: Cancelled because of runtime state is cancelled
/build/starrocks/be/src/exec/connector_scan_node.cpp:191 _data_source->get_next(state, chunk)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:331 _plan->get_next(_runtime_state, &_chunk, &_done)

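@Liang I've run into the same problem. Could you tell me what is going on here? Please help.
This is a shared-data cluster using Ceph S3 as storage. The exception occurs during high-volume Flink ingestion, at about 30000 rps. The data includes a JSON field, and the payloads vary in size. Is there any approach or configuration that could improve this?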
------

Error log

I0409 08:20:43.819255 894 stream_load.cpp:240] new income streaming load request.id=29423e101d87c0f6-be9b99da830b83ac, job_id=-1, txn_id: -1, label=262fd360-061c-4142-b716-b2487926dced, db=test_link_restore, db=test_link_restore, tbl=hit_risk_detail
I0409 08:20:43.821085 894 stream_load_executor.cpp:67] begin to execute job. label=262fd360-061c-4142-b716-b2487926dced, txn_id: 6809, query_id=29423e10-1d87-c0f6-be9b-99da830b83ac
I0409 08:20:43.821121 894 plan_fragment_executor.cpp:83] Prepare(): query_id=29423e10-1d87-c0f6-be9b-99da830b83ac fragment_instance_id=29423e10-1d87-c0f6-be9b-99da830b83ad backend_num=0
I0409 08:20:43.821528 277 plan_fragment_executor.cpp:185] Open(): fragment_instance_id=29423e10-1d87-c0f6-be9b-99da830b83ad
I0409 08:20:43.821907 894 stream_load.cpp:240] new income streaming load request.id=3944f600316f59b0-8d89ba5ba218f589, job_id=-1, txn_id: -1, label=9f9cddda-c21f-4b3e-b594-00500a8113be, db=test_link_restore, db=test_link_restore, tbl=hit_risk_detail
I0409 08:20:43.823562 894 stream_load_executor.cpp:67] begin to execute job. label=9f9cddda-c21f-4b3e-b594-00500a8113be, txn_id: 6810, query_id=3944f600-316f-59b0-8d89-ba5ba218f589
I0409 08:20:43.823588 894 plan_fragment_executor.cpp:83] Prepare(): query_id=3944f600-316f-59b0-8d89-ba5ba218f589 fragment_instance_id=3944f600-316f-59b0-8d89-ba5ba218f58a backend_num=0
I0409 08:20:43.823911 283 plan_fragment_executor.cpp:185] Open(): fragment_instance_id=3944f600-316f-59b0-8d89-ba5ba218f58a
I0409 08:20:44.216621 894 stream_load.cpp:240] new income streaming load request.id=ed40f85a685198b8-77b73f71b005a393, job_id=-1, txn_id: -1, label=883164b7-809b-4039-82b3-5238d3cf60e9, db=test_link_restore, db=test_link_restore, tbl=hit_risk_detail
I0409 08:20:44.218667 894 stream_load_executor.cpp:67] begin to execute job. label=883164b7-809b-4039-82b3-5238d3cf60e9, txn_id: 6815, query_id=ed40f85a-6851-98b8-77b7-3f71b005a393
I0409 08:20:44.218703 894 plan_fragment_executor.cpp:83] Prepare(): query_id=ed40f85a-6851-98b8-77b7-3f71b005a393 fragment_instance_id=ed40f85a-6851-98b8-77b7-3f71b005a394 backend_num=0
I0409 08:20:44.219130 280 plan_fragment_executor.cpp:185] Open(): fragment_instance_id=ed40f85a-6851-98b8-77b7-3f71b005a394
DataSource:
- DataSourceType: FileDataSource
- FileScanner: 0
- CastChunkTime: 0.000ns
- CreateChunkTime: 0.000ns
- FileReadTime: 0.000ns
- FillTime: 0.000ns
- MaterializeTime: 0.000ns
- ReadTime: 0.000ns
- ScannerTotalTime: 0.000ns
W0409 08:22:53.916213 277 stream_load_executor.cpp:102] fragment execute failed, query_id=29423e101d87c0f6-be9b99da830b83ac, err_msg=Cancelled because of runtime state is cancelled, id=29423e101d87c0f6-be9b99da830b83ac, job_id=-1, txn_id: 6809, label=262fd360-061c-4142-b716-b2487926dced, db=test_link_restore
W0409 08:22:53.916250 894 stream_load.cpp:373] append body content failed. errmsg=Cancelled: Cancelled because of runtime state is cancelled
/build/starrocks/be/src/exec/connector_scan_node.cpp:66 _data_source->get_next(state, chunk)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:322 _plan->get_next(_runtime_state, &_chunk, &_done)
/build/starrocks/be/src/runtime/plan_fragment_executor.cpp:228 _get_next_internal_vectorized(&chunk) context=id=29423e101d87c0f6-be9b99da830b83ac, job_id=-1, txn_id: 6809, label=262fd360-061c-4142-b716-b2487926dced, db=test_link_restore
I0409 08:22:53.927573 786 lake_service.cpp:327] Aborting transactions=[6809] tablets=[]
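One thing that can be read directly from the log above is how long txn 6809 ran between "begin to execute job" and "fragment execute failed". The sketch below computes that gap from the two timestamps. It does not identify the cause; it only makes the interval explicit so it can be compared with whatever timeouts are configured on the Flink side and on the cluster. The helper name `glog_time` is made up for this post, and because these log lines carry no year, the code fills in a placeholder year.

```python
import re
from datetime import datetime

# BE log lines start with a severity letter, MMDD, then HH:MM:SS.microseconds.
LOG_PREFIX = re.compile(r"^[IWEF](\d{4}) (\d{2}:\d{2}:\d{2}\.\d{6})")


def glog_time(line: str) -> datetime:
    """Parse the timestamp at the start of a BE log line (hypothetical helper)."""
    match = LOG_PREFIX.match(line)
    if match is None:
        raise ValueError(f"no timestamp prefix in line: {line!r}")
    # The log has no year; 2000 is a placeholder used only for subtraction.
    return datetime.strptime(
        f"2000{match.group(1)} {match.group(2)}", "%Y%m%d %H:%M:%S.%f"
    )


begin_line = (
    "I0409 08:20:43.821085 894 stream_load_executor.cpp:67] "
    "begin to execute job. label=262fd360-061c-4142-b716-b2487926dced, txn_id: 6809"
)
failed_line = (
    "W0409 08:22:53.916213 277 stream_load_executor.cpp:102] "
    "fragment execute failed, query_id=29423e101d87c0f6-be9b99da830b83ac"
)

elapsed = (glog_time(failed_line) - glog_time(begin_line)).total_seconds()
print(f"txn 6809 failed {elapsed:.3f} s after it started")  # 130.095 s
```

The subtraction assumes both lines come from the same day or adjacent days within one year; a load that spans a year boundary would need the real year.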