Caused by: com.starrocks.data.load.stream.exception.StreamLoadFailException: Stream load failed because of unknown exception, db: xxxx, table:xxxx, label: xxxxx

为了更快的定位您的问题,请提供以下信息,谢谢
【详述】Flink任务使用Stream Load的方式向SR写数据,30s一批数据,最近频繁遇到报错
【背景】之前FE集群有问题,重启了3个FE角色
【业务影响】任务失败,导致频繁告警
【StarRocks版本】例如:2.5.3
【集群规模】例如:3fe(1 Leader+2Follower)+6be(fe与be分开部署)
【机器信息】CPU虚拟核/内存/网卡,例如:32C/128G/万兆
【联系方式】社区群16-一叶
【附件】

  • fe.log
    2023-09-09 12:14:00,010 INFO (starrocks-mysql-nio-pool-7987|65281) [QeProcessorImpl.unregisterQuery():91] deregister query id 4d75ad1c-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,013 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.unregisterQuery():91] deregister query id 4d75860b-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,014 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.registerQuery():81] register query id = 4d76706d-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,015 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.unregisterQuery():91] deregister query id 4d76706d-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,018 INFO (PUBLISH_VERSION|21) [DatabaseTransactionMgr.finishTransaction():1036] finish transaction TransactionState. txn_id: 77769367, label: audit_20230909_121359_172_26_72_128_9010, db id: 2290760, table id list: 2290763, callback id: -1, coordinator: BE: xxxx transaction status: VISIBLE, error replicas num: 0, replica ids: , prepare time: 1694232839986, commit time: 1694232840002, finish time: 1694232840015, write cost: 16ms, wait for publish cost: 3ms, publish rpc cost: 3ms, finish txn cost: 7ms, publish total cost: 13ms, total cost: 29ms, reason: attachment: com.starrocks.load.loadv2.ManualLoadTxnCommitAttachment@4b062041 successfully
    2023-09-09 12:14:00,044 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.registerQuery():81] register query id = 4d7b2b5e-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,047 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.unregisterQuery():91] deregister query id 4d7b2b5e-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,069 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.registerQuery():81] register query id = 4d7efbef-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,072 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.unregisterQuery():91] deregister query id 4d7efbef-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,087 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.registerQuery():81] register query id = 4d81bb10-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,092 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.unregisterQuery():91] deregister query id 4d81bb10-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,279 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.registerQuery():81] register query id = 4d9ee001-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,279 INFO (starrocks-mysql-nio-pool-7987|65281) [QeProcessorImpl.registerQuery():81] register query id = 4d9ee002-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,282 INFO (starrocks-mysql-nio-pool-7987|65281) [QeProcessorImpl.unregisterQuery():91] deregister query id 4d9ee002-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,282 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.unregisterQuery():91] deregister query id 4d9ee001-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,282 INFO (thrift-server-pool-18952|65427) [QeProcessorImpl.reportExecStatus():130] ReportExecStatus() failed, query does not exist, fragment_instance_id=4d9ee001-4ec7-11ee-9dfb-0632b343459c, query_id=4d9ee001-4ec7-11ee-9dfb-0632b3434595,
    2023-09-09 12:14:00,289 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.registerQuery():81] register query id = 4da08db3-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,290 INFO (starrocks-mysql-nio-pool-7987|65281) [QeProcessorImpl.registerQuery():81] register query id = 4da0b4c4-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,292 INFO (nioEventLoopGroup-6-6|282) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/rollback
    2023-09-09 12:14:00,292 INFO (nioEventLoopGroup-6-6|282) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: rollback, label: 5d09b2a0-eaf0-4637-b7c7-5d79099a895c
    2023-09-09 12:14:00,292 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.unregisterQuery():91] deregister query id 4da08db3-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,293 INFO (starrocks-mysql-nio-pool-7987|65281) [QeProcessorImpl.unregisterQuery():91] deregister query id 4da0b4c4-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,293 INFO (thrift-server-pool-18943|65418) [FrontendServiceImpl.loadTxnRollback():1134] receive txn rollback request. db: kc_datawind, tbl: , txn_id: 77768986, reason: transaction is aborted by user., backend: xxxx2023-09-09 12:14:00,296 INFO (thrift-server-pool-18943|65418) [DatabaseTransactionMgr.abortTransaction():1263] transaction:[TransactionState. txn_id: 77768986, label: 5d09b2a0-eaf0-4637-b7c7-5d79099a895c, db id: 918719, table id list: 12447363, callback id: -1, coordinator: BE: xxxx transaction status: ABORTED, error replicas num: 0, replica ids: , prepare time: 1694232777924, commit time: -1, finish time: 1694232840293, total cost: 62369ms, reason: transaction is aborted by user. attachment: com.starrocks.load.loadv2.ManualLoadTxnCommitAttachment@331d6850] successfully rollback
    2023-09-09 12:14:00,296 INFO (thrift-server-pool-18967|65444) [FrontendServiceImpl.loadTxnRollback():1134] receive txn rollback request. db: kc_datawind, tbl: , txn_id: 77768986, reason: transaction is aborted by user., backend: xxxx2023-09-09 12:14:00,296 WARN (thrift-server-pool-18967|65444) [FrontendServiceImpl.loadTxnRollback():1152] failed to rollback txn 77768986: transaction not found
    2023-09-09 12:14:00,302 INFO (nioEventLoopGroup-6-7|283) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/rollback
    2023-09-09 12:14:00,302 INFO (nioEventLoopGroup-6-7|283) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: rollback, label: 5d09b2a0-eaf0-4637-b7c7-5d79099a895c
    2023-09-09 12:14:00,307 INFO (nioEventLoopGroup-6-8|284) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/rollback
    2023-09-09 12:14:00,307 INFO (nioEventLoopGroup-6-8|284) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: rollback, label: 5d09b2a0-eaf0-4637-b7c7-5d79099a895c
    2023-09-09 12:14:00,426 INFO (nioEventLoopGroup-6-9|285) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/begin
    2023-09-09 12:14:00,426 INFO (nioEventLoopGroup-6-9|285) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: begin, label: 2b9fe219-3ac1-4cd5-a753-8bb053eac561
    2023-09-09 12:14:00,433 INFO (thrift-server-pool-18834|65309) [FrontendServiceImpl.loadTxnBegin():882] receive txn begin request, db: kc_datawind, tbl: ads_risk_contract_common_board_si, label: 2b9fe219-3ac1-4cd5-a753-8bb053eac561, backend: xxxx
    2023-09-09 12:14:00,433 INFO (thrift-server-pool-18834|65309) [DatabaseTransactionMgr.beginTransaction():308] begin transaction: txn_id: 77769368 with label 2b9fe219-3ac1-4cd5-a753-8bb053eac561 from coordinator BE: xxxx, listner id: -1
    2023-09-09 12:14:00,434 INFO (nioEventLoopGroup-6-10|286) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/begin
    2023-09-09 12:14:00,434 INFO (nioEventLoopGroup-6-10|286) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: begin, label: b2728e83-5819-4e72-86a5-49af3e9c34ec
    2023-09-09 12:14:00,437 INFO (thrift-server-pool-18843|65318) [FrontendServiceImpl.loadTxnBegin():882] receive txn begin request, db: kc_datawind, tbl: ads_risk_contract_common_board_si, label: b2728e83-5819-4e72-86a5-49af3e9c34ec, backend: xxxx2023-09-09 12:14:00,437 INFO (thrift-server-pool-18843|65318) [DatabaseTransactionMgr.beginTransaction():308] begin transaction: txn_id: 77769369 with label b2728e83-5819-4e72-86a5-49af3e9c34ec from coordinator BE: xxxx listner id: -1
    2023-09-09 12:14:00,439 INFO (nioEventLoopGroup-6-12|288) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/begin
    2023-09-09 12:14:00,439 INFO (nioEventLoopGroup-6-11|287) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/begin
    2023-09-09 12:14:00,439 INFO (nioEventLoopGroup-6-12|288) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: begin, label: 1d1321a2-49ad-4396-9bf3-cb4261e8896a
    2023-09-09 12:14:00,510 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.registerQuery():81] register query id = 4dc21f87-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,510 INFO (thrift-server-pool-18962|65437) [FrontendServiceImpl.streamLoadPut():1195] receive stream load put request. db:kc_datawind, tbl: ads_risk_contract_common_board_si, txn_id: 77769372, load id: b94a86f6-65b4-b0c2-30ae-8c980df03cb9, backend: xxxx5
    2023-09-09 12:14:00,510 INFO (thrift-server-pool-18962|65437) [Load.initColumns():315] load __op column expr: null
    2023-09-09 12:14:00,511 INFO (thrift-server-pool-18962|65437) [StreamLoadPlanner.plan():233] load job id: b94a86f6-65b4-b0c2-30ae-8c980df03cb9 tx id 77769372 parallel 0 compress NO_COMPRESSION replicated false quorum MAJORITY
    2023-09-09 12:14:00,512 INFO (starrocks-mysql-nio-pool-7984|65238) [QeProcessorImpl.registerQuery():81] register query id = 4dc26da8-4ec7-11ee-9dfb-0632b3434595, job: -1
    2023-09-09 12:14:00,512 INFO (nioEventLoopGroup-6-15|291) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/load
    2023-09-09 12:14:00,512 INFO (nioEventLoopGroup-6-15|291) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx2, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: load, label: 636e8692-fedf-476b-a305-ceb043dd87fd
    2023-09-09 12:14:00,512 INFO (starrocks-mysql-nio-pool-7987|65281) [QeProcessorImpl.unregisterQuery():91] deregister query id 4dc1f876-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,514 INFO (starrocks-mysql-nio-pool-7990|65286) [QeProcessorImpl.unregisterQuery():91] deregister query id 4dc21f87-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,514 INFO (starrocks-mysql-nio-pool-7984|65238) [QeProcessorImpl.unregisterQuery():91] deregister query id 4dc26da8-4ec7-11ee-9dfb-0632b3434595
    2023-09-09 12:14:00,517 INFO (nioEventLoopGroup-6-1|277) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/begin
    2023-09-09 12:14:00,517 INFO (nioEventLoopGroup-6-1|277) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: begin, label: 5f09786e-c7eb-42d5-8852-5e01fb6441fd
    2023-09-09 12:14:00,519 INFO (thrift-server-pool-18910|65385) [FrontendServiceImpl.loadTxnBegin():882] receive txn begin request, db: kc_datawind, tbl: ads_risk_contract_common_board_si, label: 5f09786e-c7eb-42d5-8852-5e01fb6441fd, backend: xxxx
    2023-09-09 12:14:00,519 INFO (thrift-server-pool-18910|65385) [DatabaseTransactionMgr.beginTransaction():308] begin transaction: txn_id: 77769376 with label 5f09786e-c7eb-42d5-8852-5e01fb6441fd from coordinator BE: xxxx, listner id: -1
    2023-09-09 12:14:00,519 INFO (thrift-server-pool-18929|65404) [FrontendServiceImpl.streamLoadPut():1195] receive stream load put request. db:kc_datawind, tbl: ads_risk_contract_common_board_si, txn_id: 77769373, load id: c248ddf3-3868-18b3-4160-669fd77a93af, backend: xxxx2
    2023-09-09 12:14:00,519 INFO (thrift-server-pool-18929|65404) [Load.initColumns():315] load __op column expr: null
    2023-09-09 12:14:00,519 INFO (thrift-server-pool-18929|65404) [StreamLoadPlanner.plan():233] load job id: c248ddf3-3868-18b3-4160-669fd77a93af tx id 77769373 parallel 0 compress NO_COMPRESSION replicated false quorum MAJORITY
    2023-09-09 12:14:00,521 INFO (nioEventLoopGroup-6-2|278) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/prepare
    2023-09-09 12:14:00,521 INFO (nioEventLoopGroup-6-2|278) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx5, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: prepare, label: 2b9fe219-3ac1-4cd5-a753-8bb053eac561
    2023-09-09 12:14:00,522 INFO (nioEventLoopGroup-6-3|279) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/prepare
    2023-09-09 12:14:00,522 INFO (nioEventLoopGroup-6-3|279) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: prepare, label: b2728e83-5819-4e72-86a5-49af3e9c34ec
    2023-09-09 12:14:00,522 INFO (nioEventLoopGroup-6-4|280) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/prepare
    2023-09-09 12:14:00,522 INFO (nioEventLoopGroup-6-4|280) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx5, port:8040), db: kc_datawind, table: ads_risk_speedy_common_board_si, op: prepare, label: 5cd8f2b0-34df-4174-aa1e-bf342104d830
    2023-09-09 12:14:00,522 INFO (nioEventLoopGroup-6-5|281) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/prepare
    2023-09-09 12:14:00,522 INFO (nioEventLoopGroup-6-5|281) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: prepare, label: 1d1321a2-49ad-4396-9bf3-cb4261e8896a
    2023-09-09 12:14:00,523 INFO (nioEventLoopGroup-6-6|282) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/load
    2023-09-09 12:14:00,523 INFO (nioEventLoopGroup-6-6|282) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: load, label: 406572c3-35bc-4f98-ae28-1c525736a4a1
    2023-09-09 12:14:00,523 INFO (nioEventLoopGroup-6-7|283) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/begin
    2023-09-09 12:14:00,523 INFO (nioEventLoopGroup-6-7|283) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx6, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: begin, label: 31fd1a5a-a793-4070-a36d-29a8422898bd
    2023-09-09 12:14:00,523 INFO (nioEventLoopGroup-6-8|284) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/begin
    2023-09-09 12:14:00,523 INFO (nioEventLoopGroup-6-8|284) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx5, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: begin, label: 4d214e3f-89fd-4c4e-ab1d-d248fe361c19
    2023-09-09 12:14:00,533 INFO (thrift-server-pool-18947|65422) [FrontendServiceImpl.loadTxnPrepare():1073] receive txn prepare request. db: kc_datawind, tbl: ads_risk_contract_common_board_si, txn_id: 77769369, backend: xxxx
    2023-09-09 12:14:00,533 INFO (thrift-server-pool-18934|65409) [FrontendServiceImpl.loadTxnPrepare():1073] receive txn prepare request. db: kc_datawind, tbl: ads_risk_speedy_common_board_si, txn_id: 77769374, backend: xxxx5
    2023-09-09 12:14:00,533 INFO (thrift-server-pool-18928|65403) [FrontendServiceImpl.loadTxnPrepare():1073] receive txn prepare request. db: kc_datawind, tbl: ads_risk_contract_common_board_si, txn_id: 77769370, backend: xxxx
    2023-09-09 12:14:00,533 INFO (thrift-server-pool-18930|65405) [FrontendServiceImpl.loadTxnPrepare():1073] receive txn prepare request. db: kc_datawind, tbl: ads_risk_contract_common_board_si, txn_id: 77769368, backend: xxxx5
    2023-09-09 12:14:00,536 INFO (thrift-server-pool-18957|65432) [FrontendServiceImpl.loadTxnBegin():882] receive txn begin request, db: kc_datawind, tbl: ads_risk_contract_common_board_si, label: 31fd1a5a-a793-4070-a36d-29a8422898bd, backend: xxxx6
    2023-09-09 12:14:00,536 INFO (nioEventLoopGroup-6-9|285) [RestBaseAction.handleRequest():57] receive http request. url=/api/transaction/prepare
    2023-09-09 12:14:00,536 INFO (nioEventLoopGroup-6-9|285) [TransactionLoadAction.executeTransaction():261] redirect transaction action to destination=TNetworkAddress(hostname:xxxx5, port:8040), db: kc_datawind, table: ads_risk_contract_common_board_si, op: prepare, label: 3e8a52c6-0c64-483b-a416-b2f109358d14
    2023-09-09 12:14:00,537 INFO (thrift-server-pool-18947|65422) [DatabaseTransactionMgr.prepareTransaction():597] transaction:[TransactionState. txn_id: 77769369, label: b2728e83-5819-4e72-86a5-49af3e9c34ec, db id: 918719, table id list: 12447363, callback id: -1, coordinator: BE: xxxx, transaction status: PREPARED, error replicas num: 0, replica ids: , prepare time: 1694232840437, commit time: 1694232840533, finish time: -1, write cost: 96ms, reason: attachment: com.starrocks.load.loadv2.ManualLoadTxnCommitAttachment@2e3f603e] successfully prepare
  • fe.warn
    2023-09-09 12:13:02,022 WARN (thrift-server-pool-18897|65372) [FrontendServiceImpl.streamLoadPut():1206] failed to get stream load plan: get database read lock timeout, database=kc_futures
    2023-09-09 12:13:02,022 WARN (thrift-server-pool-18898|65373) [FrontendServiceImpl.streamLoadPut():1206] failed to get stream load plan: get database read lock timeout, database=kc_futures
    2023-09-09 12:13:02,246 WARN (thrift-server-pool-18900|65375) [Database.logTryLockFailureEvent():148] try db lock failed. type: readLock, current owner id: 65316, owner name: thrift-server-pool-18841, owner stack: dump thread: thrift-server-pool-18841, id: 65316
    sun.misc.Unsafe.park(Native Method)
    java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
    java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
    java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
    java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
    java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231)
    com.starrocks.journal.JournalTask.get(JournalTask.java:65)
    com.starrocks.journal.JournalTask.get(JournalTask.java:14)
    com.starrocks.persist.EditLog.waitInfinity(EditLog.java:1017)
    com.starrocks.persist.EditLog.logEdit(EditLog.java:959)
    com.starrocks.persist.EditLog.logInsertTransactionState(EditLog.java:1320)
    com.starrocks.transaction.DatabaseTransactionMgr.unprotectUpsertTransactionState(DatabaseTransactionMgr.java:1145)
    com.starrocks.transaction.DatabaseTransactionMgr.unprotectedCommitTransaction(DatabaseTransactionMgr.java:1056)
    com.starrocks.transaction.DatabaseTransactionMgr.commitTransaction(DatabaseTransactionMgr.java:457)
    com.starrocks.transaction.GlobalTransactionMgr.commitTransaction(GlobalTransactionMgr.java:360)
    com.starrocks.transaction.GlobalTransactionMgr.commitAndPublishTransaction(GlobalTransactionMgr.java:438)
    com.starrocks.service.FrontendServiceImpl.loadTxnCommitImpl(FrontendServiceImpl.java:1015)
    com.starrocks.service.FrontendServiceImpl.loadTxnCommit(FrontendServiceImpl.java:974)
    com.starrocks.thrift.FrontendService$Processor$loadTxnCommit.getResult(FrontendService.java:2596)
    com.starrocks.thrift.FrontendService$Processor$loadTxnCommit.getResult(FrontendService.java:2576)
    org.apache.thrift.ProcessFunction.process(ProcessFunction.java:38)
    org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:38)
    com.starrocks.common.SRTThreadPoolServer$WorkerProcess.run(SRTThreadPoolServer.java:311)
    java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    java.lang.Thread.run(Thread.java:748)

2023-09-09 12:13:02,246 WARN (thrift-server-pool-18900|65375) [FrontendServiceImpl.streamLoadPut():1206] failed to get stream load plan: get database read lock timeout, database=kc_futures
2023-09-09 12:13:02,274 WARN (thrift-server-pool-18829|65304) [Database.logSlowLockEventIfNeeded():141] slow db lock. type: tryWriteLock, db id: 250373, db name: kc_futures, wait time: 5638ms, former owner id: 65309, owner name: thrift-server-pool-18834, owner stack: dump thread: thrift-server-pool-18834, id: 65309
sun.misc.Unsafe.park(Native Method)
java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231)
com.starrocks.journal.JournalTask.get(JournalTask.java:65)
com.starrocks.journal.JournalTask.get(JournalTask.java:14)
com.starrocks.persist.EditLog.waitInfinity(EditLog.java:1017)
com.starrocks.persist.EditLog.logEdit(EditLog.java:959)
com.starrocks.persist.EditLog.logInsertTransactionState(EditLog.java:1320)
com.starrocks.transaction.DatabaseTransactionMgr.unprotectUpsertTransactionState(DatabaseTransactionMgr.java:1145)
com.starrocks.transaction.DatabaseTransactionMgr.unprotectedCommitTransaction(DatabaseTransactionMgr.java:1056)
com.starrocks.transaction.DatabaseTransactionMgr.commitTransaction(DatabaseTransactionMgr.java:457)
com.starrocks.transaction.GlobalTransactionMgr.commitTransaction(GlobalTransactionMgr.java:360)
com.starrocks.transaction.GlobalTransactionMgr.commitAndPublishTransaction(GlobalTransactionMgr.java:438)
com.starrocks.service.FrontendServiceImpl.loadTxnCommitImpl(FrontendServiceImpl.java:1015)
com.starrocks.service.FrontendServiceImpl.loadTxnCommit(FrontendServiceImpl.java:974)
com.starrocks.thrift.FrontendService$Processor$loadTxnCommit.getResult(FrontendService.java:2596)
com.starrocks.thrift.FrontendService$Processor$loadTxnCommit.getResult(FrontendService.java:2576)
org.apache.thrift.ProcessFunction.process(ProcessFunction.java:38)
org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:38)
com.starrocks.common.SRTThreadPoolServer$WorkerProcess.run(SRTThreadPoolServer.java:311)
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
java.lang.Thread.run(Thread.java:748)
, current stack trace:
java.lang.Exception: null
at com.starrocks.catalog.Database.logSlowLockEventIfNeeded(Database.java:142) [starrocks-fe.jar:?]
at com.starrocks.catalog.Database.tryWriteLock(Database.java:244) [starrocks-fe.jar:?]
at com.starrocks.transaction.GlobalTransactionMgr.commitAndPublishTransaction(GlobalTransactionMgr.java:432) [starrocks-fe.jar:?]
at com.starrocks.service.FrontendServiceImpl.loadTxnCommitImpl(FrontendServiceImpl.java:1015) [starrocks-fe.jar:?]
at com.starrocks.service.FrontendServiceImpl.loadTxnCommit(FrontendServiceImpl.java:974) [starrocks-fe.jar:?]
at com.starrocks.thrift.FrontendService$Processor$loadTxnCommit.getResult(FrontendService.java:2596) [starrocks-fe.jar:?]
at com.starrocks.thrift.FrontendService$Processor$loadTxnCommit.getResult(FrontendService.java:2576) [starrocks-fe.jar:?]
at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:38) [libthrift-0.13.0.jar:0.13.0]
at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:38) [libthrift-0.13.0.jar:0.13.0]
at com.starrocks.common.SRTThreadPoolServer$WorkerProcess.run(SRTThreadPoolServer.java:311) [starrocks-fe.jar:?]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_202]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_202]
at java.lang.Thread.run(Thread.java:748) [?:1.8.0_202]
2023-09-09 12:13:30,358 WARN (thrift-server-pool-18832|65307) [FrontendServiceImpl.loadTxnRollback():1152] failed to rollback txn 77768893: transaction not found
2023-09-09 12:13:30,358 WARN (thrift-server-pool-18846|65321) [FrontendServiceImpl.loadTxnRollback():1152] failed to rollback txn 77768891: transaction not found
2023-09-09 12:13:30,359 WARN (thrift-server-pool-18856|65331) [FrontendServiceImpl.loadTxnRollback():1152] failed to rollback txn 77768892: transaction not found
2023-09-09 12:14:00,296 WARN (thrift-server-pool-18967|65444) [FrontendServiceImpl.loadTxnRollback():1152] failed to rollback txn 77768986: transaction not found
2023-09-09 12:14:30,596 WARN (thrift-server-pool-18930|65405) [FrontendServiceImpl.loadTxnRollback():1152] failed to rollback txn 77769003: transaction not found
2023-09-09 12:17:29,496 WARN (thrift-server-pool-18942|65417) [FrontendServiceImpl.loadTxnRollback():1152] failed to rollback txn 77769004: transaction not found
2023-09-09 12:17:29,496 WARN (thrift-server-pool-18927|65402) [FrontendServiceImpl.loadTxnRollback():1152] failed to rollback txn 77769002: transaction not found
2023-09-09 12:18:24,029 WARN (pool-19-thread-7020|65465) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-06 12:18:23.941000, Text ‘2023-09-06 12:18:23.941000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,029 WARN (pool-19-thread-7020|65465) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-06 12:18:23.941000, Text ‘2023-09-06 12:18:23.941000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,030 WARN (pool-19-thread-7020|65465) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-06 12:18:23.941000, Text ‘2023-09-06 12:18:23.941000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,053 WARN (pool-19-thread-7029|65474) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-08 12:18:23.973000, Text ‘2023-09-08 12:18:23.973000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,053 WARN (pool-19-thread-7029|65474) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-08 12:18:23.973000, Text ‘2023-09-08 12:18:23.973000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,053 WARN (pool-19-thread-7029|65474) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-08 12:18:23.973000, Text ‘2023-09-08 12:18:23.973000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,689 WARN (pool-19-thread-7020|65465) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-06 12:18:23.941000, Text ‘2023-09-06 12:18:23.941000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,690 WARN (pool-19-thread-7020|65465) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-06 12:18:23.941000, Text ‘2023-09-06 12:18:23.941000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,690 WARN (pool-19-thread-7020|65465) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-06 12:18:23.941000, Text ‘2023-09-06 12:18:23.941000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,798 WARN (pool-19-thread-7029|65474) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-08 12:18:23.973000, Text ‘2023-09-08 12:18:23.973000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,798 WARN (pool-19-thread-7029|65474) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-08 12:18:23.973000, Text ‘2023-09-08 12:18:23.973000’ could not be parsed, unparsed text found at index 19
2023-09-09 12:18:24,799 WARN (pool-19-thread-7029|65474) [StatisticUtils.convertStatisticsToDouble():232] Statistic convert error, type datetime, statistic 2023-09-08 12:18:23.973000, Text ‘2023-09-08 12:18:23.973000’ could not be parsed, unparsed text found at index 19

  • 慢查询:
    • load数据不存在慢查询
    • 并行度:show variables like ‘%parallel_fragment_exec_instance_num%’;
      ±------------------------------------±------+
      | Variable_name | Value |
      ±------------------------------------±------+
      | parallel_fragment_exec_instance_num | 1 |
    • pipeline是否开启:show variables like ‘%pipeline%’;
      ±--------------------------------±------+
      | Variable_name | Value |
      ±--------------------------------±------+
      | enable_pipeline_engine | true |
      | enable_pipeline_query_statistic | true |
      | pipeline_dop | 0 |
      | pipeline_profile_level | 1 |
      ±--------------------------------±------+
    • be节点cpu和内存使用率截图


有做过哪些变动么

我也遇上相同的问题
StarRocks 版本:3.1.3
flink-connector-starrocks版本:1.2.6_flink-1.13

@jingdan 能帮忙看下么

搜下fe的leader日志,是否有NullPointerException的相关信息

未发现有 Null 相关日志

我升级了一下flink-connector-starrocks 的版本到 1.2.8_flink-1.13_2.12 目前运行了20多分钟暂未出现该问题,所以暂不确定是客户端问题,还是服务端问题

是leader节点吗,对应时间点fe的info日志还在吗?或者搜一下NullPointerException对应的label看看

是leader节点,因为FE只部署了一个节点
FE节点日志太大了,所以这里就只给一部分了
FE节点日志情况


对应BE节点日志情况

升级connector之后还有出现吗

目前运行了5个小时,还未复现,等明天再看看

目前稳定运行22小时,未发现该问题

嗯嗯那大概率是已经修过的问题,有问题再反馈

好的,感谢回复哈