W1207 16:22:16.800920 8610 system_allocator.cc:56] Disable aggressive memory decommit
W1207 16:22:16.814775 8931 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W1207 16:22:16.814802 8931 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W1207 16:31:56.140347 14043 system_allocator.cc:56] Disable aggressive memory decommit
W1207 16:31:56.155552 14338 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W1207 16:31:56.155578 14338 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W1207 16:42:55.144666 22563 system_allocator.cc:56] Disable aggressive memory decommit
W1207 16:42:55.161206 22959 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W1207 16:42:55.161234 22959 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W1207 16:50:35.193226 20554 system_allocator.cc:56] Disable aggressive memory decommit
W1207 16:50:35.207423 20890 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W1207 16:50:35.207450 20890 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W1207 16:54:11.211185 1918 system_allocator.cc:56] Disable aggressive memory decommit
W1207 16:54:11.226442 2225 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W1207 16:54:11.226465 2225 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W1207 16:56:53.040920 1928 sampler.cpp:183] bvar is busy at sampling for 2 seconds!
Could you please post the contents of be.out?
[root@emr-worker-2 log]# cat be.out
start time: Tue Dec 7 11:44:14 CST 2021
start time: Tue Dec 7 11:52:03 CST 2021
terminate called after throwing an instance of 'terminate called recursively
*** Aborted at 1638863145 (unix time) try "date -d @1638863145" if you are using GNU date ***
PC: @ 0x7f63918f4387 __GI_raise
*** SIGABRT (@0x6685) received by PID 26245 (TID 0x7f62ff0dd700) from PID 26245; stack trace: ***
std::system_error'
what(): Resource temporarily unavailable @ 0x32b89d2 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7f63925bf630 (unknown)
@ 0x7f63918f4387 __GI_raise
@ 0x7f63918f5a78 __GI_abort
@ 0x4b20272 __gnu_cxx::__verbose_terminate_handler()
@ 0x4b1edf6 __cxxabiv1::__terminate()
@ 0x4bc2fe9 __cxa_call_terminate
@ 0x4b1e811 __gxx_personality_v0
@ 0x4bc9e6e _Unwind_RaiseException_Phase2
@ 0x4bca966 _Unwind_Resume
@ 0x13cddf4 _ZN4brpc6policy17ProcessRpcRequestEPNS_16InputMessageBaseE.cold
@ 0x33de7c7 brpc::ProcessInputMessage()
@ 0x33df673 brpc::InputMessenger::OnNewMessages()
@ 0x34873ae brpc::Socket::ProcessEvent()
@ 0x353dddf bthread::TaskGroup::task_runner()
@ 0x352b8a1 bthread_make_fcontext
start time: Tue Dec 7 16:05:47 CST 2021
WARNING: Logging before InitGoogleLogging() is written to STDERR
W1207 16:05:47.504927 6555 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W1207 16:05:47.504953 6555 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W1207 16:05:47.510987 6517 beta_rowset.cpp:85] Fail to open /mnt/disk1/starrocks/data/76/11480/1603701057/02000000000001a6c54f12db27b2d7bc22a1ba2fa76438a8_9.dat: Internal error: fail to find valid type encoding, type:17, encoding:5
start time: Tue Dec 7 16:06:24 CST 2021
start time: Tue Dec 7 16:08:23 CST 2021
start time: Tue Dec 7 16:24:43 CST 2021
start time: Tue Dec 7 16:25:02 CST 2021
start time: Tue Dec 7 16:25:53 CST 2021
start time: Tue Dec 7 17:06:35 CST 2021
The be.WARNING log:
[root@emr-worker-2 log]# tail -f be.WARNING.log.20211207-114414
W1207 16:35:12.258675 32120 thrift_rpc_helper.cpp:70] client repoen failed. address=TNetworkAddress(hostname=172.17.187.188, port=9060), status=Couldn't open transport for 172.17.187.188:9060 (socket open() error: Connection refused)
W1207 16:35:12.258711 32120 engine_clone_task.cpp:266] Fail to download snapshot from http://172.17.187.188:8041/api/_tablet/_download?token=58242ca2-1013-4e2f-b6b0-9d2d246c70f4&file=/mnt/disk1/starrocks/snapshot/20211207163434.282.180//11195/1603701057/: Internal error: Recv failure: Connection reset by peer
W1207 16:35:12.258841 32120 task_worker_pool.cpp:918] clone failed. signature: 11195
W1207 16:43:11.300021 32121 thrift_rpc_helper.cpp:65] retrying call frontend service after 100 ms, address=TNetworkAddress(hostname=172.17.187.188, port=9060), reason=No more data to read.
W1207 16:51:32.218931 32121 thrift_rpc_helper.cpp:65] retrying call frontend service after 100 ms, address=TNetworkAddress(hostname=172.17.187.188, port=9060), reason=No more data to read.
W1207 17:06:36.055104 9815 data_dir.cpp:541] Found invalid rowset=0200000000000222c54f12db27b2d7bc22a1ba2fa76438a8 tablet id=12260 tablet uid=12448d512110c026-493b3078ea5db4bf schema hash=1603701057 txn=1096 current valid tablet uid=474ac277dd62d6a2-f86b79ff00adaeb9
W1207 17:06:36.056113 9815 data_dir.cpp:541] Found invalid rowset=0200000000000223c54f12db27b2d7bc22a1ba2fa76438a8 tablet id=12260 tablet uid=12448d512110c026-493b3078ea5db4bf schema hash=1603701057 txn=1095 current valid tablet uid=474ac277dd62d6a2-f86b79ff00adaeb9
W1207 17:06:36.162786 9741 system_allocator.cc:56] Disable aggressive memory decommit
W1207 17:06:36.176303 10038 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W1207 17:06:36.176327 10038 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
We recommend setting /proc/sys/vm/overcommit_memory to 1. To apply it, run: echo 1 > /proc/sys/vm/overcommit_memory
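A minimal sketch of checking the value before changing it, as suggested in the reply above. The helper names `show_overcommit` and `set_overcommit_always` and their optional file argument are illustrative assumptions, not part of StarRocks. Writing to the real /proc file requires root, and the change is lost on reboot unless it is also persisted (for example with `vm.overcommit_memory = 1` in /etc/sysctl.conf).

```shell
# Print the current overcommit policy (0 = heuristic, 1 = always, 2 = strict).
# The optional argument is a hypothetical override of the file to read,
# so the helper can be tried out without touching /proc.
show_overcommit() {
    local file="${1:-/proc/sys/vm/overcommit_memory}"
    cat "$file"
}

# Write 1 only if the policy is not already 1.
# Needs root when it targets the real /proc file.
set_overcommit_always() {
    local file="${1:-/proc/sys/vm/overcommit_memory}"
    if [ "$(show_overcommit "$file")" != "1" ]; then
        echo 1 > "$file"
    fi
}
```

Calling `show_overcommit` with no argument prints the live value, which makes it easy to confirm that the setting actually took effect before restarting the BE.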
Please help. My cluster has the same problem as the poster above. I have already tried echo 1 > /proc/sys/vm/overcommit_memory, but the BE still will not start.
The be.WARNING log:
W0624 21:26:59.879447 48849 system_allocator.cc:56] Disable aggressive memory decommit
W0624 21:26:59.887162 49137 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W0624 21:26:59.887208 49137 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W0624 21:27:18.246439 49956 system_allocator.cc:56] Disable aggressive memory decommit
W0624 21:27:18.251296 50244 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W0624 21:27:18.251317 50244 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W0624 21:37:00.451493 52591 system_allocator.cc:56] Disable aggressive memory decommit
W0624 21:37:00.461750 52880 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W0624 21:37:00.461782 52880 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W0624 21:37:11.214723 54369 system_allocator.cc:56] Disable aggressive memory decommit
W0624 21:37:11.219420 54657 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W0624 21:37:11.219442 54657 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W0624 21:37:28.886039 55818 system_allocator.cc:56] Disable aggressive memory decommit
W0624 21:37:28.892069 56106 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W0624 21:37:28.892094 56106 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W0624 21:45:30.336508 57805 system_allocator.cc:56] Disable aggressive memory decommit
W0624 21:45:30.341571 58093 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W0624 21:45:30.341598 58093 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W0624 22:02:17.537082 61563 system_allocator.cc:56] Disable aggressive memory decommit
W0624 22:02:17.543267 61851 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W0624 22:02:17.543695 61851 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W0624 22:04:54.465580 62995 system_allocator.cc:56] Disable aggressive memory decommit
W0624 22:04:54.470453 63283 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W0624 22:04:54.470484 63283 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
W0624 22:13:29.391549 65379 system_allocator.cc:56] Disable aggressive memory decommit
W0624 22:13:29.401476 65667 utils.cpp:90] Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR
W0624 22:13:29.401499 65667 task_worker_pool.cpp:1116] Fail to report task to :0, err=-1
That did not fix it.
Help! Could someone take another look?
My problem is solved. The cause was that the JSON being loaded was malformed.
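Since the root cause here turned out to be malformed JSON, a quick syntax check before loading can rule that out early. The following is a sketch only: the helper name `is_valid_json` is hypothetical, and it assumes `python3` is on the PATH (`json.tool` ships with the Python standard library). It checks that the whole file parses as a single JSON document, so a file with one object per line would be rejected and needs to be checked line by line instead.

```shell
# Return 0 if the file parses as a single JSON document, non-zero otherwise.
# Assumption: python3 is installed; json.tool is part of its standard library.
is_valid_json() {
    python3 -m json.tool "$1" > /dev/null 2>&1
}
```

For example, `is_valid_json data.json || echo "data.json is not valid JSON"` reports a bad file without printing its contents (`data.json` is a placeholder name).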