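【StarRocks version】2.0.5
【StarRocks machines】3 BE nodes, in a different data center from the Hadoop cluster. The two networks can reach each other, and the clocks of the two clusters are in sync.
【Load statement】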
LOAD LABEL wn.agg_mis_r_sales_a_achment_lep_prem_index_m_202112
(
DATA INFILE("hdfs://hdfs01-shyp-sx-stg/user/hive/warehouse/sx_adm_safe.db/agg_mis_r_sales_a_achment_lep_prem_index_m/month=202112/data_from=PREM/margin_version=CUR/*")
INTO TABLE agg_mis_r_sales_a_achment_lep_prem_index_m
COLUMNS TERMINATED BY "\x01"
(several dozen Hive table columns, omitted)
)
WITH BROKER 'hdfs_broker'
(
"username" = "hadoop",
"password" = "Bigdata123$"
)
【Symptom】Every time a broker load is submitted, the job stays stuck at LOAD:0% for nearly an hour:
| 12051 | agg_mis_r_sales_a_achment_lep_prem_index_m_202112 | LOADING | ETL:100%; LOAD:0% | BROKER | NULL | cluster:N/A; timeout(s):14400; max_filter_ratio:0.0 | NULL | 2022-06-10 16:36:11 | 2022-06-10 16:36:12 | 2022-06-10 16:36:12 | 2022-06-10 16:36:12 | NULL | NULL | {"Unfinished backends":{"7c40a024-1888-4b67-a4b8-c55e48bc7757":[10005]},"ScannedRows":1055374,"TaskNumber":1,"All backends":{"7c40a024-1888-4b67-a4b8-c55e48bc7757":[10005,10006,10004]},"FileNumber":1,"FileSize":6064849574}
After about an hour the job is cancelled with an error suggesting the Hive data format is wrong:
| 12051 | agg_mis_r_sales_a_achment_lep_prem_index_m_202112 | CANCELLED | ETL:N/A; LOAD:N/A | BROKER | unselected.rows=0; dpp.abnorm.ALL=1573733; dpp.norm.ALL=0 | cluster:N/A; timeout(s):14400; max_filter_ratio:0.0 | type:ETL_QUALITY_UNSATISFIED; msg:quality not good enough to cancel | 2022-06-10 16:36:11 | 2022-06-10 16:36:12 | 2022-06-10 16:36:12 | 2022-06-10 16:36:12 | 2022-06-10 17:26:23 | http://30.104.226.30:8040/api/_load_error_log?file=__shard_3/error_log_insert_stmt_7c40a024-1888-4b67-a4b8-c55e48bc7758_7c40a02418884b67_a4b8c55e48bc7758 | {"Unfinished backends":{"7c40a024-1888-4b67-a4b8-c55e48bc7757":[]},"ScannedRows":1573733,"TaskNumber":1,"All backends":{"7c40a024-1888-4b67-a4b8-c55e48bc7757":[10005,10006,10004]},"FileNumber":1,"FileSize":6064849574}
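For reference, here is how I read the counters in the CANCELLED row. This is only a rough sketch. It assumes the job is cancelled when dpp.abnorm.ALL / (dpp.abnorm.ALL + dpp.norm.ALL) is greater than max_filter_ratio, which is my understanding of the documented behavior and is not taken from the StarRocks source. The function names are made up for illustration.

```python
def filtered_ratio(abnormal_rows: int, normal_rows: int) -> float:
    """Share of scanned rows that were rejected as abnormal."""
    total = abnormal_rows + normal_rows
    return abnormal_rows / total if total else 0.0


def exceeds_max_filter_ratio(abnormal_rows: int, normal_rows: int,
                             max_filter_ratio: float) -> bool:
    """Assumed rule: the load fails once the rejected share exceeds the limit."""
    return filtered_ratio(abnormal_rows, normal_rows) > max_filter_ratio


# Values copied from the CANCELLED row above.
abnormal = 1573733      # dpp.abnorm.ALL
normal = 0              # dpp.norm.ALL
limit = 0.0             # max_filter_ratio

print(filtered_ratio(abnormal, normal))
print(exceeds_max_filter_ratio(abnormal, normal, limit))
```

With these numbers every one of the 1573733 scanned rows was rejected and none was accepted, so under the assumed rule the job fails at any max_filter_ratio below 1.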