【详述】
csv格式的数据,早excel可以打开的,格式估计没问题,但是在starrocks导入的时候,解析有问题,分析了问题后发现是因为有些列存在逗号造成的,于是尝试设置format_type_options,各种组合设置都不能解决问题,个人觉得应该是配置了enclose=’"’(双引号)的话,那么是不是如果数字列没有双引号导致解析不了了?具体设置和错误如下:
broker load创建:
LOAD LABEL events_attributions_2023_07_25_30
(
DATA INFILE(“gs://attributions/2023/07/25/2023-07-25-00-00-00.csv.gz”)
INTO TABLE events_attributions
COLUMNS TERMINATED BY “,”
(
skip_header=1
)
)
WITH BROKER
(…)
得到错误如下:
Error: Value count does not match column count. Expect 64, but got 61. Row: “2023-07-25 00:00:00+00:00”,“XXOOXXOO-7788-XXOO-XXOO-XXOOXXOOXXOO”,“idfv”,“ios”,“2023-07-25 00:16:14+00:00”,1690244175,“true”,"",“false”,“fingerprint”,“com.demo.xxoo”,“1.9.0”,“Kanazawa”,“JP”,""“XXOOXXOO-7788-XXOO-XXOO-XXOOXXOOXXOO”"",“16.5.1”,“Ishikawa”,"",“Apple”,“iPhone14,6”,"","","","","","","",“91a0f486-7775-46c5-90a5-3d1d8f17ec61”,"","","",“2023-07-25 00:15:54+00:00”,1690244154,“180.36.149.198”,"",“ironSource”,“ironSource”,"","",“CPM”,“8491006”,“MRZ_IS_iOS_Playable_mulitgeo_20220614”,“c7de6c80-2a7f-11ee-b3d8-d5e89189ffb8.opp_101.3”,“483898”,“mrz_playable_tutorial_2merge_3waves_forest_easy_blond_ironsource”,“Unknown”,“538209”,"","","",“is”,"","",“MRZ_iOS_IS”,“l5dp”,"","",“valid”,"","","","","",""
【业务影响】无法导入数据
【StarRocks版本】例如:3.0.2
【集群规模】例如:3fe(1 follower+2observer)+3be(fe与be混部)
【机器信息】CPU虚拟核/内存/网卡