skew join bad case计算结果异常

为了更快的定位您的问题,请提供以下信息,谢谢
【详述】生成的执行计划中带了skew join 导致join的时候最终结果不对,换了个低版本的STARROCKS没出现这个问题,升级到最新的3.3.17也一样存在


【背景】做过哪些操作?
【业务影响】
【是否存算分离】
【StarRocks版本】3.3.14 3.3.17
【集群规模】例如:3fe(1 follower+2observer)+5be(fe与be混部)
【机器信息】CPU虚拟核/内存/网卡,例如:48C/64G/万兆
【联系方式】社区群22 hpp
【附件】
复现SQL
SELECT
/*+ SET_VAR(“enable_stats_to_optimize_skew_join”= ‘false’) */
dq
FROM (SELECT CASE WHEN UD_1_BUFU9_kdxssj6.EXPR$0 THEN UD_1_BUFU9_kdxssj6.dq ELSE UD_1_BUFU9_kdxssj60.dq END AS dq
FROM ((SELECT UD_1_BUFU9_kdxssj6.dq, UD_1_BUFU9_kdxssj6.dq IS NOT NULL AS EXPR$0
FROM UD_1_BUFU9_kdxssj6) AS UD_1_BUFU9_kdxssj6 INNER JOIN (SELECT UD_1_BUFU9_kdxssj6.dq
FROM UD_1_BUFU9_kdxssj6) AS UD_1_BUFU9_kdxssj60 ON UD_1_BUFU9_kdxssj6.dq = UD_1_BUFU9_kdxssj60.dq)) AS t
GROUP BY dq
ORDER BY dq IS NULL DESC, dq DESC

建表语句
CREATE TABLE UD_1_BUFU9_kdxssj6 (
id varchar(64) NOT NULL COMMENT “主键id(系统)”,
sjnyr datetime NULL COMMENT “时间(年月日)”,
cplx varchar(1048576) NULL COMMENT “产品类型”,
dq varchar(1048576) NULL COMMENT “地区”,
fjsc varchar(1048576) NULL COMMENT “分级市场”,
jgd varchar(1048576) NULL COMMENT “价格段”,
pp varchar(1048576) NULL COMMENT “品牌”,
xsqd varchar(1048576) NULL COMMENT “销售渠道”,
lr double NULL COMMENT “利润”,
xse varchar(1048576) NULL COMMENT “销售额”,
xsl double NULL COMMENT “销售量”,
create_id varchar(1048576) NULL COMMENT “创建人(系统)”,
update_id varchar(1048576) NULL COMMENT “修改人(系统)”,
create_time datetime NULL COMMENT “创建时间(系统)”,
update_time datetime NULL COMMENT “修改时间(系统)”,
sys_sort double NULL COMMENT “系统排序(系统)”,
data_source varchar(1048576) NULL COMMENT “数据来源(系统)”,
field_s9mrqz8hir varchar(1048576) NULL COMMENT “111”,
field_xv4gbizlue varchar(1048576) NULL COMMENT “111”,
field_qnchad0lrz double NULL COMMENT “333”
) ENGINE=OLAP
PRIMARY KEY(id)
COMMENT “空调销售数据 (6)”
DISTRIBUTED BY HASH(id)
PROPERTIES (
“compression” = “LZ4”,
“enable_persistent_index” = “true”,
“fast_schema_evolution” = “true”,
“replicated_storage” = “true”,
“replication_num” = “1”
);
测试数据构建
insert into UD_1_BUFU9_kdxssj6 select * from FILES(
“path” = “s3a://xxxx/test/a53de096-71f4-11f0-a73e-02426a3db909_0_0_0.parquet”,
“format” = “parquet”,
“compression” = “uncompressed”,
“aws.s3.access_key” = “xxx”,
“aws.s3.secret_key” = “x”,
“aws.s3.region” = “us-west-1”,
“aws.s3.endpoint” = “http://xxxxx4:9000
);
a53de096-71f4-11f0-a73e-02426a3db909_0_0_0.parquet (117.8 KB)

1赞