【背景】现有一张hudi表,建表语句如下
create table dwd_tms_hudi_test_0328 (
event_id string
,app_id int
,operation string
,page_type string
,page_section string
,target_type string
,user_id bigint
,log_timestamp bigint
,local_hour string
,region string
) using hudi tblproperties (
‘type’ = ‘mor’
,‘primaryKey’ = ‘event_id’
,‘preCombineField’ = ‘log_timestamp’
,‘hoodie.index.bucket.engine’ = ‘CONSISTENT_HASHING’
,‘hoodie.bucket.index.hash.field’ = ‘event_id’
,‘hoodie.index.type’ = ‘BUCKET’
,‘hoodie.bucket.index.num.buckets’ = ‘32’
,‘hoodie.bucket.index.max.num.buckets’ = ‘800’
,‘hoodie.bucket.index.min.num.buckets’ = ‘20’
)
partitioned by (local_hour, region)
;
想为其创建hourly分区的物化视图,但是表达式分区不支持str_to_date/str_to_jodatime等可以转成datetime的函数,请教大佬,我该如何做呢?
我们尝试过用int类型来做,int类型会被识别成list分区。所以好像无法手动刷新部分分区?
REFRESH MATERIALIZED VIEW [database.]mv_name
[PARTITION START ("<partition_start_date>") END ("<partition_end_date>")]
[FORCE]
[WITH { SYNC | ASYNC } MODE]
看起来手动刷新分区只支持range类型分区?
我们预期是能自动&&手动刷新部分分区,即使自动不行,也希望手动能够刷新部分分区
解决了,用int类型做,hudi的int类型分区字段会被识别成range分区