StarRocks的全库备份出错

【详述】对StarRocks的一个库备份到S3,结果在UPLOADING状态报超时错误,任务超时时间我未设置,走的默认86400
【背景】该库数据量在200G左右,换了一个500M的库测试还是一样的错误
【StarRocks版本】例如:3.1.4
【集群规模】例如:1fe+1be
[258979: Fail to rename file: s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_224978/__part_224977/__idx_224979/__224982/0200000000080102234877b9adfea7efc62785bd36d9de8a_0.dat.part to: s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_224978/__part_224977/__idx_224979/__224982/0200000000080102234877b9adfea7efc62785bd36d9de8a_0.dat.707a9246aa1cd663daf75c95f79105d4 msg:Fail to copy from src s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_224978/__part_224977/__idx_224979/__224982/0200000000080102234877b9adfea7efc62785bd36d9de8a_0.dat.part to target s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_224978/__part_224977/__idx_224979/__224982/0200000000080102234877b9adfea7efc62785bd36d9de8a_0.dat.707a9246aa1cd663daf75c95f79105d4, msg: curlCode: 28, Timeout was reached], [258975: Fail to rename file: s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_108873/__part_108872/__idx_108874/__108877/020000000006bdff234877b9adfea7efc62785bd36d9de8a_0.dat.part to: s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_108873/__part_108872/__idx_108874/__108877/020000000006bdff234877b9adfea7efc62785bd36d9de8a_0.dat.715e3e8cfee08d15f293de06baeb9451 msg:Fail to copy from src s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_108873/__part_108872/__idx_108874/__108877/020000000006bdff234877b9adfea7efc62785bd36d9de8a_0.dat.part to target s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_108873/__part_108872/__idx_108874/__108877/020000000006bdff234877b9adfea7efc62785bd36d9de8a_0.dat.715e3e8cfee08d15f293de06baeb9451, msg: curlCode: 28, Timeout was reached], [258974: Fail to rename file: s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_108873/__part_108872/__idx_108874/__108875/020000000006bdfe234877b9adfea7efc62785bd36d9de8a_0.dat.part to: s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_108873/__part_108872/__idx_108874/__108875/020000000006bdfe234877b9adfea7efc62785bd36d9de8a_0.dat.eb5e064ab6db3224f9b1eef7cfdefed4 msg:Fail to copy from src s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_108873/__part_108872/__idx_108874/__108875/020000000006bdfe234877b9adfea7efc62785bd36d9de8a_0.dat.part to target s3a://starrocks/__starrocks_repository_v1_repo/__ss_dwd_bak_20231219/__ss_content/__db_10103/__tbl_108873/__part_108872/__idx_108874/__108875/020000000006bdfe234877b9adfea7efc62785bd36d9de8a_0.dat.eb5e064ab6db3224f9b1eef7cfdefed4, msg: curlCode: 28, Timeout was reached]

测试了一张250M的表,是可以备份成功的,逐步增大数据量,当backup任务处于上传状态,刚开始日志显示finished to rename file ,但是当执行了一段时间后,会出现 curlCode: 28, Timeout was reached

插眼 :thinking:

你好后续是怎么处理的呢

https://github.com/StarRocks/starrocks/pull/48860 可以关注后续这个修复的pr

当前可以先调整Be的object_storage_request_timeout_ms = 30000尝试一下

1赞