【存算分离】【hadoop】Unknown error 255

为了更快的定位您的问题,请提供以下信息,谢谢
【详述】搭建存算分离集群,使用hadoop做存储引擎,hadoop版本3.1
【背景】建表操作
【业务影响】 执行建表操作后,三台be中的两台be直接挂掉,重启后也会重新挂掉
【StarRocks版本】3.0.3
【集群规模】例如:3fe(3 follower)+3be(fe与be混部)
【机器信息】CPU虚拟核/内存/网卡,例如:4C/32G/千兆
【附件】

  • fe.log/beINFO/相应截图

    刚开始怀疑是hadoop集群没有权限,遂手动创建starrocks对应目录,并给与777权限。

    应该不是权限问题,很郁闷,建表失败后,be节点除了登录mysql客户端的机器全挂了,而且建表也不能成功,
    建表语句
    CREATE TABLE IF NOT EXISTS detail_demo (
    recruit_date DATE NOT NULL COMMENT “YYYY-MM-DD”,
    region_num TINYINT COMMENT “range [-128, 127]”,
    num_plate SMALLINT COMMENT "range [-32768, 32767] ",
    tel INT COMMENT “range [-2147483648, 2147483647]”,
    id BIGINT COMMENT “range [-2^63 + 1 ~ 2^63 - 1]”,
    password LARGEINT COMMENT “range [-2^127 + 1 ~ 2^127 - 1]”,
    name CHAR(20) NOT NULL COMMENT "range char(m),m in (1-255) ",
    profile VARCHAR(500) NOT NULL COMMENT “upper limit value 65533 bytes”,
    ispass BOOLEAN COMMENT “true/false”
    )
    DUPLICATE KEY(recruit_date, region_num)
    DISTRIBUTED BY HASH(recruit_date, region_num)
    PROPERTIES (
    “enable_storage_cache” = “true”,
    “storage_cache_ttl” = “2592000”,
    “enable_async_write_back” = “false”
    );

CREATE TABLE IF NOT EXISTS customer (
c_custkey int(11) NOT NULL COMMENT “”,
c_name varchar(26) NOT NULL COMMENT “”,
c_address varchar(41) NOT NULL COMMENT “”,
c_city varchar(11) NOT NULL COMMENT “”,
c_nation varchar(16) NOT NULL COMMENT “”,
c_region varchar(13) NOT NULL COMMENT “”,
c_phone varchar(16) NOT NULL COMMENT “”,
c_mktsegment varchar(11) NOT NULL COMMENT “”
) ENGINE=OLAP
DUPLICATE KEY(c_custkey)
COMMENT “OLAP”
DISTRIBUTED BY HASH(c_custkey) BUCKETS 12
PROPERTIES (
“replication_num” = “1”,
“in_memory” = “false”,
“storage_format” = “DEFAULT”
);

有没有大佬帮忙看看这个是哪里的原因。

看一下be.out和jni.log两个文件里的日志, 应该有一些具体错误信息.

贴上来了
could not find method getRootCauseMessage from class org/apache/commons/lang3/exception/ExceptionUtils with signature (Ljava/lang/Throwable;)Ljava/lang/String;
could not find method getStackTrace from class org/apache/commons/lang3/exception/ExceptionUtils with signature (Ljava/lang/Throwable;)Ljava/lang/String;
initCachedClasses failed error:
(unable to get root cause for java.lang.NoClassDefFoundError)
(unable to get stack trace for java.lang.NoClassDefFoundError)
getJNIEnv: getGlobalJNIEnv failed
3.0.3 RELEASE (build fe5e3a1)
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
tracker:process consumption: 60207640
tracker:query_pool consumption: 8485984
tracker:load consumption: 0
tracker:metadata consumption: 0
tracker:tablet_metadata consumption: 0
tracker:rowset_metadata consumption: 0
tracker:segment_metadata consumption: 0
tracker:column_metadata consumption: 0
tracker:tablet_schema consumption: 0
tracker:segment_zonemap consumption: 0
tracker:short_key_index consumption: 0
tracker:column_zonemap_index consumption: 0
tracker:ordinal_index consumption: 0
tracker:bitmap_index consumption: 0
tracker:bloom_filter_index consumption: 0
tracker:compaction consumption: 0
tracker:schema_change consumption: 0
tracker:column_pool consumption: 0
tracker:page_cache consumption: 0
tracker:update consumption: 0
tracker:chunk_allocator consumption: 16392
tracker:clone consumption: 0
tracker:consistency consumption: 0
*** Aborted at 1690434464 (unix time) try “date -d @1690434464” if you are using GNU date ***
PC: @ 0x5caf9d0 methodIdFromClass.part.0
*** SIGSEGV (@0x0) received by PID 3596 (TID 0x7fc0b563d700) from PID 0; stack trace: ***
@ 0x62e4042 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7fc162e637bb os::Linux::chained_handler()
@ 0x7fc162e683fd JVM_handle_linux_signal
@ 0x7fc162e5b158 signalHandler()
@ 0x7fc162339630 (unknown)
@ 0x5caf9d0 methodIdFromClass.part.0
@ 0x5cafa3a constructNewObjectOfJclass
@ 0x5cafd7d constructNewObjectOfCachedClass
@ 0x5cb2539 hdfsBuilderConnect
@ 0x5c53308 staros::starlet::fslib::HdfsFileSystem::initialize()
@ 0x5c29888 staros::starlet::fslib::FileSystemFactoryImpl::new_filesystem()
@ 0x5c299f0 staros::starlet::fslib::FileSystemFactory::new_filesystem()
@ 0x5c3e7d9 staros::starlet::fslib::CacheFileSystem::initialize()
@ 0x5c29888 staros::starlet::fslib::FileSystemFactoryImpl::new_filesystem()
@ 0x5c299f0 staros::starlet::fslib::FileSystemFactory::new_filesystem()
@ 0x4fe651f starrocks::StarOSWorker::new_shared_filesystem()
@ 0x4feb02d starrocks::StarOSWorker::build_filesystem_from_shard_info()
@ 0x4feb8c4 starrocks::StarOSWorker::get_shard_filesystem()
@ 0x46488f5 starrocks::StarletFileSystem::iterate_dir()
@ 0x4e7e022 starrocks::lake::metadata_gc()
@ 0x49887bc _ZNSt17_Function_handlerIFvvEZN9starrocks4lakeL11metadata_gcEPNS2_13TabletManagerERKSt3setINSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEESt4lessISB_ESaISB_EElEUlvE_E9_M_invokeERK
St9_Any_data @ 0x50e93c2 starrocks::ThreadPool::dispatch_thread()
@ 0x50e3eba starrocks::thread::supervise_thread()
@ 0x7fc162331ea5 start_thread
@ 0x7fc16194cb0d __clone
@ 0x0 (unknown)

1赞

BE JAVA_HOME 配的看起来有问题

JDK版本的问题吧 JDK >=11吗