Skip to content

Issues in julia 1.12 with older system? #557

@evelyne-ringoot

Description

@evelyne-ringoot

I am getting some weird issues on a HPC cluster (stampede3), after removing .julia and installing julia 1.12, unset $LD_LIBRARY_PATH, starting a new project and doing ]add oneMKL, I get: (this is with gcc/13.2.0+ onemkl/24.0 loaded)

julia> using oneAPI
terminate called after throwing an instance of 'std::out_of_range'
  what():  bitset::set: __position (which is 4294967288) >= _Nb (which is 9)

[3522716] signal 6 (-6): Aborted
in expression starting at REPL[2]:1
__pthread_kill_implementation at /lib64/libc.so.6 (unknown line)
raise at /lib64/libc.so.6 (unknown line)
abort at /lib64/libc.so.6 (unknown line)
__verbose_terminate_handler at /workspace/srcdir/gcc-14.2.0/libstdc++-v3/libsupc++/vterminate.cc:95
__terminate at /workspace/srcdir/gcc-14.2.0/libstdc++-v3/libsupc++/eh_terminate.cc:48
terminate at /workspace/srcdir/gcc-14.2.0/libstdc++-v3/libsupc++/eh_terminate.cc:58
__cxa_throw at /workspace/srcdir/gcc-14.2.0/libstdc++-v3/libsupc++/eh_throw.cc:98
__throw_out_of_range_fmt at /workspace/srcdir/gcc-14.2.0/libstdc++-v3/src/c++11/functexcept.cc:101
_ZN3NEO10EngineInfo9mapEngineEPKNS_11IoctlHelperERKNS_18EngineCapabilitiesERKNS_21RootDeviceEnvironmentERPKN10aub_stream10EngineTypeERNS0_14EngineCountersEj at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
_ZN3NEO15IoctlHelperI91516createEngineInfoEb at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
_ZN3NEO3Drm15queryEngineInfoEb at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
_ZN3NEO3Drm6createEOSt10unique_ptrINS_13HwDeviceIdDrmESt14default_deleteIS2_EERNS_21RootDeviceEnvironmentE at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
_ZN3NEO18initDrmOsInterfaceEOSt10unique_ptrINS_10HwDeviceIdESt14default_deleteIS1_EEjPNS_21RootDeviceEnvironmentE at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
_ZN3NEOL23initHwDeviceIdResourcesERNS_20ExecutionEnvironmentEOSt10unique_ptrINS_10HwDeviceIdESt14default_deleteIS3_EEj.lto_priv.0 at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
_ZN3NEO25prepareDeviceEnvironmentsERNS_20ExecutionEnvironmentERNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEEj at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
_ZN2L09DriverImp10initializeEP12_ze_result_t at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
_ZZNSt9once_flag18_Prepare_executionC4IZSt9call_onceIZN2L09DriverImp10driverInitEvEUlvE_JEEvRS_OT_DpOT0_EUlvE_EERS7_ENUlvE_4_FUNEv at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
__pthread_once_slow at /lib64/libc.so.6 (unknown line)
_ZN2L010initDriverEv at /home1/11134/eringoot/.julia/artifacts/722b403773ba4cd6fe4104cc541d2301b426c8d0/lib/libze_intel_gpu.so.1 (unknown line)
_ZN6loader9context_t11init_driverERNS_8driver_tEjP27_ze_init_driver_type_desc_tP21_ze_global_dditable_tP22_zes_global_dditable_tb at /home1/11134/eringoot/.julia/artifacts/1a96eae820fa2371f1d3e4c8260ee44953d35e35/lib/libze_loader.so (unknown line)
_ZN6loader9context_t13check_driversEjP27_ze_init_driver_type_desc_tP21_ze_global_dditable_tP22_zes_global_dditable_tPbb at /home1/11134/eringoot/.julia/artifacts/1a96eae820fa2371f1d3e4c8260ee44953d35e35/lib/libze_loader.so (unknown line)
zelLoaderDriverCheck at /home1/11134/eringoot/.julia/artifacts/1a96eae820fa2371f1d3e4c8260ee44953d35e35/lib/libze_loader.so (unknown line)
_ZN6ze_lib9context_t4InitEjbP27_ze_init_driver_type_desc_t at /home1/11134/eringoot/.julia/artifacts/1a96eae820fa2371f1d3e4c8260ee44953d35e35/lib/libze_loader.so (unknown line)
_ZZSt9call_onceIZ6zeInitEUlvE_JEEvRSt9once_flagOT_DpOT0_ENUlvE0_4_FUNEv at /home1/11134/eringoot/.julia/artifacts/1a96eae820fa2371f1d3e4c8260ee44953d35e35/lib/libze_loader.so (unknown line)
__pthread_once_slow at /lib64/libc.so.6 (unknown line)
zeInit at /home1/11134/eringoot/.julia/artifacts/1a96eae820fa2371f1d3e4c8260ee44953d35e35/lib/libze_loader.so (unknown line)
macro expansion at /home1/11134/eringoot/.julia/packages/oneAPI/CHy01/lib/level-zero/libze.jl:2150 [inlined]
#zeInit##0 at /home1/11134/eringoot/.julia/packages/GPUToolbox/JLBB1/src/ccalls.jl:34 [inlined]
#check##0 at /home1/11134/eringoot/.julia/packages/oneAPI/CHy01/lib/level-zero/libze.jl:15 [inlined]
retry_reclaim at /home1/11134/eringoot/.julia/packages/oneAPI/CHy01/lib/level-zero/utils.jl:4
check at /home1/11134/eringoot/.julia/packages/oneAPI/CHy01/lib/level-zero/libze.jl:13 [inlined]
zeInit at /home1/11134/eringoot/.julia/packages/GPUToolbox/JLBB1/src/ccalls.jl:33 [inlined]
__init__ at /home1/11134/eringoot/.julia/packages/oneAPI/CHy01/lib/level-zero/oneL0.jl:164
jfptr___init___2713 at /home1/11134/eringoot/.julia/compiled/v1.12/oneAPI/X4ZbC_ovvmr.so (unknown line)
jl_apply at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/julia.h:2391 [inlined]
jl_module_run_initializer at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/toplevel.c:68
run_module_init at ./loading.jl:1440
register_restored_modules at ./loading.jl:1428
#_include_from_serialized#867 at ./loading.jl:1316
_include_from_serialized at ./loading.jl:1271 [inlined]
#_require_search_from_serialized#883 at ./loading.jl:2112
_require_search_from_serialized at ./loading.jl:2006 [inlined]
__require_prelocked at ./loading.jl:2624
_require_prelocked at ./loading.jl:2490
macro expansion at ./loading.jl:2418 [inlined]
macro expansion at ./lock.jl:376 [inlined]
__require at ./loading.jl:2383
jfptr___require_62821.1 at /home1/11134/eringoot/julia-1.12.2/lib/julia/sys.so (unknown line)
jl_apply at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/julia.h:2391 [inlined]
jl_f_invoke_in_world at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/builtins.c:900
require at ./loading.jl:2359
jfptr_require_66738.1 at /home1/11134/eringoot/julia-1.12.2/lib/julia/sys.so (unknown line)
jl_apply at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/julia.h:2391 [inlined]
call_require at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/toplevel.c:494 [inlined]
eval_import_path at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/toplevel.c:532
jl_toplevel_eval_flex at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/toplevel.c:830
ijl_toplevel_eval at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/toplevel.c:1047
eval_body at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/interpreter.c:655
jl_interpret_toplevel_thunk at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/interpreter.c:898
jl_toplevel_eval_flex at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/toplevel.c:1035
__repl_entry_eval_expanded_with_loc at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:301
jl_apply at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/julia.h:2391 [inlined]
jl_f_invokelatest at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/builtins.c:881
toplevel_eval_with_hooks at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:308
toplevel_eval_with_hooks at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:312
toplevel_eval_with_hooks at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:305 [inlined]
eval_user_input at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:330
repl_backend_loop at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:452
#start_repl_backend#41 at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:427
start_repl_backend at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:424 [inlined]
#run_repl#50 at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:653
run_repl at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/usr/share/julia/stdlib/v1.12/REPL/src/REPL.jl:639
jfptr_run_repl_19709.1 at /home1/11134/eringoot/julia-1.12.2/share/julia/compiled/v1.12/REPL/u0gqU_DlLf6.so (unknown line)
run_std_repl at ./client.jl:478
jfptr_run_std_repl_36841.1 at /home1/11134/eringoot/julia-1.12.2/lib/julia/sys.so (unknown line)
jl_apply at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/julia.h:2391 [inlined]
jl_f_invokelatest at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/builtins.c:881
run_main_repl at ./client.jl:499
repl_main at ./client.jl:586 [inlined]
_start at ./client.jl:561
jfptr__start_47004.1 at /home1/11134/eringoot/julia-1.12.2/lib/julia/sys.so (unknown line)
jl_apply at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/julia.h:2391 [inlined]
true_main at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/jlapi.c:971
jl_repl_entrypoint at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/src/jlapi.c:1139
main at /cache/build/builder-amdci5-2/julialang/julia-release-1-dot-12/cli/loader_exe.c:58
__libc_start_call_main at /lib64/libc.so.6 (unknown line)
__libc_start_main at /lib64/libc.so.6 (unknown line)
unknown function (ip: 0x4010b8) at /workspace/srcdir/glibc-2.17/csu/../sysdeps/x86_64/start.S
Allocations: 20133225 (Pool: 20130998; Big: 2227); GC: 66
Aborted (core dumped)

Maybe it's some compatibility error with the older intel version on the system?

I tried to load intel modules (24.0) instead of gcc/mkl and got the below:

  ✗ oneAPI
  ✗ Atomix → AtomixoneAPIExt
Precompiling packages finished.
  1 dependency successfully precompiled in 30 seconds. 85 already precompiled.
  2 dependencies errored.
  For a report of the errors see `julia> err`. To retry use `pkg> precompile`

julia> err
PkgPrecompileError: The following 1 direct dependency failed to precompile:

oneAPI

Failed to precompile oneAPI [8f75cd03-7ff8-4ecb-9b8f-daf728133b1b] to "/home1/11134/eringoot/.julia/compiled/v1.12/oneAPI/jl_9N7wJX".
ERROR: LoadError: InitError: could not load library "/home1/11134/eringoot/.julia/artifacts/a5555ca7240cdc4c7fc2353452ac51b417d954c8/lib/liboneapi_support.so"
/home1/11134/eringoot/.julia/artifacts/a5555ca7240cdc4c7fc2353452ac51b417d954c8/lib/libmkl_sycl_vm.so.5: undefined symbol: vmsYnI_64
Stacktrace:
  [1] #dlopen#3
    @ ./libdl.jl:120 [inlined]
  [2] dlopen(s::String, flags::UInt32)
    @ Base.Libc.Libdl ./libdl.jl:119
  [3] macro expansion
    @ ~/.julia/packages/JLLWrappers/m2Pjh/src/products/library_generators.jl:63 [inlined]
  [4] __init__()
    @ oneAPI_Support_jll ~/.julia/packages/oneAPI_Support_jll/szLg7/src/wrappers/x86_64-linux-gnu-cxx11.jl:11
  [5] run_module_init(mod::Module, i::Int64)
    @ Base ./loading.jl:1440
  [6] register_restored_modules(sv::Core.SimpleVector, pkg::Base.PkgId, path::String)
    @ Base ./loading.jl:1428
  [7] _include_from_serialized(pkg::Base.PkgId, path::String, ocachepath::String, depmods::Vector{Any}; register::Bool)
    @ Base ./loading.jl:1316
  [8] _include_from_serialized
    @ ./loading.jl:1271 [inlined]
  [9] _require_search_from_serialized(pkg::Base.PkgId, sourcepath::String, build_id::UInt128, stalecheck::Bool; reasons::Dict{String, Int64}, DEPOT_PATH::Vector{String})
    @ Base ./loading.jl:2112
 [10] _require_search_from_serialized
    @ ./loading.jl:2006 [inlined]
 [11] __require_prelocked(pkg::Base.PkgId, env::String)
    @ Base ./loading.jl:2624
 [12] _require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2490
 [13] macro expansion
    @ ./loading.jl:2418 [inlined]
 [14] macro expansion
    @ ./lock.jl:376 [inlined]
 [15] __require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2383
 [16] require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2359
 [17] top-level scope
    @ ~/.julia/packages/oneAPI/CHy01/src/oneAPI.jl:19
 [18] include(mod::Module, _path::String)
    @ Base ./Base.jl:306
 [19] include_package_for_output(pkg::Base.PkgId, input::String, depot_path::Vector{String}, dl_load_path::Vector{String}, load_path::Vector{String}, concrete_deps::Vector{Pair{Base.PkgId, UInt128}}, source::Nothing)
    @ Base ./loading.jl:3021
 [20] top-level scope
    @ stdin:5
 [21] eval(m::Module, e::Any)
    @ Core ./boot.jl:489
 [22] include_string(mapexpr::typeof(identity), mod::Module, code::String, filename::String)
    @ Base ./loading.jl:2867
 [23] include_string
    @ ./loading.jl:2877 [inlined]
 [24] exec_options(opts::Base.JLOptions)
    @ Base ./client.jl:315
 [25] _start()
    @ Base ./client.jl:550
during initialization of module oneAPI_Support_jll
in expression starting at /home1/11134/eringoot/.julia/packages/oneAPI/CHy01/src/oneAPI.jl:1
in expression starting at stdin:

Am I missing something obvious or should one of the above work?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions