amd hip gpu-computing rocm cpp20 bitnet llm-inference flash-decoding strix-halo gfx1151 1-58-bit ternary-llm
Updated Jun 24, 2026 - C++