Updated Jun 24, 2026 - Shell
# ternary-llm
Here are 4 public repositories matching this topic...
rust amd mcp hip rocm inference-engine 1-bit bitnet npu openai-api llm llm-inference local-ai strix-halo gfx1151 1-58-bit ternary-llm xdna2
Windows-native BitNet and ternary LLM inference with CPU GGUF, GPU runtime, terminal and browser chat, and release zips.
windows cuda pytorch quantization bitnet fastapi llama-cpp local-llm llm-inference gguf 1-bit-llm ternary-llm falcon3
Updated Mar 20, 2026 - Python
amd hip gpu-computing rocm cpp20 bitnet llm-inference flash-decoding strix-halo gfx1151 1-58-bit ternary-llm
Updated Jun 24, 2026 - C++
Run BitNet 1.58-bit and ternary LLMs on Windows with CPU and GPU inference, chat tools, and release-ready builds.
windows cuda pytorch quantization bitnet fastapi llama-cpp local-llm llm-inference gguf ternary-llm falcon3
Updated Jun 26, 2026