ternary-llm

Here are 4 public repositories matching this topic...

rust amd mcp hip rocm inference-engine 1-bit bitnet npu openai-api llm llm-inference local-ai strix-halo gfx1151 1-58-bit ternary-llm xdna2

Windows-native BitNet and ternary LLM inference with CPU GGUF, GPU runtime, terminal and browser chat, and release zips.

windows cuda pytorch quantization bitnet fastapi llama-cpp local-llm llm-inference gguf 1-bit-llm ternary-llm falcon3

amd hip gpu-computing rocm cpp20 bitnet llm-inference flash-decoding strix-halo gfx1151 1-58-bit ternary-llm

Run BitNet 1.58-bit and ternary LLMs on Windows with CPU and GPU inference, chat tools, and release-ready builds.

windows cuda pytorch quantization bitnet fastapi llama-cpp local-llm llm-inference gguf ternary-llm falcon3
