# sm60

Here is 1 public repository matching this topic...

Run modern hybrid/MoE LLMs correctly and fast on cheap old Tesla P100 / GTX 1080 Ti cards. Fork of ik_llama.cpp: clean concurrent (np>1) Gated-DeltaNet hybrid decoding + Pascal sm_60 FP16 build tuning + built-in fan-out decomposer.

  • Updated Jun 7, 2026
  • Shell
