Align GPTQ-Pro kernel with Ampere GPUs (auto-selection + native SASS) by Copilot · Pull Request #7 · groxaxo/GPTQ-Pro

Copilot · 2026-04-28T19:36:21Z

Raise SUPPORTS_FORMATS priority 0 → 95 so GPTQ-Pro enters auto-selection on Ampere
Add -gencode flags for sm_80/sm_86/sm_87 (native Ampere SASS)
Add -gencode arch=compute_87,code=compute_87 PTX fallback for Ada/Hopper forward compatibility

Agent-Logs-Url: https://github.com/groxaxo/GPTQ-Pro/sessions/09c93bdb-a0ca-4654-a385-c74bbab8ce98 Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

…build flags) Agent-Logs-Url: https://github.com/groxaxo/GPTQ-Pro/sessions/e4d78097-be29-48bb-8f75-d71ac6547cc9 Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

groxaxo · 2026-04-29T04:18:57Z

@copilot review the changes please.

Agent-Logs-Url: https://github.com/groxaxo/GPTQ-Pro/sessions/647ddaba-a7c7-4078-a93e-f5d23bda896e Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

Copilot · 2026-04-29T04:22:56Z

@copilot review the changes please.

Here's the review of all changes in this PR:

Commit 1 — QUANTIZE_BLACK_LIST (a81fc20)

Correct. QUANTIZE_BLACK_LIST was an empty dict {}, so no method was ever blocked. The change correctly lists non-GPTQ methods (GGUF, FP8, BitsAndBytes, QQQ, AWQ, EXL3, PARO) and improves the error message to tell the user what method to use instead.

Commit 2 — Ampere alignment (39e5aae)

SUPPORTS_FORMATS priority 0 → 95 — Correct. Priority 0 permanently excluded GPTQ-Pro from BACKEND.AUTO selection (the selector skips anything with priority <= 0). At 95 it ranks above Marlin (90) and ExllamaV2 (80). The existing validate_device() guard (major >= 8) ensures transparent fallthrough to Marlin on pre-Ampere hardware.
-gencode flags — Correct, with one issue I found and fixed: _validate_gptq_pro_device_support() checks major >= 8, which also passes on Ada Lovelace (sm_89 / RTX 4090) and Hopper (sm_90 / H100). With only SASS cubins for sm_80/86/87 and no PTX fallback, loading the kernel on those devices would fail with "no kernel image available". Fixed in 0457356 by adding -gencode arch=compute_87,code=compute_87, which embeds sm_87 PTX that the CUDA driver can forward-JIT for any post-Ampere device.

Copilot AI and others added 2 commits April 28, 2026 03:58

feat: restrict quantization to GPTQ-only via QUANTIZE_BLACK_LIST

a81fc20

Agent-Logs-Url: https://github.com/groxaxo/GPTQ-Pro/sessions/09c93bdb-a0ca-4654-a385-c74bbab8ce98 Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

feat: align GPTQ-Pro kernel with Ampere GPUs (priority + sm_80/86/87 …

39e5aae

…build flags) Agent-Logs-Url: https://github.com/groxaxo/GPTQ-Pro/sessions/e4d78097-be29-48bb-8f75-d71ac6547cc9 Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

Copilot AI assigned Copilot and groxaxo Apr 28, 2026

Copilot created this pull request from a session on behalf of groxaxo April 28, 2026 19:36 View session

groxaxo marked this pull request as ready for review April 29, 2026 04:18

Copilot started work on behalf of groxaxo April 29, 2026 04:19 View session

fix: add PTX fallback gencode for post-Ampere forward compatibility

0457356

Agent-Logs-Url: https://github.com/groxaxo/GPTQ-Pro/sessions/647ddaba-a7c7-4078-a93e-f5d23bda896e Co-authored-by: groxaxo <76023196+groxaxo@users.noreply.github.com>

Copilot AI requested a review from groxaxo April 29, 2026 04:23

Copilot finished work on behalf of groxaxo April 29, 2026 04:23

groxaxo merged commit 299ecd8 into main Apr 29, 2026

groxaxo deleted the copilot/check-bottlenecks-gptq-inferencing branch April 30, 2026 01:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Align GPTQ-Pro kernel with Ampere GPUs (auto-selection + native SASS)#7

Align GPTQ-Pro kernel with Ampere GPUs (auto-selection + native SASS)#7
groxaxo merged 3 commits into
mainfrom
copilot/check-bottlenecks-gptq-inferencing

Copilot AI commented Apr 28, 2026 •

edited

Loading

Uh oh!

groxaxo commented Apr 29, 2026

Uh oh!

Copilot AI commented Apr 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Apr 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

groxaxo commented Apr 29, 2026

Uh oh!

Copilot AI commented Apr 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Apr 28, 2026 •

edited

Loading