Clarify Intel XPU support gaps and unsupported model scope in XPU model matrix docs by Copilot · Pull Request #6 · zhenwei-intel/vllm

Copilot · 2026-03-14T09:12:15Z

The issue asked for a clear view of what Intel XPU still lacks and which models are not supported. This PR adds that missing summary directly to the Intel XPU hardware support page so users can quickly determine current capability boundaries.

Documentation: Intel XPU feature gaps
- Added a new Current Gaps on Intel XPU section in docs/models/hardware_supported_models/xpu.md.
- Captures key runtime limitations currently relevant to users:
  - CUDA graph mode not available on Intel XPU
  - Flash Attention with float32 falls back to Triton
  - bfloat16 blocked on Arc A770 due to known accuracy issues
  - Additional XPU graph capture constraints in multi-GPU communication scenarios
Documentation: Intel GPU quantization gaps
- Added an explicit unsupported quantization list for Intel GPU, sourced from the quantization hardware matrix:
  - Marlin (GPTQ/AWQ/FP8/FP4)
  - INT8 (W8A8)
  - FP8 (W8A8)
  - bitsandbytes
  - DeepSpeedFP
  - GGUF
- Added a direct relative link to the quantization matrix for traceability.
Documentation: unsupported model interpretation
- Added a Model Support Scope section clarifying that the page is a validated allowlist, not an exhaustive unsupported-model list.
- Defined practical criteria for treating a model as unsupported / not yet validated on Intel XPU.

Example snippet from the new guidance:

For Intel XPU, treat a model as **not supported / not yet validated** when any of the following is true:

- The model architecture or checkpoint is **not listed** in the validated tables below.
- The model depends on a quantization method listed above as unsupported on Intel GPU.
- The model only works with unsupported XPU feature combinations.

📱 Kick off Copilot coding agent tasks wherever you are with GitHub Mobile, available on iOS and Android.

Co-authored-by: zhenwei-intel <109187816+zhenwei-intel@users.noreply.github.com>

Initial plan

f52d1f8

Copilot AI assigned Copilot and zhenwei-intel Mar 14, 2026

Copilot started work on behalf of zhenwei-intel March 14, 2026 09:12 View session

docs: clarify Intel XPU feature and model support gaps

67b43e1

Co-authored-by: zhenwei-intel <109187816+zhenwei-intel@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Analyze missing features and unsupported models for Intel XPU~~ Clarify Intel XPU support gaps and unsupported model scope in XPU model matrix docs Mar 14, 2026

Copilot AI requested a review from zhenwei-intel March 14, 2026 09:18

Copilot finished work on behalf of zhenwei-intel March 14, 2026 09:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarify Intel XPU support gaps and unsupported model scope in XPU model matrix docs#6

Clarify Intel XPU support gaps and unsupported model scope in XPU model matrix docs#6
Copilot wants to merge 2 commits into
mainfrom
copilot/intel-xpu-feature-analysis

Copilot AI commented Mar 14, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Mar 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Mar 14, 2026 •

edited

Loading