[Fix] Prevent symbolic over-unification in multi-modality torch.split and add comprehensive tests by cennn · Pull Request #26 · SandAI-org/MagiCompiler

cennn · 2026-04-26T10:03:38Z

🗂️ PR Category

📝 Description

When a modality has 0 tokens during initial compilation (e.g. a CP rank receives only video tokens), Dynamo unifies symbolic variables (total_tokens == video_tokens), causing AssertionError: expected size X==Y on cache reuse with different modality distributions.

Fix: Use a carrier tensor with mark_unbacked dimensions so each modality size becomes an independent unbacked SymInt (u0, u1, u2), preventing symbolic unification. In the two-level compile architecture (@torch.compile outer + @magi_compile inner), tolist() triggers a graph break; the is_compiling() guard ensures mark_unbacked executes in eager without hitting the forbidden callable error.

Additional changes:

Disable triton.autotune_at_compile_time in standalone_compile to avoid CUDA illegal-memory-access with unbacked SymInt dimensions; tuning happens at first runtime invocation instead.
Skip absolute perf thresholds on non-H100 GPUs (parity check only).
Tests (test_symbolic_unification.py, 9 cases):

Part A: Reproduce symbolic over-unification bug (single-level compile)
Part B: Verify carrier tensor + mark_unbacked fix
Part C: CP4-like cache reuse across rank distributions
Part D: Two-level compile — good/bad order, Inductor cache symbol verification (u0,u1,u2 independence), is_compiling() guard necessity

…d symbolic unification tests - Disable triton.autotune_at_compile_time in standalone_compile to avoid CUDA illegal-memory-access with unbacked SymInt dimensions; tuning happens at first runtime invocation instead. - Add comprehensive tests for symbolic over-unification (Part A-D): single-level and two-level compile, CP4 cache reuse, bad-order compilation, and Inductor cache symbol verification. - Skip absolute perf thresholds on non-H100 GPUs (parity check only).

jiahy0825

LGTM

cennn force-pushed the fix/unbacked-symint-symbolic-unification branch from 9e531fb to 3dc7fd7 Compare April 26, 2026 10:07

cennn force-pushed the fix/unbacked-symint-symbolic-unification branch from 3dc7fd7 to 676a246 Compare April 26, 2026 10:12

jiahy0825 approved these changes Apr 26, 2026

View reviewed changes

jiahy0825 merged commit 6df0b5f into SandAI-org:main Apr 26, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fix] Prevent symbolic over-unification in multi-modality torch.split and add comprehensive tests#26

[Fix] Prevent symbolic over-unification in multi-modality torch.split and add comprehensive tests#26
jiahy0825 merged 1 commit into
SandAI-org:mainfrom
cennn:fix/unbacked-symint-symbolic-unification

cennn commented Apr 26, 2026

Uh oh!

jiahy0825 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cennn commented Apr 26, 2026

🗂️ PR Category

📝 Description

Additional changes:

Uh oh!

jiahy0825 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants