[BUG] Qwen models crash with DTensor dispatch error under TP > 1 #1366

## Checklist

- [x] The error occurs when using our provided Docker image.
- [x] I can consistently reproduce the bug across multiple trials or random seeds.
- [ ] If the error causes experiment abortion, I've verified that this error is the root
  cause, not a secondary error caused by peer workers.

## Detailed Information

### Describe the bug

When the FSDP2 engine is used with tensor parallelism (TP > 1) for Qwen-family models (Qwen3, etc.), the forward pass crashes with a DTensor dispatch error.

The root cause is that Qwen models run intermediate operations (aten.alias, aten.slice) between the final LayerNorm and lm_head in their forward() method. These ops are not registered in PyTorch's DTensor dispatch table. When the norm output is a DTensor with Shard(1) placement, the subsequent slice/alias ops therefore cannot propagate the sharding metadata, and a dispatch error is raised before execution reaches lm_head.
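
To make the failure mode described above concrete, here is a minimal pure-Python sketch of a dispatch table that only knows sharding rules for some ops. It is a toy model, not PyTorch code: `ToySharded`, `RULES`, `ToyDispatchError`, `dispatch`, and `run_chain` are all hypothetical names invented for illustration, and the pass-through rules are an assumption chosen for simplicity.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class ToySharded:
    """Toy stand-in for a sharded tensor: it only carries a placement label."""
    placement: str  # e.g. "Shard(1)"


class ToyDispatchError(RuntimeError):
    """Raised when an op has no sharding rule in the toy table."""


# Hypothetical rule table: op name -> function from input placement to output placement.
# Only the norm and lm_head are registered; alias and slice are deliberately missing,
# mirroring the situation described in this report.
RULES: Dict[str, Callable[[str], str]] = {
    "final_norm": lambda placement: placement,
    "lm_head": lambda placement: placement,
}


def dispatch(op: str, x: ToySharded) -> ToySharded:
    """Look up the op's rule and propagate the placement, or fail if there is none."""
    rule = RULES.get(op)
    if rule is None:
        raise ToyDispatchError(f"no sharding rule registered for {op}")
    return ToySharded(rule(x.placement))


def run_chain(ops: List[str], x: ToySharded) -> ToySharded:
    """Apply the ops in order, stopping at the first one that cannot be dispatched."""
    for op in ops:
        x = dispatch(op, x)
    return x


# Op order as described in the report: norm -> alias -> slice -> lm_head.
TAIL_WITH_INTERMEDIATE_OPS = ["final_norm", "aten.alias", "aten.slice", "lm_head"]
# A tail with no intermediate ops between the norm and lm_head.
TAIL_DIRECT = ["final_norm", "lm_head"]

if __name__ == "__main__":
    start = ToySharded("Shard(1)")
    print(run_chain(TAIL_DIRECT, start))
    try:
        run_chain(TAIL_WITH_INTERMEDIATE_OPS, start)
    except ToyDispatchError as err:
        print(f"failed before lm_head: {err}")
```

In this toy model the chain with intermediate ops stops at the first unregistered op, so lm_head is never reached, while the direct chain completes. The real DTensor dispatch logic is considerably more involved; the sketch only illustrates why an unregistered op between two supported ops is enough to abort the forward pass.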


### Expected behavior

Qwen models should work correctly with FSDP2 + TP > 1, the same as Llama models. The forward pass through final_norm → lm_head should complete without DTensor dispatch errors.

### Full logs

<img width="1739" height="1601" alt="Image" src="https://github.com/user-attachments/assets/191273fb-1c45-4114-9381-9902aeabdc33" />

## To Reproduce

### Commit ID

Please provide your Git commit ID.

### Environment

Please provide your software and hardware information if you're not using a
containerized environment.

### Script

The bash script or YAML configuration to run:

