[Issue]: arch_supports_fp8() for gfx950 and gfx1201?

### Problem Description

We can read here that FP8 is enabled only for gfx942. According to my understanding gfx950 and gfx1201 should also support it on architecture level.

https://github.com/ROCm/flash-attention/blob/ea8fe36e8418fd8e41705b0f7a8d17cddfb46ab0/flash_attn/flash_attn_triton_amd/utils.py#L774-L776

Maybe there are other blockers that has to be resolved before enabling FP8 for those platforms in flash-attention. You can consider this issue as bug report if this is as simple as updating this check or feature request if more changes are needed, but eventually I think FP8 can be enabled for gfx950 and gfx1201.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Issue]: arch_supports_fp8() for gfx950 and gfx1201? #146

Problem Description

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

	@functools.cache
	def arch_supports_fp8():
	return is_hip() and get_arch() in ('gfx942')

[Issue]: arch_supports_fp8() for gfx950 and gfx1201? #146

Description

Problem Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions