Skip to content

Support byte_mirco_perf on intel gaudi2#141

Open
yupengzh-intel wants to merge 11 commits into
bytedance:mainfrom
yupengzh-intel:main
Open

Support byte_mirco_perf on intel gaudi2#141
yupengzh-intel wants to merge 11 commits into
bytedance:mainfrom
yupengzh-intel:main

Conversation

@yupengzh-intel
Copy link
Copy Markdown

Add HPU backend in byte_micro_perf/backends folder.
Copy HPU supported ops from GPU backend.
Gaudi doesn't support flash attention, we use fused_sdpa to support prefill for flash_attention op.
Add mark_step in byte_micro_perf/core/op.py, to break HPU graph under lazy mode, preventing graph creating issue when preparing too much tensors.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant