-
Notifications
You must be signed in to change notification settings - Fork 4k
[ci] chore: npu ci use cann9.0.0 #6520
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
0681b0d
d8b2f18
f4eccac
54b6452
3b4bbec
c50a951
cdc3f0d
628b6f4
b14f8bc
f9e3f4a
ec34896
0e8a583
34f7579
18b5ffc
b46c8b2
271ac25
740c3f8
f66b1f8
3579090
235745c
a27f6bf
3118992
8e50ac3
2632927
b066c9a
3ed8fb6
36cee54
da60771
d099f36
6afa5eb
965c8fd
a64c3c5
0bcbdca
6d995f5
0e4ed22
f6e2472
061a4a1
c7b42ae
39d0e55
387566d
8e70485
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -62,15 +62,15 @@ concurrency: | |
|
|
||
| permissions: | ||
| contents: read | ||
|
|
||
| jobs: | ||
| llm_rl_job: | ||
| if: github.repository_owner == 'verl-project' | ||
| name: E2E Ascend testing for RL training scenarios of LLM models | ||
| runs-on: linux-aarch64-a3-8 | ||
| timeout-minutes: 120 | ||
| container: | ||
| image: swr.cn-southwest-2.myhuaweicloud.com/modelfoundry/ascend-ci/verl/verl:verl-8.5.0-a3-ubuntu22.04-py3.11-latest | ||
| image: swr.cn-southwest-2.myhuaweicloud.com/modelfoundry/ascend-ci/verl/verl:verl-9.0.0-a3-ubuntu22.04-py3.11-latest | ||
|
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 升级后,权重转换部分要这样改 |
||
| options: >- | ||
| --shm-size 16g | ||
| env: | ||
|
|
@@ -183,7 +183,7 @@ jobs: | |
| runs-on: linux-aarch64-a3-8 | ||
| timeout-minutes: 120 | ||
| container: | ||
| image: swr.cn-southwest-2.myhuaweicloud.com/modelfoundry/ascend-ci/verl/verl:verl-8.5.0-a3-ubuntu22.04-py3.11-latest | ||
| image: swr.cn-southwest-2.myhuaweicloud.com/modelfoundry/ascend-ci/verl/verl:verl-9.0.0-a3-ubuntu22.04-py3.11-latest | ||
| options: >- | ||
| --shm-size 16g | ||
| env: | ||
|
|
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -21,6 +21,8 @@ | |
| import pytest | ||
| import torch | ||
|
|
||
| import verl.utils.device as device_module | ||
|
|
||
|
|
||
| def _load_mcore_util_with_stubbed_megatron(monkeypatch, tp_size: int = 4): | ||
| megatron = types.ModuleType("megatron") | ||
|
|
@@ -39,6 +41,7 @@ def _load_mcore_util_with_stubbed_megatron(monkeypatch, tp_size: int = 4): | |
| monkeypatch.setitem(sys.modules, "megatron.core", core) | ||
| monkeypatch.setitem(sys.modules, "megatron.core.parallel_state", parallel_state) | ||
| monkeypatch.setitem(sys.modules, "megatron.core.packed_seq_params", packed_seq_params) | ||
| monkeypatch.setattr(device_module, "is_npu_available", False) | ||
|
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. pytest -s -x --ignore-glob="test_special_.py" --ignore-glob="on_cpu.py" --ignore-glob="test_vllm" --ignore-glob="_sglang*" --ignore-glob="_hf_rollout" --ignore-glob="tests/models/" --ignore-glob="tests/special*" --ignore-glob="tests/experimental" --ignore-glob="tests/workers/reward_model" --ignore-glob="test_rvdz" --ignore-glob="test_ray_collectives" --ignore-glob="test_nvtx_profile" --ignore-glob="tests/checkpoint_engine" --ignore-glob="test_shared_memory" --ignore-glob="tests/workers/rollout/rollout_trtllm" --ignore-glob="test_fsdp_lora_merge" --ignore-glob="test_activation_offload" --ignore-glob="test_normalize_peft_param_name.py" tests/ -k "not test_preprocess_bshd_engine_preserves_topk_dense_dim_on_gpu" skip这个用例 |
||
|
|
||
| util_path = Path(__file__).parents[2] / "verl" / "models" / "mcore" / "util.py" | ||
| spec = importlib.util.spec_from_file_location("mcore_util_regression", util_path) | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
npu unit test加上
import verl.utils.device as device_module
monkeypatch.setattr(device_module, "is_npu_available", False)