Bump actions/stale from 9.0.0 to 10.0.0#86

Closed
dependabot[bot] wants to merge 34 commits into
mainfrom
dependabot/github_actions/actions/stale-10.0.0

Conversation

Contributor

@dependabot dependabot Bot commented on behalf of github Sep 8, 2025

Bumps actions/stale from 9.0.0 to 10.0.0.

Release notes

Sourced from actions/stale's releases.

v10.0.0

What's Changed

Breaking Changes

Enhancement

Dependency Upgrades

Documentation changes

New Contributors

Full Changelog: actions/stale@v9...v10.0.0

v9.1.0

What's Changed

New Contributors

Full Changelog: actions/stale@v9...v9.1.0

Commits

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

sducouedic and others added 30 commits December 19, 2024 12:16
This PR allows support for IBM's Spyre accelerator.

This work was carried out in a private fork of vLLM. We are now moving
the code into the open, and all future work will be done using this
public fork.

---------

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Signed-off-by: Nikolaos Papandreou <npo@zurich.ibm.com>
Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Sophie du Couedic <sophie.du.couedic.de.kergoualer@ibm.com>
Co-authored-by: Sophie du Couédic <Sophie.Du.Couedic.de.Kergoualer@ibm.com>
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
Co-authored-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Nikolaos Papandreou <npo@zurich.ibm.com>
Co-authored-by: TRAVIS JOHNSON <tsjohnso@us.ibm.com>
Co-authored-by: Burkhard Ringlein <NGL@zurich.ibm.com>
Co-authored-by: Yannick Schnider <Yannick.Schnider1@ibm.com>
Co-authored-by: Jan van Lunteren <jvl@zurich.ibm.com>
Co-authored-by: Maximilien Philippe Marie de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
This PR enables the Spyre tests to run as a GitHub Action.

I realized that the model we were using for the tests `llama-194m` is
not available on HF hub, but if we want to run the tests externally we
need to use some model that is available. I've replaced it with this
one: https://huggingface.co/JackFram/llama-160m

Note I haven't actually changed the model name in the tests, I just
"hacked" it for now using a soft link in the docker container. This is
because there is some ongoing work to introduce environment variables to
control the tests and I don't want to complicate things.

For this model I see some quite weird behaviour where the tokens
produced by vLLM and HF Transformers are identical but the decoded text
is slightly different (they are the same up to a leading space). I don't
think this difference is related to Spyre so I've just changed the test
to compare token ids instead.
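
A minimal sketch of the idea of comparing token ids rather than decoded text. The function name `same_generation`, the dictionary layout, and the token id values are hypothetical and chosen only for illustration; they are not taken from the actual test code.

```python
def same_generation(vllm_result: dict, hf_result: dict) -> bool:
    """Return True when both backends produced the same token ids.

    The decoded text is deliberately ignored, because it may differ by
    detokenization details such as a leading space.
    """
    return list(vllm_result["token_ids"]) == list(hf_result["token_ids"])


# Illustrative values only: identical token ids, text differing by a leading space.
vllm_result = {"text": "Hello world", "token_ids": [15043, 3186]}
hf_result = {"text": " Hello world", "token_ids": [15043, 3186]}

# A text comparison would fail here ...
assert vllm_result["text"] != hf_result["text"]
# ... while the token id comparison passes.
assert same_generation(vllm_result, hf_result)
```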

---------

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Merge from upstream to include new embedding model fixes. There were new
changes to the platform code and the task names were refactored, so I
had to fix some of our code.
Some models such as `sentence-transformers/all-MiniLM-L12-v2` don't have
special tokens such as "bos_token" in their tokenizer configuration.
This causes a key error when the warmup logic tries to get the id for
these tokens. However, since the IDs are only used to exclude them from
the set of tokens that can be generated during the warmup, it doesn't
make a difference if they don't exist.
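
A rough sketch of a tolerant lookup, assuming the tokenizer configuration behaves like a plain dictionary. The helper names `special_token_ids` and `warmup_candidate_ids`, the list of token names, and the toy vocabulary are all hypothetical; the real warmup logic may be structured differently.

```python
def special_token_ids(tokenizer_config: dict, vocab: dict) -> set:
    """Collect ids of the special tokens that are actually configured."""
    ids = set()
    for name in ("bos_token", "eos_token", "pad_token"):
        # .get() returns None instead of raising KeyError for a missing entry.
        token = tokenizer_config.get(name)
        if token is not None and token in vocab:
            ids.add(vocab[token])
    return ids


def warmup_candidate_ids(tokenizer_config: dict, vocab: dict) -> list:
    """Token ids that may be generated during warmup (special tokens excluded)."""
    excluded = special_token_ids(tokenizer_config, vocab)
    return sorted(i for i in vocab.values() if i not in excluded)


# Toy vocabulary for illustration.
vocab = {"<s>": 0, "</s>": 1, "hello": 2, "world": 3}

# With special tokens configured, they are excluded from the candidates.
with_special = warmup_candidate_ids({"bos_token": "<s>", "eos_token": "</s>"}, vocab)
# Without them, nothing is excluded and no error is raised.
without_special = warmup_candidate_ids({}, vocab)
```

Since the ids only serve as an exclusion set, a missing special token simply means there is nothing to exclude.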
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Fix issue with batch padding changing during decoding (e.g., if one
sequence finished before the others).
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
This function no longer exists upstream and we don't seem to use
`VLLM_INSTANCE_ID` anywhere else in the code.
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
These now work due to upstream changes that were pulled in.
…ncies. These comments cause the packages to be stripped out when running use_existing_torch, but the packages are required dependencies.

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
…ncies (#68)

I already tried to fix this using #66
but upstream didn't like that change (the behaviour to filter out
comments containing torch was intentional). After [some
discussion](vllm-project/vllm#12255), we agreed
on a different solution implemented in this PR. Note that I reverted the
changes from #66 by force pushing main.

Note this has already been merged upstream by
vllm-project/vllm#12260 but I'm cherry-picking
the fix here since it is blocking the CI builds.
Signed-off-by: Yannick Schnider <Yannick.Schnider1@ibm.com>
Signed-off-by: Yannick Schnider <Yannick.Schnider1@ibm.com>
Small typo fix referring to wrong test script...
tdoublep and others added 4 commits January 28, 2025 20:42
This PR reworks our code according to some important upstream changes.
In particular, there is no longer any need to have a separate
`SpyreExecutor` and `MultiprocessingSpyreExecutor`. Upstream has added
generic classes for this that work across different platforms. Actually,
it simplifies our code quite a lot.

The model runner classes now inherit from `ModelRunnerBase` and we need
to define a `ModelInputForSpyre` class accordingly.

This is currently passing all CPU tests, but needs to be tested on Spyre
and needs careful review since it is quite a big change.


**Note:** the target for this PR is a branch `upstream-2025-01-17`
containing upstream changes merged into our current branch. I've done it
like this so it is easier to review the changes. If this PR is approved,
we can then merge the changes into `upstream-2025-01-17` and then merge
that one into main.

---------

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Signed-off-by: Yannick Schnider <Yannick.Schnider1@ibm.com>
Co-authored-by: Yannick Schnider <Yannick.Schnider1@ibm.com>
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
This PR fixes a previously unidentified bug and adds pytests for
validation.

**Changes**: 
- addressing the logic error described below by introducing
`SpyreCausalLM.indices` containing a mask indicating the unfinished
sequences in the current batch. ->
[commit](3f087a7)
- adapting the generation functions in
[tests/spyre/spyre_util.py](main...ysc-fix-variable-max-tokens#diff-d232e0cf89b92b0ec7da17e322bb2ca675af8a704099e5ae0c54995ddb4a3f9a)
for `hf` and `vllm` to accept a different maximum number of decoding tokens for
sequences within the same batch ->
[commit](f632e8e)
- adding
[tests/spyre/test_spyre_max_new_tokens.py](main...ysc-fix-variable-max-tokens#diff-82d9214a22b1db2e524795c8a649a40c115fd95a40b279e4d3245c7820e6ddf8)
to validate functionality when sequences in a batch finish decoding
before others. ->
[commit](f632e8e)

**Bug description**:

Having a different number of requested output tokens within the same
batch will lead to some sequences being removed from the batch while
others are still decoding. Previously the code did not take into account
the offset a removed sequence introduces in the `positions` (ids) and
(attention) `masks`. This error remains undetected if all prompts are of
the same length (they will have the same position ids and attention
masks) or if the sequence that finishes early is always the last one in the batch (the
offset at the end will not affect sequences with smaller indices within
the same batch).

_bug example_: 
<img width="1392" alt="Screenshot 2025-01-31 at 12 39 26"
src="https://github.com/user-attachments/assets/b19deee5-af32-48cd-9b1a-051e9f074737"
/>
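
The misalignment described above can be illustrated with a small sketch. It assumes left-padded prompts and uses plain lists in place of tensors; `select_unfinished` and the "truncate to the new batch size" behaviour are assumptions made for illustration, not the actual `SpyreCausalLM` code.

```python
def select_unfinished(rows: list, unfinished_mask: list) -> list:
    """Keep only the rows whose sequence is still decoding."""
    return [row for row, keep in zip(rows, unfinished_mask) if keep]


# Position ids for three left-padded prompts of lengths 4, 3 and 2.
positions = [
    [0, 1, 2, 3],
    [0, 0, 1, 2],
    [0, 0, 0, 1],
]

# The middle sequence finishes early; the other two keep decoding.
unfinished = [True, False, True]

# Assumed faulty behaviour: shrink to the new batch size without tracking
# which sequence was removed. Row 1 (finished) is kept, row 2 is lost.
truncated = positions[:sum(unfinished)]

# Mask-based selection keeps exactly the rows of the unfinished sequences.
selected = select_unfinished(positions, unfinished)

assert truncated != selected
assert selected == [[0, 1, 2, 3], [0, 0, 0, 1]]

# If the last sequence is the one that finishes, both approaches agree,
# which is consistent with the error going unnoticed in that case.
assert select_unfinished(positions, [True, True, False]) == positions[:2]
```

The same holds when all prompts have equal length: every row is identical, so picking the wrong row has no visible effect.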

---------

Signed-off-by: Yannick Schnider <Yannick.Schnider1@ibm.com>
Bumps [actions/stale](https://github.com/actions/stale) from 9.0.0 to 10.0.0.
- [Release notes](https://github.com/actions/stale/releases)
- [Changelog](https://github.com/actions/stale/blob/main/CHANGELOG.md)
- [Commits](actions/stale@28ca103...3a9db7e)

---
updated-dependencies:
- dependency-name: actions/stale
  dependency-version: 10.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot Bot added dependencies Pull requests that update a dependency file github_actions Pull requests that update GitHub Actions code labels Sep 8, 2025

github-actions Bot commented Sep 8, 2025

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

  • Add ready label to the PR
  • Enable auto-merge.

🚀

Contributor Author

dependabot Bot commented on behalf of github Sep 9, 2025

Looks like actions/stale is up-to-date now, so this is no longer needed.

@dependabot dependabot Bot closed this Sep 9, 2025
@dependabot dependabot Bot deleted the dependabot/github_actions/actions/stale-10.0.0 branch September 9, 2025 13:59
Labels

dependencies Pull requests that update a dependency file github_actions Pull requests that update GitHub Actions code

4 participants