-
-
Notifications
You must be signed in to change notification settings - Fork 7.5k
[V0][V1][Core] Add outlines integration for V1, and update V0 integration. #15975
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
[V0][V1][Core] Add outlines integration for V1, and update V0 integration. #15975
Conversation
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀 |
NOTE: Can't be merged until next version of outlines_core is released. |
Thank you for the PR! I will review it this week. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I reviewed the v0 code path. One ask is to add tests for this for disabling cache path.
And we should update the requirements/common.txt to the lowest version of outlines-core supported.
vllm/model_executor/guided_decoding/outlines_logits_processors.py
Outdated
Show resolved
Hide resolved
This pull request has merge conflicts that must be resolved before it can be |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
First round of review. A few things needs to be addressed here. but great progress so far.
vllm/model_executor/guided_decoding/outlines_logits_processors.py
Outdated
Show resolved
Hide resolved
vllm/model_executor/guided_decoding/outlines_logits_processors.py
Outdated
Show resolved
Hide resolved
vllm/model_executor/guided_decoding/outlines_logits_processors.py
Outdated
Show resolved
Hide resolved
vllm/model_executor/guided_decoding/outlines_logits_processors.py
Outdated
Show resolved
Hide resolved
This pull request has merge conflicts that must be resolved before it can be |
This pull request has merge conflicts that must be resolved before it can be |
Ok, that one got merged, let's proceed with this one and get this in for 0.9.0 |
commit f861a35 Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Thu May 1 18:39:16 2025 -0500 Update vllm/v1/structured_output/backend_outlines.py Co-authored-by: Aaron Pham <Aaronpham0103@gmail.com> commit 50886e0 Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Thu May 1 13:22:46 2025 -0500 Fix typo in merge conflict resolution commit cf92ccf Merge: 7bc8167 7423cf0 Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Thu May 1 11:51:47 2025 -0500 Merge branch 'main' into update-outlines-integration commit 7bc8167 Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Thu Apr 24 00:33:40 2025 +0000 Make pre-commit happy Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com> commit 5f2855d Merge: 0bf4bdf b07d741 Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Wed Apr 23 19:07:34 2025 -0500 Merge branch 'main' into update-outlines-integration commit 0bf4bdf Merge: 2d67e88 5536b30 Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Tue Apr 22 11:38:21 2025 -0500 Merge branch 'main' into update-outlines-integration commit 2d67e88 Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Mon Apr 21 21:43:08 2025 +0000 format test last commit Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com> commit d5bdab7 Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Mon Apr 21 16:44:32 2025 +0000 fix part of failing test which I can fix. Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com> commit 058ab4b Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Sun Apr 20 05:35:45 2025 +0000 Fix bugs from outlines -> outlines_core behavior differences Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com> commit 04928c8 Author: Nathan Hoos <thwackyy.y@gmail.com> Date: Sat Apr 19 17:32:36 2025 +0000 Add outlines backend to V1 Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com> Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
db324d3
to
8022c7a
Compare
Merge conflicts with your last PR are resolved. The test failure also seems to be unrelated to any changes in this PR. |
hmm maybe tried to merge from main ? |
@unaidedelf8777 I think only the logit processor tests in v0 is related to this PR here (at least for the outlines case) PTAL |
Head branch was pushed to by a user without write access
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
64c0086
to
3d4c13d
Compare
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
Signed-off-by: Nathan Hoos <thwackyy.y@gmail.com>
e70ee76
to
05023f6
Compare
Adds outlines as a guided decoding backend for V1, and updates the integration for V0.
The aim of this is three fold:
outlines
, and only useoutlines_core
write_mask_into
method onGuide
to write a bitmask in-place for use in logits masking.Because the dependency on
outlines
will be removed, support for grammar based decoding with the outlines backend will also be removed (CFG classes reside in theoutlines
package)cc @aarnphm