-
-
Notifications
You must be signed in to change notification settings - Fork 7k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[V1][PP] Optimization: continue scheduling prefill chunks
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#17080
opened Apr 23, 2025 by
ruisearch42
Loading…
[Bugfix] Fix Gemma3 multimodal placeholder replacement
#17074
opened Apr 23, 2025 by
tristanleclercq
•
Draft
[Minor] Use max_num_seqs=1024 for A100/B100/B200/MI300x
#17073
opened Apr 23, 2025 by
WoosukKwon
Loading…
[TPU][V1] Add support for top-logprobs
tpu
Related to Google TPUs
v1
#17072
opened Apr 23, 2025 by
NickLucche
Loading…
[Kernel][Hardware][AMD] Bf16 mfma opt for ROCm skinny GEMMs
#17071
opened Apr 23, 2025 by
amd-hhashemi
Loading…
[TPU][V1][CI] Set
VLLM_XLA_CACHE_PATH=
to avoid disk-full error
ci/build
#17064
opened Apr 23, 2025 by
NickLucche
Loading…
[Docs] Propose a deprecation policy for the project
documentation
Improvements or additions to documentation
#17063
opened Apr 23, 2025 by
russellb
Loading…
Add missing rocm_skinny_gemms kernel test to CI
ready
ONLY add when PR is ready to merge/full CI is needed
#17060
opened Apr 23, 2025 by
mgoin
Loading…
existing torch installation pip command fix for docs
documentation
Improvements or additions to documentation
#17059
opened Apr 23, 2025 by
atilla00
Loading…
[BugFix] Don't raise exception when no FA3
ci/build
#17058
opened Apr 23, 2025 by
LucasWilkinson
•
Draft
fix setuptools-scm was unable to detect version for workspace
ci/build
#17050
opened Apr 23, 2025 by
akoserwal
Loading…
Fix: Python package installation for opentelmetry
ci/build
#17049
opened Apr 23, 2025 by
dilipgb
Loading…
[Misc] Make cached tokenizer pickle-compatible
ready
ONLY add when PR is ready to merge/full CI is needed
#17048
opened Apr 23, 2025 by
DarkLight1337
Loading…
[Core] Prevent side-channel attacks via cache salting
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
v1
#17045
opened Apr 23, 2025 by
dr75
Loading…
Addendum Fix to support FIPS enabled machines with MD5 hashing
ready
ONLY add when PR is ready to merge/full CI is needed
#17043
opened Apr 23, 2025 by
sydarb
Loading…
[Misc] refactor example series - structured outputs
documentation
Improvements or additions to documentation
structured-output
#17040
opened Apr 23, 2025 by
reidliu41
Loading…
[Misc] Change buckets of histogram_iteration_tokens to [1, 8, 16, 32, 64, 128, 256, 512, 1024, 2048, 4096, 8096] to represent number of tokens
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#17033
opened Apr 23, 2025 by
sfc-gh-zhwang
Loading…
[Frontend] Add /classify endpoint
documentation
Improvements or additions to documentation
frontend
#17032
opened Apr 23, 2025 by
frieda-huang
Loading…
[Misc] Benchmark Serving Script Support Appending Results
#17028
opened Apr 23, 2025 by
LucasWilkinson
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.