-
-
Notifications
You must be signed in to change notification settings - Fork 7k
Issues: vllm-project/vllm
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug]: Importing DeepSpeed causes crash in vLLM when running with data parallelism and TP=1
bug
Something isn't working
#17079
opened Apr 23, 2025 by
qgallouedec
1 task done
[Bug]: noop elimination for slice errors when end = -1
bug
Something isn't working
#17078
opened Apr 23, 2025 by
BoyuanFeng
1 task done
[Bug]: Aria model error due to version mismatch with transformers
bug
Something isn't working
#17077
opened Apr 23, 2025 by
musoles
1 task done
[RFC]: Implement structural_tag support in structured output
RFC
structured-output
#17076
opened Apr 23, 2025 by
russellb
1 task done
[Usage]: OpenAI Server API
usage
How to use vllm
#17075
opened Apr 23, 2025 by
lfoppiano
1 task done
[Feature]: GGUF support for GLM4
feature request
New feature or request
#17069
opened Apr 23, 2025 by
DefinitlyEvil
1 task done
[Bug]: CI Build image failure due to mamba-ssm==2.2.4 installation error
bug
Something isn't working
#17068
opened Apr 23, 2025 by
csy1204
1 task done
[RFC]: All Ops should be determined during init and wrapped in a Layer Module to avoid envs.ENVIRON overhead
RFC
#17067
opened Apr 23, 2025 by
tjtanaa
1 task done
[Performance]: UVA vs UVM for CPU offloading on v0.8.4+
performance
Performance-related issues
#17062
opened Apr 23, 2025 by
rajesh-s
1 task done
[Usage]: Does vLLM support QwQ 32B + tool calling?
usage
How to use vllm
#17061
opened Apr 23, 2025 by
ItzAmirreza
1 task done
[Bug]: Issue with SpecDecode when using data parallel
bug
Something isn't working
#17056
opened Apr 23, 2025 by
zoman-sgd
[Bug]: ValueError when using Multi-Instance GPU
bug
Something isn't working
#17047
opened Apr 23, 2025 by
rantahar
1 task done
[Usage]: I have 2 nodes 16 GPUs, how can i use 16 dp+16 ep to run deepseek v3?
usage
How to use vllm
#17041
opened Apr 23, 2025 by
nannaer
[Bug]: Many endpoints are returning 500 Internal Server Error
bug
Something isn't working
good first issue
Good for newcomers
#17038
opened Apr 23, 2025 by
tarukumar
1 task done
[Bug]: Undocumented HTTP Status Codes for vllm endpoints
bug
Something isn't working
good first issue
Good for newcomers
#17037
opened Apr 23, 2025 by
tarukumar
1 task done
[Bug]: Multiple openai endpoint Missing Content-Type Header
bug
Something isn't working
good first issue
Good for newcomers
#17036
opened Apr 23, 2025 by
tarukumar
1 task done
[Usage]: DeepSeek R1 on a 8xH200 node is too slow
usage
How to use vllm
#17035
opened Apr 23, 2025 by
jorgeantonio21
1 task done
[Bug]: vllm 0.8.3 v1 engine has different computation performance per iteration when serving multi-lora with different chunk size
bug
Something isn't working
#17034
opened Apr 23, 2025 by
sjtu-zwh
1 task done
[Bug]: ValueError: Model architectures ['OPTForCausalLM'] failed to be inspected.
bug
Something isn't working
#17031
opened Apr 23, 2025 by
sydarb
1 task done
[Feature]: add hostname in metrics for clustering deployment
feature request
New feature or request
#17029
opened Apr 23, 2025 by
liougehooa
1 task done
[Bug]: The shape of the kv cache in the FlashAttention component of the LLM model in Qwen2.5 is very strange.
bug
Something isn't working
#17023
opened Apr 23, 2025 by
Jeremy-J-J
1 task done
[Bug]: When adding the parameter tensor_parallel_size, a TypeError occurred: BackendCompilerFailed.__init__() is missing one required positional argument: 'inner_exception'.
bug
Something isn't working
#17018
opened Apr 23, 2025 by
genanalucy
1 task done
[Installation]: Cannot install vllm due to xformers: ERROR: Failed building wheel for xformers fatal: Not a git repository (or any parent up to mount point /scratch) assert len(sources) > 0 AssertionError
installation
Installation problems
#17015
opened Apr 23, 2025 by
tjoymeed
1 task done
[Bug]: Something isn't working
image_grid_thw
not set in CachedRequestState
- Qwen2.5 VL 3B
bug
#17007
opened Apr 22, 2025 by
francescotaioli
1 task done
[Performance]: Distributed Inference w/ & w/o RDMA over Infiniband
performance
Performance-related issues
#17006
opened Apr 22, 2025 by
surajssd
1 task done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.