Skip to content

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2025
#15735 opened Mar 29, 2025 by simon-mo
Open 8
[V1] Feedback Thread
#12568 opened Jan 30, 2025 by simon-mo
Open 88
Beta
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Bug]: noop elimination for slice errors when end = -1 bug Something isn't working
#17078 opened Apr 23, 2025 by BoyuanFeng
1 task done
[Bug]: Aria model error due to version mismatch with transformers bug Something isn't working
#17077 opened Apr 23, 2025 by musoles
1 task done
[Usage]: OpenAI Server API usage How to use vllm
#17075 opened Apr 23, 2025 by lfoppiano
1 task done
[Feature]: GGUF support for GLM4 feature request New feature or request
#17069 opened Apr 23, 2025 by DefinitlyEvil
1 task done
[Bug]: CI Build image failure due to mamba-ssm==2.2.4 installation error bug Something isn't working
#17068 opened Apr 23, 2025 by csy1204
1 task done
[Performance]: UVA vs UVM for CPU offloading on v0.8.4+ performance Performance-related issues
#17062 opened Apr 23, 2025 by rajesh-s
1 task done
[Usage]: Does vLLM support QwQ 32B + tool calling? usage How to use vllm
#17061 opened Apr 23, 2025 by ItzAmirreza
1 task done
[Bug]: Issue with SpecDecode when using data parallel bug Something isn't working
#17056 opened Apr 23, 2025 by zoman-sgd
[Bug]: ValueError when using Multi-Instance GPU bug Something isn't working
#17047 opened Apr 23, 2025 by rantahar
1 task done
[Bug]: Many endpoints are returning 500 Internal Server Error bug Something isn't working good first issue Good for newcomers
#17038 opened Apr 23, 2025 by tarukumar
1 task done
[Bug]: Undocumented HTTP Status Codes for vllm endpoints bug Something isn't working good first issue Good for newcomers
#17037 opened Apr 23, 2025 by tarukumar
1 task done
[Bug]: Multiple openai endpoint Missing Content-Type Header bug Something isn't working good first issue Good for newcomers
#17036 opened Apr 23, 2025 by tarukumar
1 task done
[Usage]: DeepSeek R1 on a 8xH200 node is too slow usage How to use vllm
#17035 opened Apr 23, 2025 by jorgeantonio21
1 task done
[Bug]: ValueError: Model architectures ['OPTForCausalLM'] failed to be inspected. bug Something isn't working
#17031 opened Apr 23, 2025 by sydarb
1 task done
[Feature]: add hostname in metrics for clustering deployment feature request New feature or request
#17029 opened Apr 23, 2025 by liougehooa
1 task done
[Bug]: image_grid_thw not set in CachedRequestState - Qwen2.5 VL 3B bug Something isn't working
#17007 opened Apr 22, 2025 by francescotaioli
1 task done
[Performance]: Distributed Inference w/ & w/o RDMA over Infiniband performance Performance-related issues
#17006 opened Apr 22, 2025 by surajssd
1 task done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.