[Feature]: Support for Running Classification Task in Online Server #13567
Labels
- feature request (New feature or request)
- good first issue (Good for newcomers)
- help wanted (Extra attention is needed)
🚀 The feature, motivation and pitch
I would like it to be easy to stand up models for sequence classification using the vLLM online inference pattern. Classification is currently available for offline inference, but it would be nice to expose it as a server in Kubernetes, similar to how we host the OpenAI-compatible servers.
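For reference, a minimal sketch of the offline classification flow that works today (the model name is just an example, and exact output attribute names may vary by vLLM version). The request is for an online, OpenAI-compatible equivalent of this.

```python
from vllm import LLM

# Sketch of the existing offline classification flow.
# The model name is illustrative; any sequence-classification checkpoint works.
llm = LLM(model="jason9693/Qwen2.5-1.5B-apeach", task="classify")

outputs = llm.classify(["vLLM makes serving models easy."])
for output in outputs:
    # Per-class probabilities for each prompt.
    print(output.outputs.probs)
```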
Alternatives
We could train a causal LM that treats special tokens as the classification labels, then take the softmaxed logprobs for those two tokens and threshold on them. However, this requires slightly more code on the client side, as sketched below.
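A rough client-side sketch of that workaround, assuming the model is served behind the existing OpenAI-compatible completions endpoint and was fine-tuned to emit one of two label tokens. The model name, label tokens, and threshold here are all hypothetical.

```python
import math
from openai import OpenAI

# Serve a causal LM behind the existing OpenAI-compatible server and treat
# two special tokens (" POS" / " NEG" here, purely illustrative) as the labels.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

resp = client.completions.create(
    model="my-finetuned-classifier",   # hypothetical model name
    prompt="Review: great product!\nLabel:",
    max_tokens=1,
    logprobs=5,                        # top logprobs for the generated token
    temperature=0.0,
)

# top_logprobs[0] is a dict mapping candidate tokens to their logprobs.
top = resp.choices[0].logprobs.top_logprobs[0]
lp_pos = top.get(" POS", float("-inf"))
lp_neg = top.get(" NEG", float("-inf"))

# Softmax over just the two label tokens, then threshold.
denom = math.exp(lp_pos) + math.exp(lp_neg)
p_pos = math.exp(lp_pos) / denom if denom > 0 else 0.5
label = "POS" if p_pos >= 0.5 else "NEG"
print(label, p_pos)
```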
Additional context
No response