Skip to content

feat: add support for vLLM response format in reranking logic and up… #5954

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Jun 16, 2025

Conversation

Iamshankhadeep
Copy link
Contributor

@Iamshankhadeep Iamshankhadeep commented Jun 3, 2025

Description

vLLM's rerank endpoint returns results.results instead of results.data like everyone else. Our code was trying to sort undefined.
her's the link Re-rank API

I aslo added proper TypeScript types (VllmRerankResponse), but build is failing because of that.

Checklist

  • I've read the contributing guide
  • The relevant docs, if any, have been updated or created
  • [] The relevant tests, if any, have been updated or created

fixes #5869

@Iamshankhadeep Iamshankhadeep requested a review from a team as a code owner June 3, 2025 06:17
@Iamshankhadeep Iamshankhadeep requested review from sestinj and removed request for a team June 3, 2025 06:17
Copy link

cubic-dev-ai bot commented Jun 3, 2025

Your cubic subscription is currently inactive. Please reactivate your subscription to receive AI reviews and use cubic.

Copy link

netlify bot commented Jun 3, 2025

Deploy Preview for continuedev failed. Why did it fail? →

Name Link
🔨 Latest commit cbb5c21
🔍 Latest deploy log https://app.netlify.com/projects/continuedev/deploys/68430f0d6ae3b80008e6b25d

@dosubot dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Jun 3, 2025
@Iamshankhadeep Iamshankhadeep changed the title feaet: add support for vLLM response format in reranking logic and up… feat: add support for vLLM response format in reranking logic and up… Jun 3, 2025
Copy link
Contributor

@sestinj sestinj left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Iamshankhadeep thanks for looking into this! I'm thinking that instead of adding this extra logic to the BaseLLM class, we could just add a rerank method to the dedicated VLLM class. You can find an example of such a rerank method in the Voyage class.

Let me know if this makes sense or if I can explain further

@github-project-automation github-project-automation bot moved this from Todo to In Progress in Issues and PRs Jun 4, 2025
@Iamshankhadeep
Copy link
Contributor Author

@sestinj yup totally makes sense,
Let me implement that. Thanks

@Iamshankhadeep
Copy link
Contributor Author

Iamshankhadeep commented Jun 6, 2025

@sestinj should be ready for review. Let me know which issue I can work on next. thanks 🔥

Copy link

recurseml bot commented Jun 13, 2025

✨ No issues found! Your code is sparkling clean! ✨

Copy link
Contributor

@sestinj sestinj left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the updates, this looks great!

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Jun 16, 2025
@sestinj sestinj merged commit 4d63b4c into continuedev:main Jun 16, 2025
31 of 35 checks passed
@github-project-automation github-project-automation bot moved this from In Progress to Done in Issues and PRs Jun 16, 2025
@github-actions github-actions bot locked and limited conversation to collaborators Jun 16, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
lgtm This PR has been approved by a maintainer size:M This PR changes 30-99 lines, ignoring generated files.
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

codebase context provider fts query bug
2 participants