Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add MultiVectorRetriever as a foundation for ParentDocumentRetriever #2349

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

seungy0
Copy link

@seungy0 seungy0 commented Feb 28, 2025

Hello.

I noticed that there is an open issue requesting a ParentDocumentRetriever in Spring AI #63, and I’d like to contribute by implementing it.

Following the approach used in LangChain, I am introducing a MultiVectorRetriever that retrieves small document chunks from a vector store. The ParentDocumentRetriever will then extend this retriever to fetch the full parent documents based on the retrieved chunks.

As a first step, this PR introduces:

MultiVectorRetriever: Retrieves relevant small chunks from a VectorStore.
DocumentStore Interface: A key-value storage interface for retrieving full parent documents.
I’d love to get your feedback on this approach before proceeding with the ParentDocumentRetriever implementation. If this direction looks good, I’ll continue working on the next part.

Looking forward to your thoughts! Thanks!

@seungy0 seungy0 changed the title Gh 63 Add MultiVectorRetriever as a foundation for ParentDocumentRetriever Feb 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant