Skip to content

Add vectorsearch Remote Index Build docs #9575

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 12 commits into from
Apr 30, 2025

Conversation

owenhalpert
Copy link
Contributor

@owenhalpert owenhalpert commented Apr 7, 2025

Description

Add documentation for the Remote Build Service (META issue, blog post, metrics)

Issues Resolved

Closes #9574

Version

3.0.0 and above

Frontend features

N/A

Checklist

  • By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and subject to the Developers Certificate of Origin.
    For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Copy link

github-actions bot commented Apr 7, 2025

Thank you for submitting your PR. The PR states are In progress (or Draft) -> Tech review -> Doc review -> Editorial review -> Merged.

Before you submit your PR for doc review, make sure the content is technically accurate. If you need help finding a tech reviewer, tag a maintainer.

When you're ready for doc review, tag the assignee of this PR. The doc reviewer may push edits to the PR directly or leave comments and editorial suggestions for you to address (let us know in a comment if you have a preference). The doc reviewer will arrange for an editorial review.

@owenhalpert owenhalpert force-pushed the remote-vector-docs branch 4 times, most recently from f342c79 to d796b71 Compare April 8, 2025 20:23
@kolchfa-aws kolchfa-aws added v3.0.0 release-notes PR: Include this PR in the automated release notes labels Apr 16, 2025
@owenhalpert owenhalpert changed the title Initial Remote Index Build draft Add vectorsearch Remote Index Build docs Apr 17, 2025
@owenhalpert owenhalpert marked this pull request as ready for review April 17, 2025 21:22
@kolchfa-aws
Copy link
Collaborator

@owenhalpert Please let me know when you finish addressing technical comments and the PR is ready for doc review. Thanks!

@owenhalpert
Copy link
Contributor Author

@kolchfa-aws I just resolved the open comment — ready for doc review! Thanks.


### Step 2: Create and register the remote vector repository

The remote vector repository will act as an intermediate object store between the OpenSearch cluster and the remote build service. The cluster will upload vectors and doc IDs to this repository. The remote build service will download these and eventually upload a completed index build in response. To create and register the repository, follow the [Register repository](https://opensearch.org/docs/latest/tuning-your-cluster/availability-and-recovery/snapshots/snapshot-restore/#register-repository) steps in the Snapshots documentation. Then, set `knn.remote_index_build.vector_repo` to be the name of this registered repository.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is knn.remote_index_build.vector_repo a dynamic setting that users can update via the API or yaml file?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does "completed index build" mean "built index"? Does "in response" mean that the build service uploads the finished (built) index as a result of processing the uploaded vectors and document IDs?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. Yes, a dynamic setting as per the definition in the "Dynamic settings" link you provided above.
  2. "Built index" is better here, thanks.
  3. Yes. It uploads the built index once it finishes the operation.

Copy link
Collaborator

@kolchfa-aws kolchfa-aws left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you, @owenhalpert! Please review my suggestions and let me know if they preserve technical accuracy. Once you confirm, we can move the PR to editorial review.

Signed-off-by: Fanit Kolchina <[email protected]>
kolchfa-aws and others added 2 commits April 29, 2025 19:21
Co-authored-by: kolchfa-aws <[email protected]>
Signed-off-by: owenhalpert <[email protected]>
@owenhalpert
Copy link
Contributor Author

Thanks so much for your review and comments @kolchfa-aws. Please let me know if there are other unresolved points, otherwise the PR is ready for the next stage of review.

@kolchfa-aws
Copy link
Collaborator

Thanks, @owenhalpert! Moving to editorial review.

@kolchfa-aws kolchfa-aws added the 5 - Editorial review PR: Editorial review in progress label Apr 29, 2025
Copy link
Collaborator

@natebower natebower left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kolchfa-aws @owenhalpert Please see my changes and let me know if you have any questions. Thanks!

kolchfa-aws and others added 2 commits April 30, 2025 08:57
Co-authored-by: Nathan Bower <[email protected]>
Signed-off-by: kolchfa-aws <[email protected]>
@kolchfa-aws kolchfa-aws merged commit de2c17f into opensearch-project:main Apr 30, 2025
5 checks passed
KishoreKicha14 pushed a commit to KishoreKicha14/documentation-website that referenced this pull request Jun 13, 2025
* Initial Remote Index Build draft

Signed-off-by: owenhalpert <[email protected]>

* Initial Remote Index Build draft

Signed-off-by: owenhalpert <[email protected]>

* Doc review

Signed-off-by: Fanit Kolchina <[email protected]>

* Slight rewording based on answers

Signed-off-by: Fanit Kolchina <[email protected]>

* Don't refer to feature flag

Signed-off-by: Fanit Kolchina <[email protected]>

* Update _vector-search/api/knn.md

Signed-off-by: kolchfa-aws <[email protected]>

* Update _vector-search/remote-index-build.md

Signed-off-by: kolchfa-aws <[email protected]>

* Update _vector-search/remote-index-build.md

Signed-off-by: kolchfa-aws <[email protected]>

* Apply suggestions from code review

Co-authored-by: kolchfa-aws <[email protected]>
Signed-off-by: owenhalpert <[email protected]>

* Apply suggestions from code review

Co-authored-by: Nathan Bower <[email protected]>
Signed-off-by: kolchfa-aws <[email protected]>

---------

Signed-off-by: owenhalpert <[email protected]>
Signed-off-by: Fanit Kolchina <[email protected]>
Signed-off-by: kolchfa-aws <[email protected]>
Co-authored-by: Fanit Kolchina <[email protected]>
Co-authored-by: kolchfa-aws <[email protected]>
Co-authored-by: Nathan Bower <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
5 - Editorial review PR: Editorial review in progress experimental release-notes PR: Include this PR in the automated release notes v3.0.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[DOC] Add k-NN's Remote Vector Index Builder documentation
5 participants