[LuceneOnFaiss Part-8] Added index.knn.memory_optimized_search index setting. #2616

Open · wants to merge 1 commit into base: lucene-on-faiss

Conversation

0ctopus13prime
Collaborator

Description

This PR adds the index.knn.memory_optimized_search index setting.
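For orientation, here is a minimal sketch of what an index-scoped boolean setting definition could look like in OpenSearch; the default value and property flags are assumptions, not taken from this PR.

```java
import org.opensearch.common.settings.Setting;

// Hedged sketch only: the actual definition in this PR may use a different default
// or additional Setting.Property flags.
public final class KNNIndexSettingsSketch {
    public static final Setting<Boolean> MEMORY_OPTIMIZED_SEARCH_SETTING = Setting.boolSetting(
            "index.knn.memory_optimized_search", // setting key from this PR
            false,                               // assumed default: feature disabled
            Setting.Property.IndexScope          // index-level setting
    );
}
```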
Overall, along with the new setting definition, this brings the two major changes listed below:

  1. Add the index name to field attributes (see the first sketch after this list).
    In VectorReader, we check whether memory-optimized search is enabled. If it is, we use IndexInput to partially load the FAISS index for searching. The partially loaded index hierarchy is then used to serve queries via Lucene's HNSW graph searcher.
    Since this setting is defined at the index level, we need the index name to look up whether it is enabled.
    To that end, we inject the index name into the field attributes.

  2. Branching in KNNQueryBuilder (see the second sketch after this list).
    This PR makes each KNN field type decide whether memory-optimized search is supported.
    Currently only float HNSW is supported, so any binary HNSW graph returns false to indicate the feature is unsupported.
    If the target field supports memory-optimized search, control falls back to a Lucene query to run the vector search, where Lucene's HNSW graph searcher performs KNN search and radius search on the FAISS index.
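A minimal sketch of the first change, injecting the index name into field attributes and reading it back on the reader side. The attribute key, helper names, and the setting-lookup callback are illustrative assumptions, not the plugin's actual API.

```java
import java.util.Map;
import java.util.function.Function;
import org.apache.lucene.index.FieldInfo;

// Hedged sketch: names below (INDEX_NAME_ATTRIBUTE, injectIndexName, ...) are hypothetical.
public final class IndexNameAttributeSketch {
    // Hypothetical attribute key carrying the owning index name on each KNN field.
    static final String INDEX_NAME_ATTRIBUTE = "knn.index_name";

    // At mapping/field-info creation time, stash the index name in the field attributes
    // so the codec-level VectorReader can later resolve index-scoped settings.
    static void injectIndexName(Map<String, String> fieldAttributes, String indexName) {
        fieldAttributes.put(INDEX_NAME_ATTRIBUTE, indexName);
    }

    // In the VectorReader, read the attribute back and ask whether
    // index.knn.memory_optimized_search is enabled for that index.
    static boolean isMemoryOptimizedSearchEnabled(FieldInfo fieldInfo,
                                                  Function<String, Boolean> settingLookup) {
        final String indexName = fieldInfo.attributes().get(INDEX_NAME_ATTRIBUTE);
        return indexName != null && Boolean.TRUE.equals(settingLookup.apply(indexName));
    }
}
```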

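And a minimal sketch of the second change, the per-field-type branching when the KNN query is built. KNNVectorFieldType, supportsMemoryOptimizedSearch, and the two factory methods are illustrative assumptions standing in for the real KNNQueryBuilder code.

```java
import org.apache.lucene.search.Query;

// Hedged sketch: the field-type interface and factory methods below are hypothetical.
public final class KnnQueryBranchingSketch {

    interface KNNVectorFieldType {
        // Each KNN field type reports whether it supports memory-optimized search
        // (currently only float HNSW would return true; binary HNSW returns false).
        boolean supportsMemoryOptimizedSearch();
    }

    Query buildKnnQuery(KNNVectorFieldType fieldType, float[] queryVector, int k) {
        if (fieldType.supportsMemoryOptimizedSearch()) {
            // Supported: delegate to the Lucene query path, where Lucene's HNSW graph
            // searcher runs KNN / radius search over the partially loaded FAISS index.
            return createLuceneKnnQuery(fieldType, queryVector, k);
        }
        // Unsupported: keep the existing native FAISS query path.
        return createNativeKnnQuery(fieldType, queryVector, k);
    }

    // Hypothetical factories standing in for the actual query construction.
    private Query createLuceneKnnQuery(KNNVectorFieldType fieldType, float[] vector, int k) {
        throw new UnsupportedOperationException("sketch only");
    }

    private Query createNativeKnnQuery(KNNVectorFieldType fieldType, float[] vector, int k) {
        throw new UnsupportedOperationException("sketch only");
    }
}
```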
Related Issues

Resolves #[Issue number to be closed when this PR is merged]

Check List

  • New functionality includes testing.
  • New functionality has been documented.
  • API changes companion pull request created.
  • Commits are signed per the DCO using --signoff.
  • Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.


@0ctopus13prime
Collaborator Author

Part-7 #2608 was merged.
I will rebase and push a new revision, which will include unit tests.

@0ctopus13prime force-pushed the lucene-on-faiss-part8 branch 4 times, most recently from fbd6e65 to fcef387 (March 20, 2025 02:13)
@0ctopus13prime
Collaborator Author

Hi @jmazanec15 ,

In this revision, I've included a hybrid approach inspired by @shatejas's suggestion.

Previously, I added lazy loading in search because accessing the index.knn.memory_optimized_search setting in the constructor seemed tricky. Retrieving this value requires knowing the index name, and more importantly, during recovery, the codec is loaded via Lucene's SPI, where OpenSearch's internal framework isn't available. That was the main reason for switching to lazy loading within search.

However, this scenario is rare. In most cases, OpenSearch's internal framework (e.g., IndexSettings) is available. Additionally, since we're trying to compress long[] on the fly during loading in #2609 , relying solely on lazy loading in search could negatively impact p99, as it might take several seconds for larger datasets.

With this hybrid approach, we attempt to initialize in the constructor. If IndexSettings is available, we fetch the boolean value; otherwise, we do nothing. This way, most searches will find the table already initialized, so there is no need for lazy loading. Only searches on a just-recovered index will require it.

Let me know if you have any major concerns. I think this approach not only mitigates potential p99 issues but also avoids the Lucene SPI limitation.
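For illustration, here is a small sketch of the hybrid initialization described above: resolve the flag eagerly in the constructor when IndexSettings is available, and fall back to a lazy lookup on the search path otherwise. All names are illustrative assumptions, not the actual reader code.

```java
import java.util.function.BooleanSupplier;

// Hedged sketch: field and method names are hypothetical.
final class MemoryOptimizedSearchFlag {
    // null means "not yet determined", e.g. when the codec was loaded through Lucene's SPI
    // during recovery and OpenSearch's IndexSettings was not reachable.
    private volatile Boolean enabled;

    // Constructor path: resolve eagerly when possible so most searches pay no extra cost.
    MemoryOptimizedSearchFlag(BooleanSupplier eagerLookup, boolean eagerLookupAvailable) {
        this.enabled = eagerLookupAvailable ? eagerLookup.getAsBoolean() : null;
    }

    // Search path: lazily resolve only if the constructor could not (e.g. a just-recovered index).
    boolean isEnabled(BooleanSupplier lazyLookup) {
        Boolean value = enabled;
        if (value == null) {
            value = lazyLookup.getAsBoolean();
            enabled = value;
        }
        return value;
    }
}
```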

@0ctopus13prime force-pushed the lucene-on-faiss-part8 branch 2 times, most recently from c0dc961 to 822f918 (March 21, 2025 22:48)
Collaborator

@shatejas left a comment


Looks good overall - couple of comments

@0ctopus13prime force-pushed the lucene-on-faiss-part8 branch from 822f918 to 6132ada (March 24, 2025 00:25)
Signed-off-by: Dooyong Kim <kdooyong@amazon.com>
Co-authored-by: Dooyong Kim <kdooyong@amazon.com>
@0ctopus13prime force-pushed the lucene-on-faiss-part8 branch from 6132ada to 952723d (March 24, 2025 17:05)