
[LuceneOnFaiss - Part1] Added building blocks for memory optimized search. #2581

Conversation

0ctopus13prime (Collaborator)

Description

[1/11] This is the first PR, establishing the building blocks for memory-optimized search.
At the moment, FAISS is the only engine that supports memory-optimized search, where vectors are loaded on demand.

Note that this will be merged into the feature branch first. Unit tests + IT tests will be covered in PR-9 and PR-10.

Related Issues

RFC : #2401

Check List

  • New functionality includes testing.
  • New functionality has been documented.
  • API changes companion pull request created.
  • Commits are signed per the DCO using --signoff.
  • Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on the Developer Certificate of Origin and signing off your commits, please check here.

@navneet1v (Collaborator):

@0ctopus13prime please fix the CIs too

@@ -216,4 +218,9 @@ public ResolvedMethodContext resolveMethod(
public boolean supportsRemoteIndexBuild() {
return knnLibrary.supportsRemoteIndexBuild();
}

@Override
public Optional<MemoryOptimizedSearcherFactory> getMemoryOptimizedSearcherFactory() {
Collaborator:

I am not really sure we should keep adding implementation details to the enum. Shouldn't a simple engine check inside the factory take care of this?

Collaborator (Author):

I think each Engine either supports or does not support the memory_optimized_search mode.
So far, FAISS is the only engine that supports the mode, but support can be extended to future engines.
From that perspective, it is natural to put the factory-getter method in Engine.

What is your concern?
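
For concreteness, a minimal sketch of the enum-based approach under discussion; the constant bodies and the FaissMemoryOptimizedSearcherFactory name are illustrative assumptions, not the PR's actual code.

import java.util.Optional;

// Illustrative sketch: each engine constant advertises memory-optimized
// search support by exposing (or not exposing) a searcher factory.
public enum KNNEngine {
    FAISS {
        @Override
        public Optional<MemoryOptimizedSearcherFactory> getMemoryOptimizedSearcherFactory() {
            // Hypothetical factory; FAISS is the only engine supporting the mode so far.
            return Optional.of(new FaissMemoryOptimizedSearcherFactory());
        }
    },
    LUCENE,
    NMSLIB;

    // Default for all other engines: no memory-optimized search support.
    public Optional<MemoryOptimizedSearcherFactory> getMemoryOptimizedSearcherFactory() {
        return Optional.empty();
    }
}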


@shatejas (Collaborator), Mar 6, 2025:

My problem with these enums is that they are turning into a giant static configuration database, along with a workaround for dependency injection. It's a maintenance concern: it doesn't seem to be a standard coding pattern, and I am not sure it's the right way to go.

Member:

> Shouldn't a simple engine check inside the factory take care of this?

I don't think we should branch throughout the code based on engine. This will make engine extensibility very difficult and will also create complex branching in the code; this has already happened to some degree in the query path.

The purpose of KNNLibrarySearchContext was to let the given engine tell us how we are supposed to search. This isn't totally complete, but the direction for further improving it is in #2568. On the indexing/reader setup side, we would need the engines to return a PerFieldKnnVectorFormat that can be added via https://github.com/opensearch-project/k-NN/blob/main/src/main/java/org/opensearch/knn/index/codec/KNNCodecService.java#L43.

Collaborator:

@0ctopus13prime, merging the thread from #2581 (comment) here too, and also adding my thoughts from the discussion happening on this thread.

@jmazanec15, the idea of the engine providing a PerFieldKnnVectorFormat based on the MappedFieldType is a step in the right direction. But once we have an engine-specific KNNVectorFormat, the searcher should not come from KNNEngine; it should be the job of the Reader to find the right searcher for the algorithm.

Moving the searcher into KNNEngine now feels like a decision we are taking too early. Should we defer it until we move to the PerFieldKnnVectorFormat being returned via KNNEngine?

Collaborator (Author):

If we can align on having a dedicated FAISS VectorFormat, I will update accordingly in the next revision.
I will add a branch in BasePerFieldKnnVectorsFormat that returns a FaissVectorFormat whose reader performs vector search on the FAISS index for k-NN fields using the FAISS engine.

// In BasePerFieldKnnVectorsFormat: route FAISS-backed fields to a dedicated format.
if (engine == KNNEngine.FAISS) {
    // The reader of this format searches the FAISS index directly for k-NN fields.
    return new FaissKnnVectorsFormat(...);
}

Member:

My concern is that the engine/method will determine if the optimized memory searcher is supported and how it's supported. In the current code, I'm not sure I'm seeing where the branching based on method logic is implemented; for instance, where would we say HNSW fp16 is not supported, or IVF is not supported (I might just be missing it)? But I think this can get fairly complex, which is why I think it would be better to go through the engine to determine what functionality is supported.

So we could just add a method in KNNLibrarySearchContext like "getMemoryOptimizedSearcherFactory()". Then we can retrieve it via knnEngine.getLibrarySearchContext(methodName) (we might need to take additional info, like the library params @owenhalpert is working on). With this, we keep the codec somewhat engine agnostic.
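
As a rough sketch of that suggestion, assuming hypothetical names (the default method below and the retrieval call are assumptions, not the merged API):

import java.util.Optional;

public interface KNNLibrarySearchContext {
    // Existing query-processing responsibilities omitted for brevity.

    // Empty by default; a FAISS HNSW context would override this to expose its
    // memory-optimized searcher factory, keeping the codec engine agnostic.
    default Optional<MemoryOptimizedSearcherFactory> getMemoryOptimizedSearcherFactory() {
        return Optional.empty();
    }
}

// Engine-agnostic call site:
// knnEngine.getLibrarySearchContext(methodName).getMemoryOptimizedSearcherFactory();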

Collaborator:

> My concern is that the engine/method will determine if the optimized memory searcher is supported and how it's supported. In the current code, I'm not sure I'm seeing where the branching based on method logic is implemented; for instance, where would we say HNSW fp16 is not supported, or IVF is not supported (I might just be missing it)? But I think this can get fairly complex, which is why I think it would be better to go through the engine to determine what functionality is supported.
>
> So we could just add a method in KNNLibrarySearchContext like "getMemoryOptimizedSearcherFactory()". Then we can retrieve it via knnEngine.getLibrarySearchContext(methodName) (we might need to take additional info, like the library params @owenhalpert is working on). With this, we keep the codec somewhat engine agnostic.

This seems reasonable to me!!

@0ctopus13prime (Collaborator, Author):

> @0ctopus13prime please fix the CIs too

Hmm... this PR should not affect anything at all 😅
Let me check; I suspect a flaky test.

@navneet1v (Collaborator):

> @0ctopus13prime please fix the CIs too
>
> Hmm... this PR should not affect anything at all 😅 Let me check; I suspect a flaky test.

I also think so, but it's always better to see the build CIs and the DCO checks passing. This ensures the branch is healthy.

@0ctopus13prime changed the title from "Added building blocks for memory optimized search. At the moment" to "[LuceneOnFaiss - Part1] Added building blocks for memory optimized search. At the moment" on Mar 5, 2025
@0ctopus13prime changed the title from "[LuceneOnFaiss - Part1] Added building blocks for memory optimized search. At the moment" to "[LuceneOnFaiss - Part1] Added building blocks for memory optimized search." on Mar 5, 2025
@0ctopus13prime force-pushed the lucene-on-faiss-part1 branch from 06716e7 to 83cd071 on March 6, 2025 00:48
@0ctopus13prime (Collaborator, Author):

Please note that once I get verbal approval on the implementation, I will ship unit tests in this PR.
Thanks.

…, only FAISS is supporting this.

Signed-off-by: Dooyong Kim <[email protected]>
@0ctopus13prime force-pushed the lucene-on-faiss-part1 branch from 20ddf6b to 9be82c7 on March 6, 2025 22:04

/**
* Vectors reader class for reading the flat vectors for native engines. The class provides methods for iterating
* over the vectors and retrieving their values.
*/
@Slf4j
Collaborator:

Why not log4j?

Collaborator (Author):

Slf4j is a general logging API without a tight dependency on a specific logging framework like Log4j or Logback; it's a facade over different logging frameworks. When the logging framework is upgraded or changed, the calling code needs no changes.
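
For illustration, a minimal sketch with a hypothetical class name: the code logs against the slf4j API, and the concrete framework (Log4j, Logback, ...) is chosen by whichever binding is on the classpath.

import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

class FlatVectorsReaderExample {
    // Equivalent to what Lombok's @Slf4j annotation generates.
    private static final Logger log = LoggerFactory.getLogger(FlatVectorsReaderExample.class);

    void load(String fieldName) {
        // Swapping the underlying logging framework requires no change here.
        log.info("Loading flat vectors for field [{}]", fieldName);
    }
}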

Collaborator:

Let's just use what we are already using in the plugin, to avoid conflicts in the future.

Collaborator (Author):

Why would it conflict? OpenSearch core is using slf4j with log4j.

Collaborator:

When I say conflict, I mainly mean that if someone tries to add a logger, they will be confused about whether to use slf4j or log4j; people will have conflicts in mind about what to pick. There is no specific reason from my side; for me it's all about consistency in the code.

Collaborator (Author):

Sounds good, I will update in the next revision.
Before raising a new PR, could you share your thoughts on the code?
If you leave comments, I will factor them into the next PR.

Collaborator:

I think we should start using Slf4j and clean up the code by replacing log4j with Slf4j!! There are a couple of benefits: 1. we align with core and Lucene (Lucene also uses Slf4j); 2. if core ever replaces it with another logging framework, we get that for free.

Collaborator:

In that case, it should be part of a separate GH issue and not in the scope of this PR.

Collaborator (Author):

Will update the annotation in the next rev! :)

@0ctopus13prime (Collaborator, Author):

@jmazanec15
For index types that don't support memory-optimized searching, FaissIndex.load (which will be added in the next PR) will throw an exception. The vector reader catches it and then moves on to the next field.
The index settings update event listener will make sure the mode can be turned on ONLY IF all k-NN fields use FAISS HNSW. Also, the flag check in KnnQueryBuilder will make sure index types other than HNSW fall back to NativeQuery, which uses the native shared library to continue the search.
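
A minimal sketch of that reader-side fallback, assuming hypothetical names (FaissIndex.load arrives in a later PR; the exception type and the field iteration shown here are assumptions):

// Inside the vector reader: skip fields whose FAISS index type cannot be
// searched in memory-optimized mode; queries on them fall back to NativeQuery.
for (FieldEntry field : fields) {
    try {
        faissIndices.put(field.name(), FaissIndex.load(field.input()));
    } catch (UnsupportedOperationException e) {
        log.info("Memory-optimized search unavailable for field [{}]", field.name());
    }
}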

@0ctopus13prime merged commit fbcdfbc into opensearch-project:lucene-on-faiss on Mar 7, 2025
34 checks passed
@0ctopus13prime deleted the lucene-on-faiss-part1 branch on March 10, 2025 02:17