Deduplicating Vectors for Space Optimization - Patch #2840

sobhu17 · 2025-08-08T17:06:36Z

Description

This is the second PR which will make use of the changes from the first PR. It includes the C++ changes added as a patch in the KNN project.
Currently we store vectors in two files (.vec and .faiss). The two vector sets stored in each file may differ where Faiss index file (.faiss) might contains quantized vectors for fast approximate search, while the Lucene .vec file stores the full-precision vectors used during exact search.
For FP32, Byte, and Pure Binary cases, maintaining two copies of the same vectors in both .vec and .faiss files is redundant. Both approximate and exact search operations can run on a single set of vectors, making it possible to eliminate this duplication safely.

Adding the link of my local faiss repo for reference- faiss_changes

Files changed in external faiss:

io.h
index_write.cpp
index_read.cpp
index_read_utils.h
index_io.h

Related Issues

Resolves #2823

Check List

New functionality includes testing.
New functionality has been documented.
API changes companion pull request created.
Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: sobhu17 <[email protected]>

finnroblin · 2025-08-08T17:17:42Z

jni/patches/faiss/0007-Custom-patch-to-support-dedup-flatvector-faiss-file.patch

+     }
+ }
+
+-InvertedLists* read_InvertedLists(IOReader* f, int io_flags) {


Why change int -> uint64_t?

We use uint64_t because there’s a possibility that FAISS might introduce a flag in the future that uses the same bit position as ours.
By defining const uint64_t DEDUPE_VECTORS_OPT_DISABLED = (1ull << 63);
we ensure our io_flags value sits in a high, reserved bit, minimizing the risk of conflicting with any future FAISS flags.

Thanks for the info. That makes sense. I think the best way to document this in the code is to use this constant everywhere instead of raw 1ull << 63 value and add the info you shared here as a comment above the constant.

navneet1v · 2025-08-08T17:20:05Z

Adding the link of my local faiss repo for reference- faiss_changes

what is this patch actually doing?

finnroblin · 2025-08-08T17:22:06Z

jni/patches/faiss/0007-Custom-patch-to-support-dedup-flatvector-faiss-file.patch

+         idxf->code_size = idxf->d * sizeof(float);
+-        read_xb_vector(idxf->codes, f);
+        if(dedup_applied){
+            idxf->codes.resize(idxf->ntotal * idxf->code_size);


Could you please add comments here explaining the process?

A small overview for the patch:
During index loading, without this optimization the .faiss file contains the flat vector section. With this patch, the .faiss file will no longer store flat vectors, instead vectors are loaded from the .vec file into the C++ side using our custom VectorReader, which streams the vectors one by one. To enable this, vector writing to the .faiss file is disabled(index_write.cpp), and the loading path is updated to fetch vectors via VectorReader(index_read.cpp) at index load time.

finnroblin · 2025-08-08T17:24:35Z

jni/src/faiss_wrapper.cpp

    // Now that indexWriter is trained, we just load the bytes into an array and return
    faiss::VectorIOWriter vectorIoWriter;
-    faiss::write_index(indexWriter.get(), &vectorIoWriter);
+    faiss::write_index(indexWriter.get(), &vectorIoWriter, 1ull<<63);


Could the ioflags be moved into a constant with a descriptive name?

Yes, I will create constant for io_flags and use that.

Signed-off-by: sobhu17 <[email protected]>

finnroblin · 2025-08-08T17:37:11Z

jni/include/faiss_stream_support.h

+                env->ReleaseByteArrayElements(vector, elems, JNI_ABORT);
+            });
+
+            int vectorByteSize = sizeof(float) * length;


Shouldn't this be sizeof(byte) if it's the byte vector case?

There is a problem with that, for byte and other quantized(IxSQ) cases faiss expects float vectors and it internally performs quantization. I tried using the byte vectors array but that failed. So what we are doing here is getting byte vectors from VectorReader and then typecasting those vectors to float, then in read_index for IxSQ case what we are doing is using faiss sq.compute_codes so that faiss can get vectors in proper format.

finnroblin · 2025-08-08T17:38:17Z

jni/include/faiss_stream_support.h

+            if (env->ExceptionCheck() || vectorReaderGlobalRef == nullptr) return false;
+        }
+
+        if (isFloat) {


Can we add two helper functions to reduce branching? It's hard to tell where float and byte cases differ in this code, so helper would improve readability and maintainability.

Yes, it make sense, thank you.

finnroblin · 2025-08-08T17:39:01Z

jni/include/faiss_stream_support.h

+
+            float* floatDest = static_cast<float*>(dest);
+            for (int i = 0; i < length; ++i) {
+                floatDest[i] = static_cast<float>(elems[i]);


Let's use memcpy

Yes, Got it!

Signed-off-by: sobhu17 <[email protected]>

change C++ files to support dedup vector optimization

fa0f5ba

Signed-off-by: sobhu17 <[email protected]>

sobhu17 requested review from 0ctopus13prime, VijayanB, Vikasht34, heemin32, jmazanec15, junqiu-lei, luyuncheng, martin-gaievski, naveentatikonda, navneet1v, ryanbogan, shatejas and vamshin as code owners August 8, 2025 17:06

finnroblin reviewed Aug 8, 2025

View reviewed changes

add comment to patch file

95f2302

Signed-off-by: sobhu17 <[email protected]>

finnroblin reviewed Aug 8, 2025

View reviewed changes

patch desciption

1f25e25

Signed-off-by: sobhu17 <[email protected]>

Deduplicating Vectors for Space Optimization - Patch #2840

Are you sure you want to change the base?

Deduplicating Vectors for Space Optimization - Patch #2840

Conversation

sobhu17 commented Aug 8, 2025

Description

Related Issues

Check List

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

navneet1v commented Aug 8, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sobhu17 Aug 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sobhu17 Aug 8, 2025 •

edited

Loading