
opensearch-jvector-plugin

Welcome!

The OpenSearch jVector Plugin enables you to run nearest neighbor search on billions of documents across thousands of dimensions with the same ease as running any regular OpenSearch query. You can use aggregations and filter clauses to further refine your similarity search operations. k-NN similarity search powers use cases such as product recommendations, fraud detection, image and video search, related document search, and more.
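For illustration, a vector search request looks like any other OpenSearch query. A minimal sketch using the standard k-NN query DSL (the index and field names here, such as `my_vector`, are hypothetical):

```json
{
  "size": 10,
  "query": {
    "knn": {
      "my_vector": {
        "vector": [0.1, 0.2, 0.3],
        "k": 10
      }
    }
  }
}
```

Such a query body can be combined with regular `bool` filter clauses and aggregations in the same request.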

Why use OpenSearch jVector Plugin?

High Level

  • Scalable: Run similarity search on billions of documents across thousands of dimensions without exceeding memory and without choking on the disk, by using DiskANN
  • Fast: Blazing-fast pure Java implementation with minimal overhead (see benchmarks)
  • Lightweight: Pure Java implementation. Self-contained and builds in seconds, with no native dependencies, no complex flaky builds, and no additional hundreds of thousands of lines of code you didn't ask for.

Unique Features

  • DiskANN - JVector is a pure Java implementation capable of performing approximate nearest neighbor (ANN) search in a way that is optimized for RAM-bound environments with minimal additional overhead. There is no need for native dependencies (such as FAISS) or a cumbersome JNI mechanism.
  • Thread Safety - JVector is a thread-safe index that supports concurrent modifications and inserts with near-perfect scalability as you add cores; Lucene is not thread-safe. This allows us to ingest a much higher volume of vectors much faster, parallelizing ingestion without unnecessary merge operations.
  • Quantized index construction - JVector can build the index with quantized vectors, saving memory; less memory means larger segments, larger segments mean fewer segments, and fewer segments mean faster searches.
  • Quantized DiskANN - JVector supports DiskANN-style quantization with reranking. In principle it is easy to demonstrate that this makes a massive difference in performance for larger-than-memory indexes; in practice, the only hard part is that it takes days or weeks to insert enough vectors into Lucene to show this, because of its single-threaded construction.
  • PQ and BQ support - Building on quantized index construction, JVector supports product quantization (PQ) in addition to the binary quantization (BQ) that Lucene offers. PQ support is fairly rare (pgvector doesn't do PQ either) because (1) the code required for high-performance ADC with SIMD is somewhat involved and (2) it requires a separate codebook, which Lucene isn't set up to easily accommodate. PQ at 64x compression gives you higher relevance than BQ at 32x.
  • Fused ADC - JVector offers features that nobody else has, such as Fused ADC, NVQ, and anisotropic PQ.
  • Compatibility - JVector is compatible with Cassandra, which makes it easier to transfer vector-encoded data from Cassandra to OpenSearch and vice versa.
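To make the quantize-then-rerank idea concrete, here is a toy, self-contained Python sketch of DiskANN-style search: approximate distances are computed via ADC from compact product-quantized codes kept in RAM, and the top candidates are reranked with exact distances to the full-precision vectors (which in a real system would live on disk). This only illustrates the technique and is not JVector's implementation; real PQ trains codebooks with k-means and JVector navigates a graph rather than scanning all codes.

```python
import random

random.seed(0)
DIM, SUBSPACES, SUBDIM, K_CODES = 8, 4, 2, 4  # 8-dim vectors split into 4 subspaces of 2 dims

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

# Full-precision database vectors (would be on disk in a DiskANN setup)
db = [[random.random() for _ in range(DIM)] for _ in range(200)]

# One codebook per subspace; centroids sampled from the data (real PQ uses k-means)
codebooks = [
    [db[random.randrange(len(db))][s * SUBDIM:(s + 1) * SUBDIM] for _ in range(K_CODES)]
    for s in range(SUBSPACES)
]

def encode(v):
    """Map each subvector to the index of its nearest centroid."""
    codes = []
    for s in range(SUBSPACES):
        sub = v[s * SUBDIM:(s + 1) * SUBDIM]
        codes.append(min(range(K_CODES), key=lambda c: sq_dist(sub, codebooks[s][c])))
    return codes

codes = [encode(v) for v in db]  # compact codes kept in RAM

def search(query, k=5, rerank=20):
    # ADC: precompute query-to-centroid distances per subspace once,
    # then score every code by table lookups instead of full-vector math.
    tables = [
        [sq_dist(query[s * SUBDIM:(s + 1) * SUBDIM], codebooks[s][c]) for c in range(K_CODES)]
        for s in range(SUBSPACES)
    ]
    approx = [
        (sum(tables[s][code[s]] for s in range(SUBSPACES)), i)
        for i, code in enumerate(codes)
    ]
    candidates = [i for _, i in sorted(approx)[:rerank]]
    # Rerank the shortlist with exact distances to the full vectors
    return sorted(candidates, key=lambda i: sq_dist(query, db[i]))[:k]

q = [random.random() for _ in range(DIM)]
print(search(q))
```

The point of the two phases is that only the small `rerank` shortlist ever touches the full vectors, so disk reads stay bounded even when the index is much larger than RAM.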

Project Resources

Benchmarks

As of today we have collected compelling data showing clear performance advantages for jVector over other k-NN engines, which is why we think it will be a good addition to OpenSearch's vector search engines. For example, compared with the default Lucene HNSW graph implementation, we observed significant latency benefits even on datasets as small as 10,000 vectors. The advantage grows with dataset size and increases by several orders of magnitude as the operating system's file system cache shrinks (as expected in RAM-constrained environments).

JMH engine benchmark outputs

Important note: JMH numbers are qualitative and relative and should not be treated as globally consistent. In other words, the numbers below only illustrate the relative performance difference; absolute values will vary across systems, but the ratios should stay roughly constant.

RandomVectors:

Benchmark                                               (codecType)  (dimension)  (numDocs)  Mode  Cnt  Score   Error  Units
FormatBenchmarkRandomVectors.benchmarkSearch  jvector_not_quantized          128       1000  avgt    5  0.146 ± 0.002  ms/op
FormatBenchmarkRandomVectors.benchmarkSearch  jvector_not_quantized          128      10000  avgt    5  0.332 ± 0.003  ms/op
FormatBenchmarkRandomVectors.benchmarkSearch  jvector_not_quantized          128     100000  avgt    5  0.451 ± 0.004  ms/op
FormatBenchmarkRandomVectors.benchmarkSearch      jvector_quantized          128       1000  avgt    5  0.147 ± 0.001  ms/op
FormatBenchmarkRandomVectors.benchmarkSearch      jvector_quantized          128      10000  avgt    5  0.181 ± 0.002  ms/op
FormatBenchmarkRandomVectors.benchmarkSearch      jvector_quantized          128     100000  avgt    5  0.194 ± 0.002  ms/op
FormatBenchmarkRandomVectors.benchmarkSearch              Lucene101          128       1000  avgt    5  0.707 ± 0.016  ms/op
FormatBenchmarkRandomVectors.benchmarkSearch              Lucene101          128      10000  avgt    5  1.578 ± 0.022  ms/op
FormatBenchmarkRandomVectors.benchmarkSearch              Lucene101          128     100000  avgt    5  2.156 ± 0.080  ms/op
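As a quick check of those ratios, the 100,000-document rows of the table above work out to roughly a 4.8x speedup over Lucene101 for the unquantized jVector codec and about 11.1x for the quantized one:

```python
# Speedup of jVector over Lucene101 at numDocs=100000,
# computed from the ms/op scores in the table above
scores = {
    "jvector_not_quantized": 0.451,
    "jvector_quantized": 0.194,
    "Lucene101": 2.156,
}
for codec in ("jvector_not_quantized", "jvector_quantized"):
    speedup = scores["Lucene101"] / scores[codec]
    print(f"{codec}: {speedup:.1f}x faster")  # 4.8x and 11.1x respectively
```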

Visualization:
Benchmark: FormatBenchmarkRandomVectors.benchmarkSearch (dimension: 128)
Grouped by Number of Documents (numDocs)
Scaling: max bar width corresponds to 2.156 ms/op → 50 characters
--------------------------------------------------------------------------------
NumDocs: 1000
------------------------------------------------------------
Codec Type             Score (ms/op)    Visualization
------------------------------------------------------------
jvector_not_quantized  0.146 ms/op      |███
jvector_quantized      0.147 ms/op      |███
Lucene101              0.707 ms/op      |████████████████

NumDocs: 10000
------------------------------------------------------------
Codec Type             Score (ms/op)    Visualization
------------------------------------------------------------
jvector_not_quantized  0.332 ms/op      |████████
jvector_quantized      0.181 ms/op      |████
Lucene101              1.578 ms/op      |█████████████████████████████████████

NumDocs: 100000
------------------------------------------------------------
Codec Type             Score (ms/op)    Visualization
------------------------------------------------------------
jvector_not_quantized  0.451 ms/op      |██████████
jvector_quantized      0.194 ms/op      |█████
Lucene101              2.156 ms/op      |██████████████████████████████████████████████████
--------------------------------------------------------------------------------

sift-128-euclidean:

Benchmark                                                   (codecType)            (datasetName)  Mode  Cnt  Score   Error  Units
FormatBenchmarkWithKnownDatasets.benchmarkSearch  jvector_not_quantized  sift-128-euclidean.hdf5  avgt    4  0.292 ± 0.002  ms/op
FormatBenchmarkWithKnownDatasets.benchmarkSearch              Lucene101  sift-128-euclidean.hdf5  avgt    4  1.160 ± 0.015  ms/op

JMH Benchmark Results
Benchmark: FormatBenchmarkWithKnownDatasets.benchmarkSearch
Dataset: sift-128-euclidean.hdf5

Visualization:
Codec Type            Avg Time (ms/op)   Visualization
---------------------------------------------------------------
jvector_not_quantized  0.292 ms/op       |█████████████                        
Lucene101              1.160 ms/op       |██████████████████████████████████████████████████
---------------------------------------------------------------

The numbers above were collected in an environment where all data was cached in either JVM or operating system cache memory, and we can already see a significant difference in performance!

When moving to a RAM-constrained environment, we expect to see a difference of several orders of magnitude. For example, if Lucene performs 100x the number of disk reads compared to JVector, and the disk is 100x slower than RAM, then we can expect JVector to be up to 10,000x faster than Lucene in this scenario.
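The arithmetic in that example as a one-line model (the 100x figures are illustrative assumptions from the paragraph, not measurements):

```python
# Back-of-envelope model of larger-than-memory search cost.
# Both 100x factors are illustrative assumptions, not benchmark results.
extra_disk_reads = 100   # Lucene does ~100x the disk reads of JVector
disk_vs_ram = 100        # a disk read is ~100x slower than a RAM access
projected_speedup = extra_disk_reads * disk_vs_ram
print(projected_speedup)  # 10000
```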

OpenSearch-Benchmark output on single node for sift-128-euclidean-L2

(Figures: throughput, latency, and recall plots.)

Credits and Acknowledgments

This project uses two similarity search libraries to perform Approximate Nearest Neighbor Search: the Apache 2.0-licensed Lucene and jVector.

Code of Conduct

This project has adopted the Placeholder Open Source Code of Conduct. For more information see the Code of Conduct FAQ, or contact placeholder with any additional questions or comments.

License

This project is licensed under the Apache v2.0 License.

Copyright

Copyright OpenSearch Contributors. See NOTICE for details.
