Skip to content

Increase the minimum doc count to 5 #52

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Apr 14, 2025

Conversation

rchitale7
Copy link
Member

@rchitale7 rchitale7 commented Apr 12, 2025

Description

During integration testing, we observed that CAGRA would fail to build an index when the doc count was 1, 3 or 4. Specifically, the copyTo method used to convert the GPU index to CPU index would hang. This would cause certain kNN integration tests to fail - see #31 and #30. We tested up to having 23,000 docs, and did not see this issue with any other doc count. So, I believe it is sufficient to raise the minimum doc count threshold to 5, to prevent this issue from happening.

I also refactored a lot of the unit tests to make testing these types of changes easier in the future

Issues Resolved

Resolves #31 and #30.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Rohan Chitale <[email protected]>
@rchitale7 rchitale7 marked this pull request as ready for review April 14, 2025 17:11
@navneet1v
Copy link
Collaborator

@rchitale7 is the branch properly rebased. The number of changes this small change has introduced seems to be pretty high.

@rchitale7
Copy link
Member Author

rchitale7 commented Apr 14, 2025

@navneet1v Yeah its rebased, I refactored some of the unit tests so thats why theres a lot of changes. Just wanted to clean things up/make tests more clear

@navneet1v
Copy link
Collaborator

@navneet1v Yeah its rebased, I refactored some of the unit tests so thats why theres a lot of changes. Just wanted to clean things up/make tests more clear

Please try to mention this in description, this will help reviewers. :)

@rchitale7 rchitale7 merged commit 4e7dd21 into opensearch-project:main Apr 14, 2025
4 checks passed
@rchitale7 rchitale7 deleted the doc_count_fix branch April 14, 2025 18:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] FilteredSearchANNSearchIT Failure When Building Index Remotely
3 participants