Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add standard blueprint for vector search #3659

Merged
merged 3 commits into from
Mar 18, 2025

Conversation

mingshl
Copy link
Collaborator

@mingshl mingshl commented Mar 14, 2025

Description

Earlier, we have the these blueprint for embedding model which has pre and post processing function to format the model input and output format, so that it can be use for neural search query.
But now that we have ml inference processor that allow users to map the model input and output format, then user can register connector and model without pre and post processing function, We want to provide a new set of blueprint for embedding model with no pre and post processing function.

Related Issues

#3619

Check List

  • New functionality includes testing.
  • New functionality has been documented.
  • API changes companion pull request created.
  • Commits are signed per the DCO using --signoff.
  • Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@mingshl mingshl force-pushed the main-new-blueprints branch from a27e123 to e0ef6e7 Compare March 14, 2025 16:30
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 14, 2025 16:32 — with GitHub Actions Error
@mingshl mingshl temporarily deployed to ml-commons-cicd-env March 14, 2025 16:32 — with GitHub Actions Inactive
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 14, 2025 16:32 — with GitHub Actions Failure
@mingshl mingshl temporarily deployed to ml-commons-cicd-env March 14, 2025 16:32 — with GitHub Actions Inactive
Copy link
Contributor

@brianf-aws brianf-aws left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Left some minor comments, this is really great info to share!

# Model Blueprints for Vector Search

OpenSearch provides two approaches for implementing embedding models, depending on your needs:

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- Standard Blueprints
- Legacy Blueprints

This way they can follow what is the two approaches I get what you mean but I had to think a bit before I understood the idea.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

you got it the other way around. the new blueprint without pre and post processing function is called Standard blueprint now. The earlier blueprint which has pre and post processing functions are called legacy blueprint

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh I see so one is or ML inference processor and one is for connector (just using prediction)?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no, both are using connectors. One is without pre and post processing functions, but the other has pre or post processing functions

This is recommended for models to use the ML inference processor to handle input/output mapping.
Note that if using a model that requires pre and post processing functions, you must provide the functions in the blueprint. Please refer to legacy blueprint: [Cohere Embedding Connector Blueprint for image embedding mode](https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/cohere_connector_image_embedding_blueprint.md)

- embed-english-v3.0 1024
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lets be careful user might think the model name is embed-english-v3.0 1024 and not embed-english-v3.0

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

1024 is the token length I think

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So i looked up the models and saw that these are dimension sizes. This is pretty important info to communicate, when user wants to do semantic search.

Please see here
https://arc.net/l/quote/ekkxjzmv

}
```

## 2. Create a connector
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lets create a connector for managed too

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cohere is not AWS service, user can use the same connector in open source and also on AWS, if you look at the existing legacy blueprint, you will find out it's the same

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Got it, Lets link an example since they would need a secret Arn and a role Arn from my understanding (Using this as a reference https://github.com/opensearch-project/ml-commons/blob/main/docs/tutorials/aws/semantic_search_with_cohere_embedding_model.md#42-create-connector)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, third-party platforms would require both secret arn and role arn on managed service: https://docs.aws.amazon.com/opensearch-service/latest/developerguide/ml-external-connector.html#connector-external-create

"cohere_key": "<ENTER_COHERE_API_KEY_HERE>"
},
"parameters": {
"model": "<ENTER_MODEL_NAME_HERE>", // Choose a Model from the provided list above
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This part is tricky to communicate. If user copys and paste they might not realize this comment cant be parsed. Maybe on the top we can say something along the lines of; for ENTER_MODEL_NAME_HERE make sure to choose embed-english-v3.0 or embed-english-v2.0

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added embed-english-v3.0 as an example

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh is this in a future commit? I dont see it on my end 😅

POST /_plugins/_ml/models/<MODEL_ID_HERE>/_predict
{
"parameters": {
"images": [""]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh noticing the format is a bit different here? before we expressed only the base64 string but this one has the data URI scheme. I wonder does the model only work with the URI?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tried to use the same example from the legacy blueprint: https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/cohere_connector_image_embedding_blueprint.md would you suggest different format? I can try it out on my side

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep lets test the regular base64 string without the URI


## 2. Create connector for Amazon Bedrock:

If you are using self-managed Opensearch, you should supply AWS credentials:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thinking out loud, rather than doubling the number of blueprints based on AWS vs. self-managed, can we just persist one, and have a single call-out to the documentation describing how it is set up differently between AWS and self-managed? That should greatly reduce the amount of content.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ummm we already have AWS doc that talks about how to set up connectors on AWS. But I keep this just to be consist with legacy blueprints

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Right exactly, was thinking to have just a single blueprint and one callout on the different ways to handle just the credential field, just as a general improvement over the legacy blueprints.

Comment on lines 102 to 118
## 3. Create model group:

```json
POST /_plugins/_ml/model_groups/_register
{
"name": "remote_model_group_cohere",
"description": "model group for cohere models"
}
```

Sample response:
```json
{
"model_group_id": "IMobmY8B8aiZvtEZeO_i",
"status": "CREATED"
}
```
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can this be removed, now that there is default model groups and these steps aren't required?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

model group is not required. but it's recommended for users to better manager their models, I will add the optional to 3. Create model group(Optional):

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1. Its good to add this step as optional to reduce the steps.

@mingshl mingshl temporarily deployed to ml-commons-cicd-env March 15, 2025 20:57 — with GitHub Actions Inactive
@mingshl mingshl temporarily deployed to ml-commons-cicd-env March 15, 2025 20:57 — with GitHub Actions Inactive
@mingshl mingshl temporarily deployed to ml-commons-cicd-env March 15, 2025 20:57 — with GitHub Actions Inactive
@mingshl mingshl temporarily deployed to ml-commons-cicd-env March 15, 2025 20:57 — with GitHub Actions Inactive
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 15, 2025 21:59 — with GitHub Actions Failure
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 15, 2025 21:59 — with GitHub Actions Failure
Copy link
Contributor

@brianf-aws brianf-aws left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I left some comments for review, and once addressed, I believe everything will be set and ready to go!

# Model Blueprints for Vector Search

OpenSearch provides two approaches for implementing embedding models, depending on your needs:

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh I see so one is or ML inference processor and one is for connector (just using prediction)?

This is recommended for models to use the ML inference processor to handle input/output mapping.
Note that if using a model that requires pre and post processing functions, you must provide the functions in the blueprint. Please refer to legacy blueprint: [Cohere Embedding Connector Blueprint for image embedding mode](https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/cohere_connector_image_embedding_blueprint.md)

- embed-english-v3.0 1024
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So i looked up the models and saw that these are dimension sizes. This is pretty important info to communicate, when user wants to do semantic search.

Please see here
https://arc.net/l/quote/ekkxjzmv

"cohere_key": "<ENTER_COHERE_API_KEY_HERE>"
},
"parameters": {
"model": "<ENTER_MODEL_NAME_HERE>", // Choose a Model from the provided list above
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh is this in a future commit? I dont see it on my end 😅

POST /_plugins/_ml/models/<MODEL_ID_HERE>/_predict
{
"parameters": {
"images": [""]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep lets test the regular base64 string without the URI

}
```

## 2. Create a connector
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Got it, Lets link an example since they would need a secret Arn and a role Arn from my understanding (Using this as a reference https://github.com/opensearch-project/ml-commons/blob/main/docs/tutorials/aws/semantic_search_with_cohere_embedding_model.md#42-create-connector)

@mingshl
Copy link
Collaborator Author

mingshl commented Mar 17, 2025

address commented in the last commit, can you please review the change and approve? @ohltyler @nathaliellenaa @brianf-aws

Signed-off-by: Mingshi Liu <[email protected]>
@mingshl mingshl force-pushed the main-new-blueprints branch from cccf5f0 to 8703ec4 Compare March 17, 2025 18:35
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 17, 2025 18:36 — with GitHub Actions Error
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 17, 2025 18:36 — with GitHub Actions Failure
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 17, 2025 18:36 — with GitHub Actions Failure
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 17, 2025 18:36 — with GitHub Actions Error
@mingshl mingshl temporarily deployed to ml-commons-cicd-env March 17, 2025 19:57 — with GitHub Actions Inactive
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 17, 2025 19:57 — with GitHub Actions Error
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 17, 2025 19:57 — with GitHub Actions Failure
@mingshl mingshl temporarily deployed to ml-commons-cicd-env March 17, 2025 19:57 — with GitHub Actions Inactive
Copy link
Contributor

@brianf-aws brianf-aws left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for addressing my comments approved!

@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 17, 2025 21:00 — with GitHub Actions Failure
@mingshl mingshl had a problem deploying to ml-commons-cicd-env March 17, 2025 21:00 — with GitHub Actions Failure
Copy link
Contributor

@nathaliellenaa nathaliellenaa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks @mingshl!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants