-
Notifications
You must be signed in to change notification settings - Fork 152
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add standard blueprint for vector search #3659
Conversation
Signed-off-by: Mingshi Liu <[email protected]>
a27e123
to
e0ef6e7
Compare
...rence_blueprints/standard_blueprints/bedrock_connector_titan_embedding_standard_blueprint.md
Outdated
Show resolved
Hide resolved
...erence_blueprints/standard_blueprints/cohere_connector_image_embedding_standard_blueprint.md
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left some minor comments, this is really great info to share!
# Model Blueprints for Vector Search | ||
|
||
OpenSearch provides two approaches for implementing embedding models, depending on your needs: | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Standard Blueprints | |
- Legacy Blueprints |
This way they can follow what is the two approaches I get what you mean but I had to think a bit before I understood the idea.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
you got it the other way around. the new blueprint without pre and post processing function is called Standard blueprint now. The earlier blueprint which has pre and post processing functions are called legacy blueprint
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh I see so one is or ML inference processor and one is for connector (just using prediction)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no, both are using connectors. One is without pre and post processing functions, but the other has pre or post processing functions
...s/standard_blueprints/bedrock_connector_cohere_cohere.embed-english-v3_standard_blueprint.md
Outdated
Show resolved
Hide resolved
...ndard_blueprints/bedrock_connector_cohere_cohere.embed-multilingual-v3_standard_blueprint.md
Show resolved
Hide resolved
...rence_blueprints/standard_blueprints/bedrock_connector_titan_embedding_standard_blueprint.md
Outdated
Show resolved
Hide resolved
...erence_blueprints/standard_blueprints/cohere_connector_image_embedding_standard_blueprint.md
Outdated
Show resolved
Hide resolved
This is recommended for models to use the ML inference processor to handle input/output mapping. | ||
Note that if using a model that requires pre and post processing functions, you must provide the functions in the blueprint. Please refer to legacy blueprint: [Cohere Embedding Connector Blueprint for image embedding mode](https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/cohere_connector_image_embedding_blueprint.md) | ||
|
||
- embed-english-v3.0 1024 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lets be careful user might think the model name is embed-english-v3.0 1024
and not embed-english-v3.0
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
1024 is the token length I think
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So i looked up the models and saw that these are dimension sizes. This is pretty important info to communicate, when user wants to do semantic search.
Please see here
https://arc.net/l/quote/ekkxjzmv
} | ||
``` | ||
|
||
## 2. Create a connector |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Lets create a connector for managed too
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
cohere is not AWS service, user can use the same connector in open source and also on AWS, if you look at the existing legacy blueprint, you will find out it's the same
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Got it, Lets link an example since they would need a secret Arn and a role Arn from my understanding (Using this as a reference https://github.com/opensearch-project/ml-commons/blob/main/docs/tutorials/aws/semantic_search_with_cohere_embedding_model.md#42-create-connector)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, third-party platforms would require both secret arn and role arn on managed service: https://docs.aws.amazon.com/opensearch-service/latest/developerguide/ml-external-connector.html#connector-external-create
"cohere_key": "<ENTER_COHERE_API_KEY_HERE>" | ||
}, | ||
"parameters": { | ||
"model": "<ENTER_MODEL_NAME_HERE>", // Choose a Model from the provided list above |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This part is tricky to communicate. If user copys and paste they might not realize this comment cant be parsed. Maybe on the top we can say something along the lines of; for ENTER_MODEL_NAME_HERE make sure to choose embed-english-v3.0 or embed-english-v2.0
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
added embed-english-v3.0
as an example
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh is this in a future commit? I dont see it on my end 😅
POST /_plugins/_ml/models/<MODEL_ID_HERE>/_predict | ||
{ | ||
"parameters": { | ||
"images": [""] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh noticing the format is a bit different here? before we expressed only the base64 string but this one has the data URI scheme. I wonder does the model only work with the URI?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I tried to use the same example from the legacy blueprint: https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/cohere_connector_image_embedding_blueprint.md would you suggest different format? I can try it out on my side
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep lets test the regular base64 string without the URI
...s/standard_blueprints/bedrock_connector_cohere_cohere.embed-english-v3_standard_blueprint.md
Outdated
Show resolved
Hide resolved
|
||
## 2. Create connector for Amazon Bedrock: | ||
|
||
If you are using self-managed Opensearch, you should supply AWS credentials: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thinking out loud, rather than doubling the number of blueprints based on AWS vs. self-managed, can we just persist one, and have a single call-out to the documentation describing how it is set up differently between AWS and self-managed? That should greatly reduce the amount of content.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ummm we already have AWS doc that talks about how to set up connectors on AWS. But I keep this just to be consist with legacy blueprints
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Right exactly, was thinking to have just a single blueprint and one callout on the different ways to handle just the credential field, just as a general improvement over the legacy blueprints.
## 3. Create model group: | ||
|
||
```json | ||
POST /_plugins/_ml/model_groups/_register | ||
{ | ||
"name": "remote_model_group_cohere", | ||
"description": "model group for cohere models" | ||
} | ||
``` | ||
|
||
Sample response: | ||
```json | ||
{ | ||
"model_group_id": "IMobmY8B8aiZvtEZeO_i", | ||
"status": "CREATED" | ||
} | ||
``` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can this be removed, now that there is default model groups and these steps aren't required?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
model group is not required. but it's recommended for users to better manager their models, I will add the optional to 3. Create model group(Optional):
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1. Its good to add this step as optional to reduce the steps.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I left some comments for review, and once addressed, I believe everything will be set and ready to go!
# Model Blueprints for Vector Search | ||
|
||
OpenSearch provides two approaches for implementing embedding models, depending on your needs: | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh I see so one is or ML inference processor and one is for connector (just using prediction)?
...s/standard_blueprints/bedrock_connector_cohere_cohere.embed-english-v3_standard_blueprint.md
Outdated
Show resolved
Hide resolved
This is recommended for models to use the ML inference processor to handle input/output mapping. | ||
Note that if using a model that requires pre and post processing functions, you must provide the functions in the blueprint. Please refer to legacy blueprint: [Cohere Embedding Connector Blueprint for image embedding mode](https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/cohere_connector_image_embedding_blueprint.md) | ||
|
||
- embed-english-v3.0 1024 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So i looked up the models and saw that these are dimension sizes. This is pretty important info to communicate, when user wants to do semantic search.
Please see here
https://arc.net/l/quote/ekkxjzmv
"cohere_key": "<ENTER_COHERE_API_KEY_HERE>" | ||
}, | ||
"parameters": { | ||
"model": "<ENTER_MODEL_NAME_HERE>", // Choose a Model from the provided list above |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh is this in a future commit? I dont see it on my end 😅
POST /_plugins/_ml/models/<MODEL_ID_HERE>/_predict | ||
{ | ||
"parameters": { | ||
"images": [""] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep lets test the regular base64 string without the URI
} | ||
``` | ||
|
||
## 2. Create a connector |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Got it, Lets link an example since they would need a secret Arn and a role Arn from my understanding (Using this as a reference https://github.com/opensearch-project/ml-commons/blob/main/docs/tutorials/aws/semantic_search_with_cohere_embedding_model.md#42-create-connector)
address commented in the last commit, can you please review the change and approve? @ohltyler @nathaliellenaa @brianf-aws |
Signed-off-by: Mingshi Liu <[email protected]>
cccf5f0
to
8703ec4
Compare
Signed-off-by: Mingshi Liu <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for addressing my comments approved!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks @mingshl!
Description
Earlier, we have the these blueprint for embedding model which has pre and post processing function to format the model input and output format, so that it can be use for neural search query.
But now that we have ml inference processor that allow users to map the model input and output format, then user can register connector and model without pre and post processing function, We want to provide a new set of blueprint for embedding model with no pre and post processing function.
Related Issues
#3619
Check List
--signoff
.By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.