Support opensearch-neural-sparse-encoding-multilingual-v1 as OpenSearch-provided pretrained models #458

zhichao-aws · 2025-02-25T07:10:15Z

Now that opensearch-neural-sparse-encoding-multilingual-v1 has been released on Hugging Face, we should also release it as an OpenSearch-provided pretrained model so that users can deploy it in OpenSearch clusters.

To release the model, we need to:

Add a sparse tokenizer auto-release pipeline.
Enhance the existing sparse model release pipeline to support model-side sparse vector pruning.
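
For readers unfamiliar with sparse vector pruning, here is a minimal sketch of the idea. The issue does not say which pruning rule the pipeline will use, so this is only an illustration of one possible rule (drop tokens whose weight falls below a fraction of the largest weight). The function name `prune_by_max_ratio` and the `ratio` parameter are hypothetical and do not come from opensearch-py-ml.

```python
def prune_by_max_ratio(sparse_vector: dict[str, float], ratio: float) -> dict[str, float]:
    """Keep only tokens whose weight is at least `ratio` times the largest weight.

    Illustrative only: the actual pruning rule used by the release
    pipeline is not specified in this issue.
    """
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must be between 0 and 1")
    if not sparse_vector:
        return {}
    # The threshold scales with the strongest token in this vector.
    threshold = max(sparse_vector.values()) * ratio
    return {token: weight for token, weight in sparse_vector.items() if weight >= threshold}


# Example: with ratio=0.1 the low-weight token "b" is dropped.
pruned = prune_by_max_ratio({"a": 1.0, "b": 0.05, "c": 0.5}, ratio=0.1)
```

Doing this on the model side means the encoder's output is already pruned, so the caller does not need a separate post-processing step.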

zhichao-aws · 2025-02-25T07:10:30Z

Please feel free to assign this issue to me. Thanks.

zhichao-aws · 2025-02-25T08:27:01Z

To support these features, we'll need new parameters for the workflow. However, GH workflows are limited to at most 10 arguments. To work around this, we will replace some optional fields with a new "custom_params" field and pass those optional params as a JSON string.
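
As a rough sketch of how a release script could consume such a field: the snippet below parses a JSON string into a dict and merges it over defaults. Only the "custom_params" name comes from this issue. The helper `parse_custom_params` and the keys `sparse_prune_ratio` and `activation` are hypothetical placeholders, not the actual workflow schema.

```python
import json

# Hypothetical optional parameters and their defaults (not the real schema).
DEFAULT_PARAMS = {
    "sparse_prune_ratio": 0.0,
    "activation": None,
}


def parse_custom_params(raw: str) -> dict:
    """Merge optional parameters from a JSON string over the defaults."""
    params = dict(DEFAULT_PARAMS)
    # An empty workflow input means "use the defaults".
    if not raw or not raw.strip():
        return params
    parsed = json.loads(raw)
    if not isinstance(parsed, dict):
        raise ValueError("custom_params must be a JSON object")
    # Reject unknown keys so that typos fail fast instead of being ignored.
    unknown = set(parsed) - set(DEFAULT_PARAMS)
    if unknown:
        raise ValueError(f"unknown custom_params keys: {sorted(unknown)}")
    params.update(parsed)
    return params


# Example: a single workflow input carries several optional settings.
params = parse_custom_params('{"sparse_prune_ratio": 0.1}')
```

Packing the optional settings into one string keeps the workflow under the argument limit while still allowing new options to be added later without touching the workflow inputs.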

zhichao-aws added the enhancement and untriaged labels Feb 25, 2025

zhichao-aws mentioned this issue Feb 25, 2025

Add sparse tokenizer release workflow; add model-side pruning for sparse encoding model #459

Open

5 tasks

dhrubo-os assigned zhichao-aws Feb 25, 2025

dhrubo-os removed the untriaged label Feb 25, 2025

dhrubo-os added this to Opensearch-py-ml projects Apr 8, 2025
