weaviate-sbert-vectorizer

CPU-only Sentence Transformers vectorizer (embedding) service for Weaviate.

Vectorization (also called embedding) is the process of converting human-readable data, such as text, into numerical representations (vectors) that AI systems can process. Vectors are stored in specialized vector databases that are optimized for fast, similarity-based information retrieval.
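
As a toy illustration of why vectors are useful for retrieval, the sketch below ranks two documents against a query by cosine similarity. The three-dimensional vectors and document labels are made up for the example; real embeddings from all-MiniLM-L6-v2 have 384 dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Made-up vectors standing in for real embeddings (illustration only).
query = [0.9, 0.1, 0.0]
documents = {
    "about cats": [0.8, 0.2, 0.1],
    "about taxes": [0.0, 0.1, 0.9],
}

# Rank documents by similarity to the query, most similar first.
ranked = sorted(
    documents,
    key=lambda name: cosine_similarity(query, documents[name]),
    reverse=True,
)
print(ranked)
```

A vector database performs this kind of comparison at scale, typically with an approximate index rather than an exhaustive scan.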

This service is designed for CPU vectorization on a single core using the ONNX Runtime. No GPU is required.

The built images contain only the minimum required dependencies, which makes them roughly 7 GB smaller than the images provided by Weaviate.

Usage

The images of this service are intended to be used with Weaviate's text2vec-transformers module:

services:
  weaviate:
    environment:
      ENABLE_MODULES: text2vec-transformers
      TRANSFORMERS_INFERENCE_API: http://wstv:8080
  wstv:
    image: ghcr.io/metabronx/weaviate-sbert-vectorizer:all-MiniLM-L6-v2_quint8_avx2
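
Weaviate talks to the vectorizer over HTTP, so the service can also be queried directly when debugging. The sketch below rests on two assumptions that this README does not state: that the service follows the same inference API as Weaviate's own transformers inference containers (a POST /vectors endpoint that accepts a JSON body with a text field and returns a JSON object with a vector field), and that the container's port 8080 is published on localhost. The helper names are hypothetical.

```python
import json
import urllib.request

# Assumption: the container's port 8080 is published on localhost.
BASE_URL = "http://localhost:8080"


def build_vectors_request(text, base_url=BASE_URL):
    """Build (but do not send) a POST request for the assumed /vectors endpoint."""
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/vectors",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(text, base_url=BASE_URL, timeout=10.0):
    """Send the request and return the embedding as a list of floats."""
    request = build_vectors_request(text, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    # Assumption: the response JSON carries the embedding under "vector".
    return payload["vector"]
```

With the service running, calling embed("hello world") should return a list of floats if those assumptions hold.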

Configuration

The embedding model is configurable via the MODEL_NAME and FILE_PATH environment variables, which are passed to the SentenceTransformer constructor as follows:

SentenceTransformer(
    MODEL_NAME,
    model_kwargs={"file_name": FILE_PATH},
    backend="onnx",
)

The default model is the 8-bit-quantized AVX2-optimized (quint8_avx2) variant of sentence-transformers/all-MiniLM-L6-v2.
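
To make the mapping concrete, here is a minimal sketch of how the two environment variables could be turned into constructor arguments. It is not taken from the service's source: the helper names load_settings and load_model are hypothetical, and the fallback file path onnx/model_quint8_avx2.onnx is an assumption inferred from the default variant's name.

```python
import os

# Assumed defaults, inferred from the default model described above.
DEFAULT_MODEL_NAME = "sentence-transformers/all-MiniLM-L6-v2"
DEFAULT_FILE_PATH = "onnx/model_quint8_avx2.onnx"  # assumption: not stated in this README


def load_settings(environ=os.environ):
    """Return the model name and keyword arguments for SentenceTransformer."""
    model_name = environ.get("MODEL_NAME", DEFAULT_MODEL_NAME)
    file_path = environ.get("FILE_PATH", DEFAULT_FILE_PATH)
    kwargs = {"model_kwargs": {"file_name": file_path}, "backend": "onnx"}
    return model_name, kwargs


def load_model(environ=os.environ):
    """Construct the model; needs sentence-transformers with ONNX support installed."""
    from sentence_transformers import SentenceTransformer  # imported lazily

    model_name, kwargs = load_settings(environ)
    return SentenceTransformer(model_name, **kwargs)
```

Keeping the settings lookup separate from model construction lets the mapping be checked without downloading a model.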
