ONNX Embedding Model

This repo houses chromadb's code for converting the "sentence-transformers/all-MiniLM-L6-v2" model to ONNX, as well as reference code showing how to run it.

The model is stored on S3, and chromadb fetches and caches it from there.

We do this because sentence-transformers pulls in many transitive dependencies that we don't want to install alongside chromadb, and some of them don't work on newer Python versions.

NOTE: We do not plan to support more than one default model this way in the near future. If you want to use other models, use chromadb's other embedding functions, which depend on libraries like sentence-transformers.
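
As a rough illustration of the fetch-and-cache behaviour mentioned above, the sketch below downloads a file once and reuses the local copy afterwards. It is not chromadb's actual implementation: the function name `fetch_cached`, the cache layout, and the optional checksum argument are all assumptions made for this example, and it uses only the Python standard library.

```python
import hashlib
import shutil
import urllib.request
from pathlib import Path
from typing import Optional


def fetch_cached(url: str, cache_dir: Path, expected_sha256: Optional[str] = None) -> Path:
    """Return a local path for `url`, downloading only if it is not cached yet.

    Hypothetical helper for illustration; not part of chromadb.
    """
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Assumed convention: name the cached file after the last URL path segment.
    target = cache_dir / url.rsplit("/", 1)[-1]

    if not target.exists():
        # Download to a temporary name first so an interrupted download
        # never leaves a truncated file at the final location.
        partial = target.with_name(target.name + ".part")
        with urllib.request.urlopen(url) as response, open(partial, "wb") as out:
            shutil.copyfileobj(response, out)
        partial.replace(target)

    if expected_sha256 is not None:
        digest = hashlib.sha256(target.read_bytes()).hexdigest()
        if digest != expected_sha256:
            # Drop the bad file so the next call downloads it again.
            target.unlink()
            raise ValueError(f"checksum mismatch for {target.name}")

    return target
```

A second call with the same URL and cache directory returns the cached path without touching the network.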

Running the example model

pip install -r requirements.txt

and then

python run_onnx.py

The requirements in requirements.txt are the minimum requirements to run the model.
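
run_onnx.py is not reproduced here. For orientation, the sentence-transformers version of all-MiniLM-L6-v2 turns per-token outputs into a sentence embedding by mean pooling over the attention mask and then L2-normalizing the result. The sketch below shows only that post-processing step, under the assumption that the ONNX model returns one vector per token. The function names are hypothetical, and plain Python lists are used to keep the example dependency-free; real code would more likely use NumPy arrays.

```python
import math
from typing import List, Sequence


def mean_pool(token_embeddings: Sequence[Sequence[float]],
              attention_mask: Sequence[int]) -> List[float]:
    """Average the token vectors whose attention-mask entry is non-zero."""
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:  # padding tokens have mask 0 and are skipped
            count += 1
            for i, value in enumerate(vector):
                total[i] += value
    if count == 0:
        raise ValueError("attention mask selects no tokens")
    return [t / count for t in total]


def l2_normalize(vector: Sequence[float]) -> List[float]:
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(v * v for v in vector))
    return [v / norm for v in vector] if norm > 0 else list(vector)


# Toy input: three 2-dimensional token vectors, the last one being padding.
tokens = [[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]
mask = [1, 1, 0]
sentence_embedding = l2_normalize(mean_pool(tokens, mask))
```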

Generating the model

pip install -r requirements-dev.txt

and then

python create_onnx.py

Validating the model implementation is correct

pip install -r requirements-dev.txt

and then

python compare_onnx.py

This compares the output of the ONNX model with that of the sentence-transformers model by evaluating the GLUE STS-B benchmark and by checking the cosine similarity of the embeddings for the dataset.
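
The cosine-similarity part of such a comparison can be sketched as follows. This is not the code in compare_onnx.py: the function names are made up for this example, and the choice of reporting the mean and the minimum is an assumption. The idea is that if the ONNX model reproduces the reference model, the paired embeddings for each sentence should have a cosine similarity very close to 1.

```python
import math
from typing import Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def summarize_agreement(reference: Sequence[Sequence[float]],
                        candidate: Sequence[Sequence[float]]) -> Tuple[float, float]:
    """Return (mean, minimum) cosine similarity over paired embeddings."""
    if len(reference) != len(candidate) or not reference:
        raise ValueError("need two non-empty lists of equal length")
    sims = [cosine_similarity(r, c) for r, c in zip(reference, candidate)]
    return sum(sims) / len(sims), min(sims)


# Toy data standing in for embeddings from the two models.
reference_embeddings = [[0.10, 0.20, 0.30], [0.50, -0.10, 0.00]]
onnx_embeddings = [[0.10, 0.20, 0.31], [0.50, -0.10, 0.01]]
mean_sim, min_sim = summarize_agreement(reference_embeddings, onnx_embeddings)
```

The minimum is worth watching alongside the mean, since a single badly mismatched sentence can hide behind a high average.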
