Description of changes
Summarize the changes made by this PR.
This PR adds support for data-parallel embedding computation in the SentenceTransformerEmbeddingFunction. Fixes #6903. This brings it to parity with other local embedding functions (such as FastEmbed) and allows users to fully utilize multi-core CPUs and multi-GPU setups.
Improvements & Bug fixes
N/A
New functionality
batch_size support: Users can now specify a custom batch_size for local encoding. It defaults to None, which falls back to the sentence-transformers internal default (32).
Multi-process pool support: Introduced the multiprocess_devices parameter. When provided (e.g., ["cpu", "cpu"] or ["cuda:0", "cuda:1"]), a persistent multi-process pool is initialized and used for all embedding calls.
Automatic Cleanup: Implemented __del__ to ensure the multi-process pool is properly stopped when the embedding function instance is garbage collected.
Schema Integration: Updated the sentence_transformer JSON schema to include the new parameters as nullable fields, ensuring configuration persistence and validation work correctly.
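The parameter handling described above can be sketched roughly as follows. This is a simplified, self-contained illustration of the pattern, not the actual Chroma implementation; the class name and the dict standing in for the process pool are assumptions made for the example:

```python
from typing import List, Optional


class ParallelEmbeddingFunction:
    """Sketch of batch-size and multi-process pool handling."""

    def __init__(
        self,
        batch_size: Optional[int] = None,
        multiprocess_devices: Optional[List[str]] = None,
    ):
        self.batch_size = batch_size
        self.multiprocess_devices = multiprocess_devices
        self._pool = None
        if multiprocess_devices is not None:
            # The real implementation would start a persistent pool here,
            # e.g. via SentenceTransformer.start_multi_process_pool(...);
            # a plain dict stands in for it in this sketch.
            self._pool = {"devices": multiprocess_devices, "stopped": False}

    def _encode_kwargs(self) -> dict:
        # Only pass batch_size through when the user set it explicitly,
        # so the sentence-transformers default (32) applies otherwise.
        kwargs = {}
        if self.batch_size is not None:
            kwargs["batch_size"] = self.batch_size
        if self._pool is not None:
            kwargs["pool"] = self._pool
        return kwargs

    def __del__(self):
        # Stop the persistent pool when the instance is garbage collected;
        # the real code would call stop_multi_process_pool on the model.
        if self._pool is not None and not self._pool["stopped"]:
            self._pool["stopped"] = True
```

The key design point is that both parameters default to `None`, so an instance constructed without them behaves exactly as before.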
Test plan
How are these changes tested?
Tests pass locally with pytest for Python.
Created chromadb/test/ef/test_sentence_transformer_ef.py with 7 new unit tests covering initialization, execution, resource cleanup, and configuration roundtrips.
Updated chromadb/test/utils/test_embedding_function_schemas.py to verify that the updated schema correctly validates the new fields (including null values).
Migration plan
Are there any migrations, or any forwards/backwards compatibility changes needed in order to make sure this change deploys reliably?
No migrations are required. The changes are fully backwards compatible:
Existing collections without batch_size or multiprocess_devices in their config will default to None, maintaining the previous behavior (single-process, library-default batch size).
The JSON schema updates include null in the permitted types for the new fields to handle existing deployments.
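To illustrate the nullable-field handling, a fragment of the updated schema might look like the following. This is a hypothetical fragment modeled on JSON Schema conventions, not the exact Chroma schema:

```python
import json

# Hypothetical fragment of the sentence_transformer schema: the new fields
# list "null" among their permitted types, so configs from existing
# deployments that store null for these keys still validate.
SCHEMA_FRAGMENT = json.loads("""
{
  "properties": {
    "batch_size": { "type": ["integer", "null"] },
    "multiprocess_devices": {
      "type": ["array", "null"],
      "items": { "type": "string" }
    }
  }
}
""")


def allows_null(schema: dict, field: str) -> bool:
    """Check that a field's permitted types include null."""
    return "null" in schema["properties"][field]["type"]
```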
Observability plan
What is the plan to instrument and monitor this change?
Standard Python logging is used. No additional observability instrumentation is required for this local compute change.
Documentation Changes
Are all docstrings for user-facing APIs updated if required? Do we need to make documentation changes in the [docs section]?
For the [docs section], a small update to the Embedding Functions page would be beneficial to highlight the new parallelization capabilities.
Add parallel and batched encoding support to SentenceTransformerEmbeddingFunction
This PR introduces new functionality to SentenceTransformerEmbeddingFunction by adding configurable batch_size and persistent multi-process encoding via multiprocess_devices. The implementation initializes a shared sentence-transformers process pool when devices are provided, passes pool/batch options into encode, and includes cleanup logic in __del__ to stop the pool.
It also updates config persistence and validation paths so these new options round-trip correctly through get_config/build_from_config and JSON schema validation. Test coverage is expanded with a new focused test file for initialization, encode argument behavior, pool usage/cleanup, and config roundtrip, plus a schema test validating nullable fields.
This summary was automatically generated by @propel-code-bot