Conversation
Lossy quantization for vector data (e.g., embeddings) based on TurboQuant (https://arxiv.org/abs/2504.19874). Supports both MSE-optimal and inner-product-optimal (Prod with QJL correction) variants at 1-8 bits per coordinate.

Key components:
- Single TurboQuant array encoding with optional QJL correction fields, storing quantized codes, norms, centroids, and rotation signs as children.
- Structured Random Hadamard Transform (SRHT) for O(d log d) rotation, fully self-contained with no external linear algebra library.
- Max-Lloyd centroid computation on the Beta(d/2, d/2) distribution.
- Approximate cosine similarity and dot product computed directly on quantized arrays without full decompression.
- Pluggable TurboQuantScheme for BtrBlocks, exposed via WriteStrategyBuilder::with_vector_quantization().
- Benchmarks covering common embedding dimensions (128, 768, 1024, 1536).

Also refactors CompressingStrategy to a single constructor, and adds vortex_tensor::initialize() for session registration of tensor types, encodings, and scalar functions.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Will Manning <will@willmanning.io>
Signed-off-by: Connor Tsui <connor.tsui20@gmail.com>
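The SRHT rotation mentioned above can be sketched in a few lines: flip each coordinate's sign with a fixed random pattern (the "rotation signs" stored as a child array), then apply the fast Walsh-Hadamard transform in O(d log d) with no linear-algebra dependency. This is an illustrative, self-contained sketch; the function names are hypothetical and this is not the actual Vortex implementation.

```rust
/// In-place fast Walsh-Hadamard transform; `v.len()` must be a power of two.
fn fwht(v: &mut [f32]) {
    let n = v.len();
    assert!(n.is_power_of_two());
    let mut h = 1;
    while h < n {
        for chunk in v.chunks_mut(2 * h) {
            for i in 0..h {
                let (a, b) = (chunk[i], chunk[i + h]);
                chunk[i] = a + b;
                chunk[i + h] = a - b;
            }
        }
        h *= 2;
    }
}

/// Apply the randomized rotation: a diagonal of random signs, then the
/// Hadamard transform, scaled so the overall transform is orthonormal
/// (and therefore norm- and inner-product-preserving).
fn srht(v: &mut [f32], signs: &[bool]) {
    for (x, &flip) in v.iter_mut().zip(signs) {
        if flip {
            *x = -*x;
        }
    }
    fwht(v);
    let scale = 1.0 / (v.len() as f32).sqrt();
    for x in v.iter_mut() {
        *x *= scale;
    }
}

fn main() {
    let mut v = vec![1.0f32, 2.0, 3.0, 4.0];
    let signs = vec![false, true, false, true];
    let norm_before: f32 = v.iter().map(|x| x * x).sum::<f32>().sqrt();
    srht(&mut v, &signs);
    let norm_after: f32 = v.iter().map(|x| x * x).sum::<f32>().sqrt();
    // The rotation preserves the Euclidean norm.
    println!("{:.4} {:.4}", norm_before, norm_after);
}
```

Because the transform is orthonormal, per-coordinate quantization error after rotation translates directly into bounded error on norms and inner products of the original vectors.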
We are going to implement this later as a separate encoding, if we decide to implement it at all; word on the street is that MSE + QJL is not actually better than MSE on its own.

Signed-off-by: Connor Tsui <connor.tsui20@gmail.com>
It doesn't really make a lot of sense for us to define this as an encoding for `FixedSizeList`.

Signed-off-by: Connor Tsui <connor.tsui20@gmail.com>
- Use ExecutionCtx in the TurboQuant compress path and import ExecutionCtx
- Extend dtype imports with Nullability and PType to support extension types
- Wire in extension utilities extension_element_ptype and extension_list_size for vector extensions
- Remove dimension and bit_width from slice/take compute calls to rely on metadata
- Update the TurboQuant mod docs to mention VortexSessionExecute
- Change scheme.compress to use the provided compressor argument (not _compressor)
- Add an extensive TurboQuant test suite (roundtrip, MSE bounds, edge cases, f64 input, serde roundtrip, and dtype checks)
- Align vtable imports to the new metadata handling (remove unused DeserializeMetadata/SerializeMetadata references)

Signed-off-by: Connor Tsui <connor.tsui20@gmail.com>
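The kind of roundtrip MSE bound the new test suite asserts can be illustrated with a toy b-bit uniform scalar quantizer standing in for the real encoding. Everything here is a hypothetical stand-in, not the actual TurboQuant quantizer or its test code; it only shows the shape of a "quantize, dequantize, check the error bound" test.

```rust
/// Toy b-bit uniform quantizer over [-1, 1] (stand-in for the real encoding).
fn quantize(v: &[f32], bits: u32) -> Vec<u8> {
    let levels = (1u32 << bits) as f32;
    v.iter()
        .map(|x| {
            // Map [-1, 1] onto [0, levels) and clamp to valid codes.
            let q = ((x + 1.0) / 2.0 * levels).floor();
            q.clamp(0.0, levels - 1.0) as u8
        })
        .collect()
}

/// Reconstruct each code as the midpoint of its quantization cell.
fn dequantize(codes: &[u8], bits: u32) -> Vec<f32> {
    let levels = (1u32 << bits) as f32;
    codes
        .iter()
        .map(|&c| (c as f32 + 0.5) / levels * 2.0 - 1.0)
        .collect()
}

fn main() {
    let bits = 4u32;
    let v: Vec<f32> = (0..128).map(|i| (i as f32 * 0.37).sin()).collect();
    let back = dequantize(&quantize(&v, bits), bits);
    let mse: f32 = v
        .iter()
        .zip(&back)
        .map(|(x, y)| (x - y) * (x - y))
        .sum::<f32>()
        / v.len() as f32;
    // With step = 2 / 2^bits, midpoint reconstruction keeps every
    // per-coordinate error at most step / 2, so MSE <= step^2 / 4.
    let step = 2.0 / (1u32 << bits) as f32;
    assert!(mse <= step * step / 4.0);
    println!("mse = {:.6}, bound = {:.6}", mse, step * step / 4.0);
}
```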
Merging this PR will degrade performance by 29.05%
I would say this is ready for review now, but only with respect to the structure. I still need to go through the implementation and make sure everything makes sense, but the structure is sound and we correctly handle the different floating-point input types as well as null vectors. It would be good to get a review now; maybe we should just merge this and iterate later.
I'll rebase and fix the errors tomorrow.
Continuation of #7167, authored by @lwwmanning
Summary
Lossy quantization for vector data (e.g., embeddings) based on TurboQuant (https://arxiv.org/abs/2504.19874). Supports both MSE-optimal and inner-product-optimal (Prod with QJL correction) variants at 1-8 bits per coordinate.
Key components:
- Single TurboQuant array encoding with optional QJL correction fields, storing quantized codes, norms, centroids, and rotation signs as children.
- Structured Random Hadamard Transform (SRHT) for O(d log d) rotation, fully self-contained with no external linear algebra library.
- Max-Lloyd centroid computation on the Beta(d/2, d/2) distribution.
- Approximate cosine similarity and dot product computed directly on quantized arrays without full decompression.
- Pluggable TurboQuantScheme for BtrBlocks, exposed via WriteStrategyBuilder::with_vector_quantization().
- Benchmarks covering common embedding dimensions (128, 768, 1024, 1536).
Also refactors CompressingStrategy to a single constructor, and adds vortex_tensor::initialize() for session registration of tensor types, encodings, and scalar functions.
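The "compute directly on quantized arrays" idea above can be sketched as follows: each vector stores its original norm plus one small code per rotated, unit-normalized coordinate, and the dot product is recovered by accumulating centroid products and rescaling by the norms. The names, the 2-bit codebook values, and the function signature are all hypothetical, chosen for illustration; this is not the actual Vortex compute kernel.

```rust
/// Approximate dot product straight from quantized codes, with no full
/// decompression. `centroids` maps each code back to its representative
/// value for the rotated, unit-normalized coordinates.
fn approx_dot(
    codes_a: &[u8],
    norm_a: f32,
    codes_b: &[u8],
    norm_b: f32,
    centroids: &[f32],
) -> f32 {
    // The SRHT rotation is orthonormal, so inner products of the rotated
    // unit vectors approximate the cosine similarity of the originals;
    // rescaling by the stored norms recovers the dot product.
    let unit_dot: f32 = codes_a
        .iter()
        .zip(codes_b)
        .map(|(&a, &b)| centroids[a as usize] * centroids[b as usize])
        .sum();
    norm_a * norm_b * unit_dot
}

fn main() {
    // Toy 2-bit codebook (hypothetical values) and two 4-d code vectors.
    let centroids = [-0.75f32, -0.25, 0.25, 0.75];
    let a = [0u8, 3, 1, 2];
    let b = [0u8, 3, 2, 1];
    let dot = approx_dot(&a, 2.0, &b, 3.0, &centroids);
    println!("{:.4}", dot);
}
```

Dropping the `norm_a * norm_b` rescaling yields the approximate cosine similarity directly, which is why both operations can share one pass over the codes.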
API Changes
Adds a new `TurboQuant` encoding + some other things. TODO

Testing

TODO