Binary Quantization

Binary quantization reduces the memory footprint of stored vectors by converting floating-point embeddings into compact binary representations. This approach maintains retrieval accuracy while significantly improving storage efficiency, making it suitable for resource-constrained environments and large-scale deployments.

To configure document store to use binary quantization, post a request to the `/api/kb/<storeName>/index` endpoint and specify `BINARY` as the value of the `type` property:

http request
 POST /api/kb/<storeName>/index

 {
    "type": "BINARY",
    "oversamplingFactor": 3
 }

The `oversamplingFactor` property in the payload above can be omitted, as it uses a default value. Its main purpose is to who you how you can override the default.