Cohere Rerank 3.5 (New)
The cohere.rerank.3-5 model takes a query and a list of texts and returns an ordered array in which each text is assigned a relevance score. The relevance score indicates how well each text matches the query, and the model ranks the texts by that score.
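The input and output shape described above can be illustrated with a small local sketch. Everything in it is hypothetical: `RankedText`, `toy_score`, and `rerank` are names invented for this example, and the word-overlap scorer is only a stand-in so the code runs offline. It is not how Cohere Rerank 3.5 computes relevance, and it does not call the service.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RankedText:
    index: int              # position of the text in the original input list
    text: str
    relevance_score: float  # higher means a closer match to the query


def toy_score(query: str, text: str) -> float:
    """Stand-in scorer: fraction of query words that also appear in the text.

    This is NOT the model's scoring method; it only makes the sketch runnable.
    """
    query_words = set(query.lower().split())
    text_words = set(text.lower().split())
    if not query_words:
        return 0.0
    return len(query_words & text_words) / len(query_words)


def rerank(query: str, texts: List[str]) -> List[RankedText]:
    """Score every text against the query and return them best match first."""
    scored = [RankedText(i, t, toy_score(query, t)) for i, t in enumerate(texts)]
    return sorted(scored, key=lambda r: r.relevance_score, reverse=True)


texts = [
    "Bananas are rich in potassium",
    "A dedicated AI cluster hosts the model endpoint",
    "Create an endpoint on a hosting cluster",
]
ranked = rerank("hosting cluster endpoint", texts)
for r in ranked:
    print(f"{r.relevance_score:.2f}  [{r.index}]  {r.text}")
```

Each result keeps the index of the text in the original list, so a caller can map the ranked output back to its own records.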
Available in These Regions
- Brazil East (Sao Paulo) (dedicated AI cluster only)
- Germany Central (Frankfurt) (dedicated AI cluster only)
- Japan Central (Osaka) (dedicated AI cluster only)
- UK South (London) (dedicated AI cluster only)
- US Midwest (Chicago) (dedicated AI cluster only)
Key Features
- Dedicated mode only.
- Not available on-demand or in the playground.
- Access the model that's hosted on a cluster through the API or an SDK.
- For dedicated mode, create an endpoint on a hosting dedicated AI cluster, host the model on the cluster, and then call the RerankText API or the equivalent SDK operation.
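Once an endpoint exists, a rerank call in dedicated mode needs to identify that endpoint along with the query and the texts. The following is a minimal sketch of assembling such a request body as a plain dictionary. The field names (`compartmentId`, `servingMode`, `servingType`, `endpointId`, `input`, `documents`, `topN`) are assumptions about the RerankText request shape and should be checked against the RerankText API documentation. `build_rerank_request` is a hypothetical helper, and the OCID strings are placeholders.

```python
import json
from typing import List, Optional


def build_rerank_request(
    compartment_id: str,
    endpoint_id: str,
    query: str,
    documents: List[str],
    top_n: Optional[int] = None,
) -> dict:
    """Assemble a request body for a dedicated-mode rerank call.

    Field names are assumptions; verify them against the RerankText API docs.
    """
    if not documents:
        raise ValueError("documents must contain at least one text")
    body = {
        "compartmentId": compartment_id,
        # Dedicated mode: point at the endpoint created on the hosting cluster.
        "servingMode": {"servingType": "DEDICATED", "endpointId": endpoint_id},
        "input": query,
        "documents": documents,
    }
    if top_n is not None:
        body["topN"] = top_n  # optionally limit how many ranked results return
    return body


request_body = build_rerank_request(
    compartment_id="<compartment_ocid>",  # placeholder
    endpoint_id="<endpoint_ocid>",        # placeholder
    query="How do I host a model?",
    documents=["Create an endpoint.", "Bananas are yellow."],
    top_n=1,
)
print(json.dumps(request_body, indent=2))
```

The sketch only builds the payload. Signing and sending the request is left to the SDK or an HTTP client configured for the tenancy.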
Dedicated AI Cluster for the Model
To reach a model through a dedicated AI cluster in any listed region, you must create an endpoint for that model on a dedicated AI cluster. For the cluster unit size that matches this model, see the following table.
Base Model | Fine-Tuning Cluster | Hosting Cluster | Pricing Page Information | Request Cluster Limit Increase |
---|---|---|---|---|
| Not available for fine-tuning | | | |
Tip
- If you don't have enough cluster limits in your tenancy for hosting the Cohere Rerank 3.5 model on a dedicated AI cluster, request the dedicated-unit-rerank-cohere-count limit to increase by 1.
- Review the Cohere Rerank 3.5 cluster performance benchmarks for different scenarios.
Release and Retirement Dates
Model | Release Date | On-Demand Retirement Date | Dedicated Mode Retirement Date |
---|---|---|---|
cohere.rerank.3-5 | 2025-05-14 | The on-demand option isn't available for this model. | At least 6 months after the release of the 1st replacement model. |
Rerank Model Parameters
For the Rerank model parameters, see the RerankText API documentation.