Use Reranking for Better RAG Results
Reranking models are primarily used to reassess and reorder an initial set of search results. This helps to improve the relevance and quality of search results in both similarity search and Retrieval Augmented Generation (RAG) scenarios.
In a RAG scenario, reranking plays a crucial role in improving the quality of information ingested into an LLM by ensuring that the most relevant documents or chunks are prioritized. This can reduce hallucinations and improve the accuracy of generated outputs. The reranking step is typically performed after an initial search that uses a faster but less precise embedding model. The reranker helps to identify the most pertinent information for a given query, but is more expensive in terms of resources because it often employs sophisticated matching methods that go beyond simple vector comparisons.
Here, you can use the RERANK
function from either the DBMS_VECTOR
or the DBMS_VECTOR_CHAIN
package, depending on your use case.
Oracle AI Vector Search supports reranking models provided by Cohere and Vertex AI.
WARNING:
Certain features of the database may allow you to access services offered separately by third-parties, for example, through the use of JSON specifications that facilitate your access to REST APIs.
Your use of these features is solely at your own risk, and you are solely responsible for complying with any terms and conditions related to use of any such third-party services. Notwithstanding any other terms and conditions related to the third-party services, your use of such database features constitutes your acceptance of that risk and express exclusion of Oracle's responsibility or liability for any damages resulting from such access.
To rerank results for the query "What are some interesting characteristics of the Jovian satellites?
", using a third-party reranking model:
Using the above Cohere model, the reranked results appear as follows:
[
{
"index" : "0",
"score" : "0.059319142",
"content" : "Jupiter boasts an impressive system of 95 known moons, including the four largest Galilean satellites."
},
{
"index" : "2",
"score" : "0.04352814",
"content" : "Io, one of Jupiter's Galilean moons, is the most volcanically active body in our solar system."
},
{
"index" : "4",
"score" : "0.04138472",
"content" : "Jupiter's composition is similar to that of the Sun, and it could have become a brown dwarf if its mass had been 80 times greater."
}
]
Related Topics
Parent topic: Use Retrieval Augmented Generation to Complement LLMs