NVIDIA Rerank
The NVIDIA Rerank component lets you improve the relevance of search results in your Nappai dashboard. By connecting it to a retriever that pulls documents from your data store, you can send those documents to NVIDIA’s reranking API, which reorders them so the most useful ones appear first.
How it Works
- Connect a Retriever – First, link a retriever component that knows how to fetch documents from your data source (for example, a vector store or a database).
- Set the API Key – Enter your NVIDIA API key. This key authenticates your requests to NVIDIA’s cloud service.
- Choose the Base URL – The default URL (
https://integrate.api.nvidia.com/v1
) points to NVIDIA’s integration endpoint. You can change it if you use a different environment. - Select a Model – Pick the reranking model from the dropdown. The current option is
nv-rerank-qa-mistral-4b:1
. - Enter a Search Query – Type the query you want to use to search your documents.
- Rerank – When you run the component, it sends the retrieved documents and the query to NVIDIA’s API. The API returns the same documents reordered by relevance.
- Outputs – The component gives you two outputs:
- A new retriever that already includes the reranking step.
- The list of search results, now sorted by relevance.
Inputs
- Retriever: Connect a retriever component that fetches documents from your data source.
- API Key: Enter the secret key that authorizes your requests to NVIDIA’s API.
- Base URL: The base URL of the NVIDIA API. Defaults to
https://integrate.api.nvidia.com/v1
. - Model: Choose the reranking model to use.
- Search Query: Type the text query you want to search for.
Outputs
- Retriever: A retriever that automatically reranks documents using NVIDIA’s API. You can feed this output into other components that expect a retriever.
- Search Results: A list of documents returned by the reranker, ordered from most to least relevant. These can be displayed in a table, sent to a downstream processor, or used to trigger actions.
Usage Example
- Add the NVIDIA Rerank component to your workflow.
- Connect a retriever (e.g., a vector store retriever) to the Retriever input.
- Paste your NVIDIA API key into the API Key field.
- Leave the Base URL as the default unless you have a custom endpoint.
- Select the model (
nv-rerank-qa-mistral-4b:1
). - Enter a search query such as “customer support tickets for billing issues”.
- Run the workflow. The component will return the reranked documents in the Search Results output, which you can then display in a dashboard table or pass to another component for further processing.
Related Components
- ContextualCompressionRetriever – A retriever that compresses context before searching.
- VectorStore – Stores embeddings of documents for fast similarity search.
- Retriever – Generic interface for components that fetch documents.
Tips and Best Practices
- Keep your API Key secure; store it in Nappai’s secret manager instead of hard‑coding it.
- Use concise, specific queries to get the best reranking results.
- If you have many documents, consider limiting the number of results before reranking to reduce API usage.
- Test with a small set of documents first to verify that the reranking behaves as expected.
Security Considerations
- The API Key is a sensitive credential. Store it in a protected secret vault and never expose it in logs or UI.
- The Base URL should only be changed if you are sure the endpoint is trustworthy.
- Ensure that any data sent to NVIDIA’s API complies with your organization’s data‑privacy policies.