Rerank Example¶

Document reranking example demonstrating various features of Lexilux Rerank API.

Lexilux supports two rerank modes:

OpenAI-compatible mode (mode="openai"): Standard rerank API format (default)
DashScope mode (mode="dashscope"): Alibaba Cloud DashScope rerank API

For a detailed comparison of both modes, see Rerank Modes Comparison.

Full Example¶

#!/usr/bin/env python
"""
21 Document Reranking - Sort by Relevance

Learn how to rerank documents based on their relevance to a query.
This is useful for search results and recommendation systems.

Level: Other APIs
"""

from config_loader import get_chat_config, parse_args

from lexilux import Rerank


def main():
    """Demonstrate document reranking."""
    args = parse_args()

    try:
        config = get_chat_config(config_path=args.config)
        print(
            "Note: Using chat config for rerank. "
            "Configure 'reranker' section for better results."
        )
    except (FileNotFoundError, KeyError) as e:
        print(f"Configuration error: {e}")
        print("\nUsing placeholder values. Please configure test_endpoints.json")
        config = {
            "base_url": "https://api.example.com/v1",
            "api_key": "your-api-key",
            "model": "rerank-model",
        }

    rerank = Rerank(**config)

    # Example 1: Basic reranking
    print("=" * 50)
    print("Example 1: Basic Reranking")
    print("=" * 50)

    query = "python programming tutorial"
    documents = [
        "Learn Python from scratch",
        "Python for data science",
        "Introduction to machine learning",
        "Python web development with Flask",
        "JavaScript basics for beginners",
    ]

    result = rerank(query, documents)

    print(f"Query: {query}\n")
    print("Reranked results:")
    for idx, score in result.results:
        print(f"  {idx}. [{score:.2f}] {documents[idx]}")
    print()

    # Example 2: Top-k filtering
    print("=" * 50)
    print("Example 2: Top-K Filtering (get top 3)")
    print("=" * 50)

    result = rerank(query, documents, top_k=3)

    print(f"Query: {query}\n")
    print("Top 3 results:")
    for idx, score in result.results:
        print(f"  {idx}. [{score:.2f}] {documents[idx]}")
    print()

    # Example 3: Include documents in results
    print("=" * 50)
    print("Example 3: Include Documents in Results")
    print("=" * 50)

    result = rerank(query, documents, top_k=3, include_docs=True)

    print(f"Query: {query}\n")
    print("Results with documents:")
    for idx, score, doc in result.results:
        print(f"  [{score:.2f}] {doc}")
    print()

    # Example 4: Search use case
    print("=" * 50)
    print("Example 4: Search Result Reranking")
    print("=" * 50)

    search_query = "how to install python"
    search_results = [
        "Download Python from python.org",
        "Python installation guide for Windows",
        "Best Python IDEs for development",
        "Install Python using package managers",
        "Python vs JavaScript comparison",
    ]

    result = rerank(search_query, search_results, include_docs=True)

    print(f"User searched for: {search_query}\n")
    print("Original order vs Reranked order:\n")

    print("Original:")
    for i, doc in enumerate(search_results):
        print(f"  {i + 1}. {doc}")

    print("\nReranked (most relevant first):")
    for rank, (idx, score, doc) in enumerate(result.results, 1):
        print(f"  {rank}. [{score:.2f}] {doc}")


if __name__ == "__main__":
    main()

Mode Selection¶

You can specify the mode when initializing the Rerank client:

# OpenAI-compatible mode
rerank = Rerank(
    base_url="https://api.example.com/v1",
    api_key="your-api-key",
    model="rerank-model",
    mode="openai"  # Use OpenAI-compatible format
)

# DashScope mode
rerank = Rerank(
    base_url="https://dashscope.aliyuncs.com/api/v1/services/rerank/text-rerank/text-rerank",
    api_key="your-api-key",
    model="qwen3-rerank",
    mode="dashscope"  # Use DashScope format
)

You can also override the mode for individual calls:

# Use OpenAI mode for this call only
result = rerank("query", docs, mode="openai")

Score Sorting Rules¶

Lexilux automatically handles different score formats returned by rerank APIs:

Positive Scores (e.g., 0.95, 0.80, 0.70):

Higher score = Better relevance
Sorted in descending order: 0.95 > 0.80 > 0.70

Negative Scores (e.g., -3.0, -4.0, -5.0):

Less negative = Better relevance
Sorted in descending order: -3.0 > -4.0 > -5.0
Note: -3.0 is mathematically greater than -4.0

The library automatically detects which format is used and applies the correct sorting order.

OpenAI-Compatible Mode¶

When using mode="openai", Lexilux uses the standard OpenAI-compatible rerank API format:

Request Format:

Endpoint: POST /rerank

Payload:

{
  "model": "rerank-model",
  "query": "search query",
  "documents": ["doc1", "doc2", "doc3"],
  "top_n": 3,
  "return_documents": true
}

Response Format:

Expected response:

{
  "results": [
    {
      "index": 0,
      "relevance_score": 0.95,
      "document": {"text": "doc1"}
    },
    {
      "index": 1,
      "relevance_score": 0.80,
      "document": {"text": "doc2"}
    }
  ],
  "usage": {"total_tokens": 100}
}

Key Differences: - Uses top_n instead of top_k - Uses return_documents instead of include_docs - Uses relevance_score instead of score - Document is wrapped in {"text": "..."} object

DashScope Mode¶

When using mode="dashscope", Lexilux uses the Alibaba Cloud DashScope rerank API format:

Request Format:

Endpoint: POST /text-rerank/text-rerank

Payload:

{
  "model": "qwen3-rerank",
  "input": {
    "query": "search query",
    "documents": ["doc1", "doc2", "doc3"]
  },
  "parameters": {
    "top_n": 3,
    "return_documents": true
  }
}

Response Format:

Expected response:

{
  "output": {
    "results": [
      {
        "index": 0,
        "relevance_score": 0.95,
        "document": {"text": "doc1"}
      }
    ]
  },
  "usage": {"total_tokens": 100}
}

Key Features: - Query and documents wrapped in input object - Additional parameters in parameters object - Results wrapped in output.results

Response Formats¶

Lexilux supports multiple response formats from rerank APIs:

Dictionary format with results:

{
  "results": [
    {"index": 0, "score": 0.95},
    {"index": 1, "score": 0.80}
  ]
}

Dictionary format with data:

{
  "data": [
    {"index": 0, "score": 0.95},
    {"index": 1, "score": 0.80}
  ]
}

Direct list format with document text:
```
[
  ["doc1", 0.95],
  ["doc2", 0.80]
]
```
Direct list format with index:
```
[
  [0, 0.95],
  [1, 0.80]
]
```

The library automatically detects and parses all these formats.