File size: 2,624 Bytes
b75393b
 
964e5bb
b75393b
 
 
 
 
 
 
 
964e5bb
 
 
 
 
 
b75393b
 
964e5bb
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
b8db62e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
---
title: MCPyLate
emoji: πŸ•
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 5.33.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: MCP server to perform search using PyLate
language:
- en
tags:
- ColBERT
- PyLate
- mcp-server-track
---

# MCPyLate

Multi-vector search has shown very strong performance compared to single dense vector search in numerous domain, including out-of-domain, [long-context](https://x.com/antoine_chaffin/status/1919396926736257521) and [reasoning-intensive](https://x.com/antoine_chaffin/status/1925555110521798925) retrieval.

They are thus particularly well suited for modern retrieval use cases, including agentic workflows.
[PyLate](https://lightonai.github.io/pylate/) is library built on top of sentence-transformers that allows to easily train and use multi-vector models.

This MCP server is a demonstration of the use of PyLate models alongside its index optimized for multi-vector search, PLAID.

This repository propose to search among the [leetcode split of the BRIGHT dataset](https://huggingface.co/datasets/xlangai/BRIGHT/viewer/documents/leetcode) using [Reason-ModernColBERT](https://huggingface.co/lightonai/Reason-ModernColBERT). This 150M parameters model outperforms 7B models on this reasoning-intensive retrieval benchmark, which requires reasoning to find relevant documents.
Although it can perform reasoning on its own, [it benefits from LLM reformulation](https://x.com/antoine_chaffin/status/1930625989643587598) and is thus particularly suited as a MCP to collaborate with an LLM.
Effectively, this demo allows to build a tool that can solve any leetcode problem by searching for problems that require similar algorithms and feed it to the LLM.

You can see it working in this example, where the agent find relevant leetcode solution, propose solutions with different complexity level as well as associated tests to validate them and even vizualisation tool.

https://youtu.be/Vvp2bPSfN68

The tool can easily be tweaked to index and search any dataset. For example, by indexing MS MARCO with [GTE-ModernColBERT](https://huggingface.co/lightonai/GTE-ModernColBERT-v1), this results in a Deep Research tool that can search among the web corpus and create reports based on successive queries.

https://youtu.be/WJx0XgYIlvA



If you want to use this repository as a remote MCP server, you can use this config:
```
{
    "mcpServers": {
      "MCPyLate": {
        "command": "npx",
        "args": [
          "mcp-remote",
          "https://agents-mcp-hackathon-mcpylate.hf.space/gradio_api/mcp/sse"
        ]
      }
    }
  }
```