MCP Protocol Reference

CiteKit implements the Model Context Protocol (MCP) to allow AI agents to browse, understand, and extract multimodal evidence directly.

Tools

The CiteKit MCP server exposes the following tools to the agent:

listResources

Returns a list of all resource IDs currently available in the local CiteKit index.

getStructure

Retrieves the full semantic map (JSON) of a specific resource.

Input: {"resource_id": "string"}
Output: The full ResourceMap schema.
Agent Usage: Agents call this early, typically right after listing resources, to "see" what is inside a file without downloading it.
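
At the wire level, an MCP client invokes a tool by sending a JSON-RPC 2.0 `tools/call` request. The sketch below shows roughly what that message could look like for the structure lookup, assuming the tool is registered under the name getStructure (the name used in the workflow on this page). `build_tool_call` is a hypothetical helper written for illustration; in practice an MCP client library builds this envelope for you.

```python
import json
from itertools import count

# Monotonic request IDs, as JSON-RPC expects each request to carry its own id.
_request_ids = count(1)


def build_tool_call(tool_name: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    request = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(request)


# "lecture_vid" is the example resource ID used elsewhere on this page.
message = build_tool_call("getStructure", {"resource_id": "lecture_vid"})
print(message)
```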

Fetches metadata for a specific node ID.

resolve

The "Resolution" tool. Converts a node into evidence by extracting the clip or slice it points to.

Input:
- resource_id: (Required)
- node_id: (Required)
- virtual: (Optional, default: False)
Output: ResolvedEvidence object (contains absolute path to clip/slice).
Agent Usage: The final step. The agent takes the output_path and attaches it to the current chat context as grounded evidence.
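
The input rules above (two required IDs, `virtual` defaulting to False) can be sketched as a small argument builder, together with a reader for the returned path. This is an illustration only: both helper names are hypothetical, and the assumption that ResolvedEvidence arrives as a dict with an `output_path` key is inferred from the description above rather than taken from the schema.

```python
from pathlib import Path


def build_resolve_arguments(resource_id: str, node_id: str, virtual: bool = False) -> dict:
    """Build the argument object for the resolve tool, enforcing the required fields."""
    for name, value in (("resource_id", resource_id), ("node_id", node_id)):
        if not isinstance(value, str) or not value:
            raise ValueError(f"{name} is required and must be a non-empty string")
    return {"resource_id": resource_id, "node_id": node_id, "virtual": virtual}


def evidence_path(evidence: dict) -> Path:
    """Return the clip/slice path from a ResolvedEvidence-like dict.

    Assumption: the result is a dict carrying an `output_path` key.
    """
    path = Path(evidence["output_path"])
    if not path.is_absolute():
        raise ValueError(f"expected an absolute path, got {path}")
    return path


arguments = build_resolve_arguments("lecture_vid", "recursion_demo")
print(arguments)
```

Checking that the path is absolute before attaching it keeps the agent from silently resolving it against its own working directory.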

Example Workflow

Discovery: User asks "What happens at 5 minutes in the video?".
Mapping: Agent calls listResources and finds the indexed video, lecture_vid.
Selection: Agent calls getStructure("lecture_vid").
Pinpointing: Agent finds a node with id: "recursion_demo" and location: {start: 300, end: 360}. Times are in seconds, so the node starts at the 5-minute mark.
Resolution: Agent calls resolve("lecture_vid", "recursion_demo").
Grounding: CiteKit extracts the 60s clip, returns the path, and the agent "sees" the video.
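
The Pinpointing step above amounts to finding the node whose time range covers the requested timestamp. A minimal sketch follows, assuming the structure map exposes a flat list of nodes that each carry an `id` and a `location` with `start` and `end` in seconds. The flat layout, the "intro" node, and the `find_nodes_at` helper are all hypothetical; the real ResourceMap schema may nest nodes differently.

```python
def find_nodes_at(nodes: list, timestamp: float) -> list:
    """Return the nodes whose [start, end) range contains the timestamp (seconds)."""
    return [
        node
        for node in nodes
        if node["location"]["start"] <= timestamp < node["location"]["end"]
    ]


# Hypothetical excerpt of a structure map for "lecture_vid".
nodes = [
    {"id": "intro", "location": {"start": 0, "end": 300}},
    {"id": "recursion_demo", "location": {"start": 300, "end": 360}},
]

# "5 minutes in" is 300 seconds.
matches = find_nodes_at(nodes, 5 * 60)
clip_seconds = matches[0]["location"]["end"] - matches[0]["location"]["start"]
print([node["id"] for node in matches], clip_seconds)
```

Treating each range as half-open means a timestamp that falls exactly on a boundary matches only the node that starts there, not the one that just ended.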

Add this to your claude_desktop_config.json:

{
  "mcpServers": {
    "citekit": {
      "command": "python",
      "args": ["-m", "citekit.cli", "serve"]
    }
  }
}
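
If claude_desktop_config.json already lists other MCP servers, pasting the snippet over the file would drop them. The sketch below merges the citekit entry into whatever is already there. `add_citekit_server` is a hypothetical helper, the "other" server in the demo is made up, and the demo writes to a temporary directory rather than to a real config file.

```python
import json
import tempfile
from pathlib import Path

# The server entry from the configuration above.
CITEKIT_ENTRY = {"command": "python", "args": ["-m", "citekit.cli", "serve"]}


def add_citekit_server(config_path: Path) -> dict:
    """Add the citekit entry to a config file, keeping any existing servers."""
    text = config_path.read_text(encoding="utf-8") if config_path.exists() else ""
    config = json.loads(text) if text.strip() else {}
    config.setdefault("mcpServers", {})["citekit"] = CITEKIT_ENTRY
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config


# Demo against a throwaway file that already contains a made-up server.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = Path(tmp) / "claude_desktop_config.json"
    demo_path.write_text('{"mcpServers": {"other": {"command": "node"}}}', encoding="utf-8")
    merged = add_citekit_server(demo_path)
    written = json.loads(demo_path.read_text(encoding="utf-8"))

print(sorted(merged["mcpServers"]))
```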

See the MCP Integration Guide for IDE-specific setup.