Introduction
PDF Reader MCP is a Model Context Protocol (MCP) server that enables AI agents to read and extract content from PDF files.
What It Does
AI agents often need to access information from PDF documents - reports, invoices, research papers, manuals, and more. This server provides tools to extract:
- Full text content - Get all text from a PDF
- Page-specific text - Extract text from specific pages or page ranges
- Metadata - Author, title, creation date, and other document properties
- Page count - Total number of pages
- Embedded images - Extract images as base64-encoded PNG data
Key Features
Multiple Sources
Process PDFs from local files or URLs in a single request. Mix and match sources as needed.
Batch Processing
Send multiple PDF sources in one request. The server processes them concurrently for optimal performance.
Flexible Extraction
Choose exactly what data you need - full text, specific pages, metadata only, or everything including images.
Image Extraction
Extract embedded images from PDFs for AI vision analysis. Images are returned as base64-encoded PNG data.
Supported Clients
- Claude Desktop - Add to your
claude_desktop_config.json - Claude Code - Use
claude mcp addcommand - Cursor - Configure in MCP settings
- Any MCP Client - Standard MCP protocol over stdio