
Parseflow
Let Claude Code extract text, metadata, images, TOC, and search inside PDFs without manual copy-paste.
Overview
ParseFlow is an MCP server for the Build phase that extracts text, metadata, images, TOC, and search results from PDFs for your coding agent.
What is this MCP server?
- Full-text extraction and in-document search over PDFs via MCP tools
- Metadata and table-of-contents access for structured navigation
- Image extraction from pages for multimodal or asset workflows
- stdio npm package parseflow-mcp-server (v1.0.2) for local agent attach
- MCP integration for agents, not a standalone PDF desktop app
- Server version 1.0.2 on npm identifier parseflow-mcp-server
- stdio transport via Model Context Protocol
- Capabilities: text extraction, metadata, search, images, and TOC
Community signal: 2 GitHub stars.
What problem does it solve?
Agents cannot reliably read PDF attachments unless you build a custom parser or break documents apart by hand.
Who is it for?
Indie builders adding document-aware agents, RAG ingest, or internal tools that must consume PDF specs and reports.
Skip if: Teams that need enterprise OCR, redaction workflows, or batch conversion outside an agent IDE session.
What do I get? / Deliverables
Your agent can query PDF contents, structure, and images in-thread so features and summaries stay grounded in the actual file.
- Extracted PDF text and metadata usable in agent prompts
- TOC and image payloads for downstream chunking or UI
- In-PDF search results without manual page hunting
Recommended MCP Servers
Journey fit
PDF ingestion is wired into the product and agent stack during build, not a launch or growth tactic. ParseFlow is an MCP bridge for document pipelines—contracts, specs, and research PDFs the agent can query programmatically.
How it compares
MCP PDF toolkit, not a hosted document SaaS or a general web-scraping skill.
Common Questions / FAQ
Who is ParseFlow for?
Solo developers and agent builders who want PDF text, metadata, TOC, images, and search exposed as MCP tools inside Claude Code, Cursor, or similar hosts.
When should I use ParseFlow?
Use it during build when your product or agent workflow must read, search, or chunk PDFs programmatically instead of manual export.
How do I add ParseFlow to my agent?
Install the npm package parseflow-mcp-server, register it as a stdio MCP server in your client config, then call its tools from the agent session.