
Extract
Turn arbitrary URLs into clean Markdown with metadata and caching for RAG, briefs, and agent research loops.
Overview
Averra Extract is a MCP server for the Idea phase that fetches URLs and returns cached Markdown with metadata for AI research and pipeline ingestion.
What is this MCP server?
- URL-to-Markdown conversion tuned for AI pipelines
- Response metadata plus caching to avoid repeat fetches
- npm package @averra/extract-mcp with stdio MCP transport
- Version 0.1.5 on the official MCP registry schema
- Registry and npm version 0.1.5
- stdio transport via @averra/extract-mcp
- GitHub source at Swwyymm/averra-extract-mcp
What problem does it solve?
Raw HTML and ad-heavy pages clog agent context, so builders waste tokens cleaning sites by hand before every research or RAG task.
Who is it for?
Indie builders building research agents, newsletters, or RAG prototypes who want one MCP tool instead of bespoke fetch-and-parse scripts.
Skip if: Workflows that need full browser automation, logins, or heavy JavaScript rendering without a separate browser stack.
What do I get? / Deliverables
Agents receive consistent Markdown and metadata from URLs, with caching so repeated research passes stay fast and cheap.
- Markdown representation of a requested URL
- Structured metadata alongside extracted content
- Cache-backed repeats for the same URL in agent sessions
Recommended MCP Servers
Journey fit
URL extraction is shelved under idea because builders first pull source material from the web while exploring problems and markets. Research is the right subphase for fetching and normalizing external pages before validation artifacts or code exist.
How it compares
MCP URL ingestion server, not a headless browser skill or site-wide crawler product.
Common Questions / FAQ
Who is Averra Extract for?
Averra Extract is for solo builders and small teams who use AI agents to research the web and need Markdown-ready page content with metadata and caching.
When should I use Averra Extract?
Use it during idea research, when ingesting docs into a build pipeline, or when refreshing source pages for content and growth workflows.
How do I add Averra Extract to my agent?
Install npm package @averra/extract-mcp (registry version 0.1.5), configure stdio MCP transport in your client, and point tools at the URLs you want converted.