About
ScrapeGraph AI is an AI-powered web scraping and data extraction platform that enables intelligent extraction of structured data from websites using natural language prompts. Key capabilities of ScrapeGraph AI: - **AI-Powered Web Scraping**: Extract specific data from web pages using natural language prompts, with intelligent parsing of HTML into structured JSON or custom schemas. - **Markdown Conversion**: Transform entire web pages into clean, structured markdown format for easy consumption. - **Multi-Page Crawling**: SmartCrawler asynchronously crawls websites with configurable depth and page limits to extract data across entire site structures. - **Web Search Integration**: Execute AI-powered web searches and retrieve structured results from across the internet. - **Browser Automation**: Full JavaScript rendering support and infinite scroll handling for dynamic, JavaScript-heavy websites. - **Enterprise Reliability**: Production-grade error handling, timeout management, and async processing for robust performance at scale. The service integrates seamlessly with Claude Desktop, Cursor, and any MCP-compatible client, requiring only an API key from the ScrapeGraph dashboard to get started.
README
ScrapeGraph MCP Server
[](https://opensource.org/licenses/MIT) [](https://www.python.org/downloads/) [](https://smithery.ai/server/@ScrapeGraphAI/scrapegraph-mcp)
A production-ready Model Context Protocol (MCP) server that provides seamless integration with the ScrapeGraph AI API. This server enables language models to leverage advanced AI-powered web scraping capabilities with enterprise-grade reliability.
[](https://scrapegraphai.com/?utm_source=github&utm_medium=readme&utm_campaign=api_banner&utm_content=api_banner_image)
Table of Contents
Key Features
.agent/ folderQuick Start
1. Get Your API Key
Sign up and get your API key from the ScrapeGraph Dashboard
2. Install with Smithery (Recommended)
npx -y @smithery/cli install @ScrapeGraphAI/scrapegraph-mcp --client claude
3. Start Using
Ask Claude or Cursor:
That's it! The server is now available to your AI assistant.
Available Tools
The server provides 8 enterprise-ready tools for AI-powered web scraping:
Core Scraping Tools
#### 1. markdownify
Transform any webpage into clean, structured markdown format.
markdownify(website_url: str)
#### 2. smartscraper
Leverage AI to extract structured data from any webpage with support for infinite scrolling.
smartscraper(
user_prompt: str,
website_url: str,
number_of_scrolls: int = None,
markdown_only: bool = None
)
#### 3. searchscraper
Execute AI-powered web searches with structured, actionable results.
searchscraper(
user_prompt: str,
num_results: int = None,
number_of_scrolls: int = None,
time_range: str = None # Filter by: past_hour, past_24_hours, past_week, past_month, past_year
)
time_range to filter results by recency (e.g., "past_week" for recent results)Advanced Scraping Tools
#### 4. scrape
Basic scraping endpoint to fetch page content with optional heavy JavaScript rendering.
scrape(website_url: str, render_heavy_js: bool = None)
#### 5. sitemap
Extract sitemap URLs and structure for any website.
sitemap(website_url: str)
Multi-Page Crawling
#### 6. smartcrawler_initiate
Initiate intelligent multi-page web crawling (asynchronous operation).
```python smartcrawler_initiate( url: str, prompt: str = None, extraction_mode: str = "ai", depth: int = None, max_pages: int = None, same_domain_
Related MCP Servers
AI Research Assistant
hamid-vakilzadeh
AI Research Assistant provides comprehensive access to millions of academic papers through the Semantic Scholar and arXiv databases. This MCP server enables AI coding assistants to perform intelligent literature searches, citation network analysis, and paper content extraction without requiring an API key. Key features include: - Advanced paper search with multi-filter support by year ranges, citation thresholds, field of study, and publication type - Title matching with confidence scoring for finding specific papers - Batch operations supporting up to 500 papers per request - Citation analysis and network exploration for understanding research relationships - Full-text PDF extraction from arXiv and Wiley open-access content (Wiley TDM token required for institutional access) - Rate limits of 100 requests per 5 minutes with options to request higher limits through Semantic Scholar
Linkup
LinkupPlatform
Linkup is a real-time web search and content extraction service that enables AI assistants to search the web and retrieve information from trusted sources. It provides source-backed answers with citations, making it ideal for fact-checking, news gathering, and research tasks. Key features of Linkup: - Real-time web search using natural language queries to find current information, news, and data - Page fetching to extract and read content from any webpage URL - Search depth modes: Standard for direct-answer queries and Deep for complex research across multiple sources - Source-backed results with citations and context from relevant, trustworthy websites - JavaScript rendering support for accessing dynamic content on JavaScript-heavy pages
Math-MCP
EthanHenrickson
Math-MCP is a computation server that enables Large Language Models (LLMs) to perform accurate numerical calculations through the Model Context Protocol. It provides precise mathematical operations via a simple API to overcome LLM limitations in arithmetic and statistical reasoning. Key features of Math-MCP: - Basic arithmetic operations: addition, subtraction, multiplication, division, modulo, and bulk summation - Statistical analysis functions: mean, median, mode, minimum, and maximum calculations - Rounding utilities: floor, ceiling, and nearest integer rounding - Trigonometric functions: sine, cosine, tangent, and their inverses with degrees and radians conversion support