Price Per TokenPrice Per Token

Browser MCP Server

by Saik0s

0

About

Browser MCP Server wraps browser-use to give AI assistants control over a real web browser for automation tasks including navigation, form interaction, and data extraction. Key features: - Automates browser actions like navigating pages, filling forms, clicking buttons, and extracting structured data - HTTP-based transport designed for long-running operations (30-120+ seconds) that avoids stdio timeout issues common with browser automation - Web UI dashboard for monitoring task execution, viewing logs, and checking server health in real-time - Deep research capabilities for comprehensive information gathering workflows - Skills system for creating reusable browser automation patterns - Requires an LLM API key (e.g., Gemini) to power the browser agent's decision-making - Built on Playwright with Chromium browser support

README

mcp-server-browser-use

MCP server that gives AI assistants the power to control a web browser.

[](LICENSE)

---

Table of Contents

  • What is this?
  • Installation
  • Web UI
  • Web Dashboard
  • Configuration
  • CLI Reference
  • MCP Tools
  • Deep Research
  • Observability
  • Skills System
  • REST API Reference
  • Architecture
  • License
  • ---

    What is this?

    This wraps browser-use as an MCP server, letting Claude (or any MCP client) automate a real browser—navigate pages, fill forms, click buttons, extract data, and more.

    Why HTTP instead of stdio?

    Browser automation tasks take 30-120+ seconds. The standard MCP stdio transport has timeout issues with long-running operations—connections drop mid-task. HTTP transport solves this by running as a persistent daemon that handles requests reliably regardless of duration.

    ---

    Installation

    Claude Code Plugin (Recommended)

    Install as a Claude Code plugin for automatic setup:

    # Install the plugin
    /plugin install browser-use/mcp-browser-use
    

    The plugin automatically:

  • Installs Playwright browsers on first run
  • Starts the HTTP daemon when Claude Code starts
  • Registers the MCP server with Claude
  • Set your API key (the browser agent needs an LLM to decide actions):

    # Set API key (environment variable - recommended)
    export GEMINI_API_KEY=your-key-here

    Or use config file

    mcp-server-browser-use config set -k llm.api_key -v your-key-here

    That's it! Claude can now use browser automation tools.

    Manual Installation

    For other MCP clients or standalone use:

    # Clone and install
    git clone https://github.com/Saik0s/mcp-browser-use.git
    cd mcp-server-browser-use
    uv sync

    Install browser

    uv run playwright install chromium

    Start the server

    uv run mcp-server-browser-use server

    Add to Claude Desktop (~/Library/Application Support/Claude/claude_desktop_config.json):

    {
      "mcpServers": {
        "browser-use": {
          "type": "streamable-http",
          "url": "http://localhost:8383/mcp"
        }
      }
    }
    

    For MCP clients that don't support HTTP transport, use mcp-remote as a proxy:

    {
      "mcpServers": {
        "browser-use": {
          "command": "npx",
          "args": ["mcp-remote", "http://localhost:8383/mcp"]
        }
      }
    }
    

    ---

    Web UI

    Access the task viewer at http://localhost:8383 when the daemon is running.

    Features:

  • Real-time task list with status and progress
  • Task details with execution logs
  • Server health status and uptime
  • Running tasks monitoring
  • The web UI provides visibility into browser automation tasks without requiring CLI commands.

    ---

    Web Dashboard

    Access the full-featured dashboard at http://localhost:8383/dashboard when the daemon is running.

    Features:

  • Tasks Tab: Complete task history with filtering, real-time status updates, and detailed execution logs
  • Skills Tab: Browse, inspect, and manage learned skills with usage statistics
  • History Tab: Historical view of all completed tasks with filtering by status and time
  • Key Capabilities:

  • Run existing skills directly from the dashboard with custom parameters
  • Start learning sessions to capture new skills
  • Delete outdated or invalid skills
  • Monitor running tasks with live progress updates
  • View full task results and error details
  • The dashboard provides a comprehensive web interface for managing all aspects of browser automation without CLI commands.

    ---

    Configuration

    Settings are stored in ~/.config/mcp-server-browser-use/config.json.

    View current config:

    mcp-server-browser-use config view
    

    Change settings:

    mcp-server-browser-use config set -k llm.provider -v openai
    mcp-server-browser-use config set -k llm.model_name -v gpt-4o
    

    Note: Set API keys via environment variables (e.g., ANTHROPIC_API_KEY) for better security

    mcp-server-browser-use config set -k llm.api_key -v sk-...

    mcp-server-browser-use config set -k browser.headless -v false mcp-server-browser-use config set -k agent.max_steps -v 30

    Settings Reference

    | Key | Default | Description | |-----|---------|-------------| | llm.provider | google | LLM provider (anthropic, openai, google, azure_openai, groq, deepseek, cerebras, ollama, bedrock, browser_use, openrouter, vercel) | | llm.model_name | gemini-3-flash-preview | Model for the browser agent | | llm.api_key | - | API key for the provider (prefer env vars: GEMINI_API_KEY, ANTHROPIC_API_KEY, etc.) | | browser.headless | true | Run browser without GUI | | browser.cdp_url | - | Connect to existing Chrome (e.g., http://localhost:9222) | | browser.user_data_dir | - | Chrome profile dire

    Related MCP Servers

    AI Research Assistant

    AI Research Assistant

    hamid-vakilzadeh

    AI Research Assistant provides comprehensive access to millions of academic papers through the Semantic Scholar and arXiv databases. This MCP server enables AI coding assistants to perform intelligent literature searches, citation network analysis, and paper content extraction without requiring an API key. Key features include: - Advanced paper search with multi-filter support by year ranges, citation thresholds, field of study, and publication type - Title matching with confidence scoring for finding specific papers - Batch operations supporting up to 500 papers per request - Citation analysis and network exploration for understanding research relationships - Full-text PDF extraction from arXiv and Wiley open-access content (Wiley TDM token required for institutional access) - Rate limits of 100 requests per 5 minutes with options to request higher limits through Semantic Scholar

    Web & Search
    12 8
    Linkup

    Linkup

    LinkupPlatform

    Linkup is a real-time web search and content extraction service that enables AI assistants to search the web and retrieve information from trusted sources. It provides source-backed answers with citations, making it ideal for fact-checking, news gathering, and research tasks. Key features of Linkup: - Real-time web search using natural language queries to find current information, news, and data - Page fetching to extract and read content from any webpage URL - Search depth modes: Standard for direct-answer queries and Deep for complex research across multiple sources - Source-backed results with citations and context from relevant, trustworthy websites - JavaScript rendering support for accessing dynamic content on JavaScript-heavy pages

    Web & Search
    2 24
    Math-MCP

    Math-MCP

    EthanHenrickson

    Math-MCP is a computation server that enables Large Language Models (LLMs) to perform accurate numerical calculations through the Model Context Protocol. It provides precise mathematical operations via a simple API to overcome LLM limitations in arithmetic and statistical reasoning. Key features of Math-MCP: - Basic arithmetic operations: addition, subtraction, multiplication, division, modulo, and bulk summation - Statistical analysis functions: mean, median, mode, minimum, and maximum calculations - Rounding utilities: floor, ceiling, and nearest integer rounding - Trigonometric functions: sine, cosine, tangent, and their inverses with degrees and radians conversion support

    Developer Tools
    22 81