Price Per TokenPrice Per Token
Image Analysis Server

Image Analysis Server

by rentapad

0

About

Image Analysis Server is an MCP server that analyzes visual content using Google's Gemini 2.0 Flash model to generate detailed descriptions and insights from images and videos. Key features of Image Analysis Server: - Analyze images and videos via URLs or local file paths with high-precision AI recognition - Direct YouTube video URL analysis for content comprehension without downloading - Multi-modal analysis comparing relationships between multiple images or videos in a single request - Customizable prompts to guide analysis focus and extract specific information - Local file support with Base64 encoding and built-in security path validation - Handles various image and video MIME types for broad format compatibility

README

image-mcp-server-gemini

[](https://smithery.ai/server/@Rentapad/image-mcp-server-gemini) An MCP server that receives image/video URLs or local file paths and analyzes their content using the Gemini 2.0 Flash model.(forked from github.com/champierre/image-mcp-server)

Features

  • Analyzes content from one or more image/video URLs or local file paths.
  • Analyzes videos directly from YouTube URLs.
  • Can analyze relationships between multiple images or videos provided together.
  • Supports optional text prompts to guide the analysis.
  • High-precision recognition and description using the Gemini 2.0 Flash model.
  • URL validity checking and local file loading with Base64 encoding.
  • Basic security checks for local file paths.
  • Handles various image and video MIME types (see Usage section for details).
  • Installation

    Installing via Smithery

    To install Image Analysis Server for Claude Desktop automatically via Smithery:

    npx -y @smithery/cli install @Rentapad/image-mcp-server --client claude
    

    Manual Installation

    # Clone the repository
    git clone https://github.com/Rentapad/image-mcp-server-gemini.git 
    cd image-mcp-server-gemini

    Install dependencies

    npm install

    Compile TypeScript

    npm run build

    Configuration

    To use this server, you need a Gemini API key. Set the following environment variable:

    GEMINI_API_KEY=your_gemini_api_key
    

    MCP Server Configuration

    To use with tools like Cline, add the following settings to your MCP server configuration file:

    For Cline

    Add the following to cline_mcp_settings.json:

    {
      "mcpServers": {
        "image-video-analysis": { // Consider renaming for clarity
          "command": "node",
          "args": ["/path/to/image-mcp-server/dist/index.js"],
          "env": {
            "GEMINI_API_KEY": "your_gemini_api_key"
          }
        }
      }
    }
    

    For Claude Desktop App

    Add the following to claude_desktop_config.json:

    {
      "mcpServers": {
        "image-video-analysis": { // Consider renaming for clarity
          "command": "node",
          "args": ["/path/to/image-mcp-server/dist/index.js"],
          "env": {
            "GEMINI_API_KEY": "your_gemini_api_key"
          }
        }
      }
    }
    

    Usage

    Once the MCP server is configured, the following tools become available:

  • analyze_image: Receives one or more image URLs and analyzes their content.
  • - Arguments: imageUrls (array of strings, required), prompt (string, optional).
  • analyze_image_from_path: Receives one or more local image file paths and analyzes their content.
  • - Arguments: imagePaths (array of strings, required), prompt (string, optional).
  • analyze_video: Receives one or more video URLs and analyzes their content. Best for smaller videos (see Video Notes).
  • - Arguments: videoUrls (array of strings, required), prompt (string, optional).
  • analyze_video_from_path: Receives one or more local video file paths and analyzes their content. Best for smaller videos (see Video Notes).
  • - Arguments: videoPaths (array of strings, required), prompt (string, optional).
  • analyze_youtube_video: Receives a single YouTube video URL and analyzes its content.
  • - Arguments: youtubeUrl (string, required), prompt (string, optional).

    Usage Examples

    Analyzing a single image from URL:

    Please analyze this image: https://example.com/image.jpg
    

    Analyzing multiple images from local paths and comparing them:

    Analyze these images: /path/to/your/image1.png, /path/to/your/image2.jpeg. Which one contains a cat?
    
    *(The client would call analyze_image_from_path with imagePaths: ["/path/to/your/image1.png", "/path/to/your/image2.jpeg"] and prompt: "Which one contains a cat?")*

    Analyzing a video from URL with a specific prompt:

    Summarize the content of this video: https://example.com/video.mp4
    
    *(The client would call analyze_video with videoUrls: ["https://example.com/video.mp4"] and prompt: "Summarize the content of this video")*

    Analyzing a YouTube video:

    What is the main topic of this YouTube video? https://www.youtube.com/watch?v=dQw4w9WgXcQ
    
    *(The client would call analyze_youtube_video with youtubeUrl: "https://www.youtube.com/watch?v=dQw4w9WgXcQ" and prompt: "What is the main topic of this YouTube video?")*

    Video Notes

  • Size Limit: For videos provided via URL (analyze_video) or path (analyze_video_from_path), Gemini currently has limitations on the size of video data that can be processed directly (typically around 20MB after Base64 encoding). Larger videos may fail. YouTube analysis does not have this same client-side download limit.
  • Supported MIME Types: The server attempts to map and use MIME types supported by Gemini for video. Officially supported types include: video/mp4, video/mpeg, video/mov, video/avi, video/x-flv
  • Related MCP Servers

    AI Research Assistant

    AI Research Assistant

    hamid-vakilzadeh

    AI Research Assistant provides comprehensive access to millions of academic papers through the Semantic Scholar and arXiv databases. This MCP server enables AI coding assistants to perform intelligent literature searches, citation network analysis, and paper content extraction without requiring an API key. Key features include: - Advanced paper search with multi-filter support by year ranges, citation thresholds, field of study, and publication type - Title matching with confidence scoring for finding specific papers - Batch operations supporting up to 500 papers per request - Citation analysis and network exploration for understanding research relationships - Full-text PDF extraction from arXiv and Wiley open-access content (Wiley TDM token required for institutional access) - Rate limits of 100 requests per 5 minutes with options to request higher limits through Semantic Scholar

    Web & Search
    12 8
    Linkup

    Linkup

    LinkupPlatform

    Linkup is a real-time web search and content extraction service that enables AI assistants to search the web and retrieve information from trusted sources. It provides source-backed answers with citations, making it ideal for fact-checking, news gathering, and research tasks. Key features of Linkup: - Real-time web search using natural language queries to find current information, news, and data - Page fetching to extract and read content from any webpage URL - Search depth modes: Standard for direct-answer queries and Deep for complex research across multiple sources - Source-backed results with citations and context from relevant, trustworthy websites - JavaScript rendering support for accessing dynamic content on JavaScript-heavy pages

    Web & Search
    2 24
    Math-MCP

    Math-MCP

    EthanHenrickson

    Math-MCP is a computation server that enables Large Language Models (LLMs) to perform accurate numerical calculations through the Model Context Protocol. It provides precise mathematical operations via a simple API to overcome LLM limitations in arithmetic and statistical reasoning. Key features of Math-MCP: - Basic arithmetic operations: addition, subtraction, multiplication, division, modulo, and bulk summation - Statistical analysis functions: mean, median, mode, minimum, and maximum calculations - Rounding utilities: floor, ceiling, and nearest integer rounding - Trigonometric functions: sine, cosine, tangent, and their inverses with degrees and radians conversion support

    Developer Tools
    22 81