Price Per TokenPrice Per Token

LinkedIn Profile Data Mining Server

by amankale376

GitHub 3 9 uses Remote
0

About

LinkedIn Profile Data Mining Server enables automated discovery, extraction, and enrichment of LinkedIn professional profiles. It combines web scraping with AI-powered search and contact data enrichment to build comprehensive professional databases. Key features of LinkedIn Profile Data Mining Server: - Google Custom Search API integration with AI-powered query expansion using GPT-4o mini for discovering relevant profiles globally - Direct LinkedIn scraping with Nubela Proxycurl API fallback for extracting structured profile data including names, companies, job titles, descriptions, and follower counts - Apollo.io integration for contact enrichment with validated email addresses, phone numbers, and detailed company information - Support for multiple LLMs including OpenAI, Gemini, OpenRouter, and Ollama for profile summarization, relevance scoring, and query optimization - Persistent SQLite storage with automatic duplicate prevention and CSV export capabilities for CRM integration

README

LinkedIn Profile Data Mining MCP Server

A comprehensive Model Context Protocol (MCP) server for LinkedIn profile data mining, search, and contact information enrichment. This server integrates all the powerful features from the original data mining tool into an MCP-compatible interface.

Features

🔍 Advanced Search Capabilities

  • Google Search Integration: Uses Google Custom Search API for LinkedIn profile discovery
  • AI-Powered Query Expansion: Generates additional search queries using OpenAI GPT-4o mini
  • Smart Filtering: AI-based relevance filtering to ensure high-quality results
  • Location-Based Search: Supports global location targeting for comprehensive coverage
  • 📊 Profile Data Extraction

  • Direct LinkedIn Scraping: Extracts profile data directly from LinkedIn pages
  • Nubela Proxycurl Fallback: Uses Nubela API when direct scraping fails
  • Structured Data Parsing: Extracts JSON-LD structured data from LinkedIn profiles
  • Comprehensive Profile Fields: Name, company, job title, description, followers, etc.
  • 📞 Contact Information Enrichment

  • Apollo.io Integration: Enriches profiles with email addresses and phone numbers
  • Company Information: Retrieves detailed company descriptions and contact details
  • Professional Validation: Ensures contact information accuracy through API validation
  • 🤖 AI-Powered Features

  • Profile Summarization: Generates concise professional summaries using AI
  • Relevance Scoring: AI-based filtering to match search intent
  • Query Optimization: Intelligent search query generation and expansion
  • Multiple LLM Support: OpenAI, Gemini, OpenRouter, and Ollama compatibility
  • 💾 Data Management

  • SQLite Database: Persistent storage for all extracted profiles
  • CSV Export: Easy data export for analysis and CRM integration
  • Duplicate Prevention: Automatic detection and prevention of duplicate profiles
  • Data Validation: Ensures data quality and completeness
  • Installation

    1. Clone or navigate to the server directory:

       cd smithery-servers/profile-searcher
       

    2. Install dependencies:

       npm install
       

    3. Configure API keys:

       # Copy the example configuration file
       cp .env.example .env
       
       # Edit .env with your API keys
       nano .env  # or use your preferred editor
       

    4. Start the development server:

       npm run dev
       

    🔑 API Keys Configuration

    📋 See CONFIGURATION.md for detailed setup instructions

    Quick Setup:

    1. Required: Apollo.io API key → Get from apollo.io/settings/integrations 2. Required: OpenAI API key → Get from platform.openai.com/api-keys 3. Optional: Nubela API key → Get from nubela.co/proxycurl

    Environment Variables (.env file):

    APOLLO_API_KEY=your_apollo_api_key_here
    OPENAI_API_KEY=sk-your_openai_api_key_here
    NUBELA_API_KEY=your_nubela_api_key_here
    DEBUG=false
    

    For Claude Desktop (claude_desktop_config.json):

    {
      "mcpServers": {
        "profile-searcher": {
          "command": "node",
          "args": ["/path/to/smithery-servers/profile-searcher/dist/index.js"],
          "env": {
            "APOLLO_API_KEY": "your_apollo_api_key_here",
            "OPENAI_API_KEY": "sk-your_openai_api_key_here"
          }
        }
      }
    }
    

    Available Tools

    1. search_linkedin_profiles

    Search for LinkedIn profiles based on keywords.

    Parameters:

  • keywords (string): Search keywords (e.g., "AI podcast host")
  • num_results (number): Number of results to return (default: 20)
  • Example:

    {
      "keywords": "AI podcast host",
      "num_results": 10
    }
    

    2. extract_profile_data

    Extract detailed profile data from LinkedIn URLs.

    Parameters:

  • urls (array): Array of LinkedIn profile URLs
  • include_contact_info (boolean): Whether to include contact info (default: true)
  • Example:

    {
      "urls": [
        "https://www.linkedin.com/in/example-profile",
        "https://www.linkedin.com/in/another-profile"
      ],
      "include_contact_info": true
    }
    

    3. mine_linkedin_data

    Comprehensive data mining: search, extract, and enrich profile data.

    Parameters:

  • keywords (string): Keywords to search for
  • num_results (number): Number of profiles to process (default: 20)
  • export_csv (boolean): Whether to export to CSV (default: true)
  • csv_filename (string, optional): Custom CSV filename
  • Example:

    {
      "keywords": "blockchain developer",
      "num_results": 25,
      "export_csv": true,
      "csv_filename": "blockchain_developers.csv"
    }
    

    4. get_contact_info

    Get contact information for a specific person using Apollo API.

    Parameters:

  • person_name (string): Full name of the person
  • company_name (string): Company where the person works
  • Example: ```json { "person_name": "

    Related MCP Servers

    AI Research Assistant

    AI Research Assistant

    hamid-vakilzadeh

    AI Research Assistant provides comprehensive access to millions of academic papers through the Semantic Scholar and arXiv databases. This MCP server enables AI coding assistants to perform intelligent literature searches, citation network analysis, and paper content extraction without requiring an API key. Key features include: - Advanced paper search with multi-filter support by year ranges, citation thresholds, field of study, and publication type - Title matching with confidence scoring for finding specific papers - Batch operations supporting up to 500 papers per request - Citation analysis and network exploration for understanding research relationships - Full-text PDF extraction from arXiv and Wiley open-access content (Wiley TDM token required for institutional access) - Rate limits of 100 requests per 5 minutes with options to request higher limits through Semantic Scholar

    Web & Search
    12 8
    Linkup

    Linkup

    LinkupPlatform

    Linkup is a real-time web search and content extraction service that enables AI assistants to search the web and retrieve information from trusted sources. It provides source-backed answers with citations, making it ideal for fact-checking, news gathering, and research tasks. Key features of Linkup: - Real-time web search using natural language queries to find current information, news, and data - Page fetching to extract and read content from any webpage URL - Search depth modes: Standard for direct-answer queries and Deep for complex research across multiple sources - Source-backed results with citations and context from relevant, trustworthy websites - JavaScript rendering support for accessing dynamic content on JavaScript-heavy pages

    Web & Search
    2 24
    Math-MCP

    Math-MCP

    EthanHenrickson

    Math-MCP is a computation server that enables Large Language Models (LLMs) to perform accurate numerical calculations through the Model Context Protocol. It provides precise mathematical operations via a simple API to overcome LLM limitations in arithmetic and statistical reasoning. Key features of Math-MCP: - Basic arithmetic operations: addition, subtraction, multiplication, division, modulo, and bulk summation - Statistical analysis functions: mean, median, mode, minimum, and maximum calculations - Rounding utilities: floor, ceiling, and nearest integer rounding - Trigonometric functions: sine, cosine, tangent, and their inverses with degrees and radians conversion support

    Developer Tools
    22 81