
In Progress
Posted
Paid on delivery
I need a self-contained Python workflow that can read roughly 20,000 company names from an Excel sheet, run each name through Google, locate the official website or contact page, and then push that URL to ScrapeGraphAI’s Extract endpoint. Using the prompt, JSON schema and API key I will supply, the script should capture emails and mobile numbers and write them back to a new Excel file. You are free to build the browser layer with Selenium, Playwright, or any similar headless solution—whatever delivers the best stability. A straightforward XLSX output with sensible columns (company, website, email, phone) is perfectly fine for me. I am unsure how often Google will trigger CAPTCHA at this scale, so please include your recommended mitigation strategy (delays, rotating proxies, captcha-solver service, etc.) in your proposal. Once the run is complete I expect: • The final Excel/CSV containing all successfully extracted data • Well-commented Python source code with [login to view URL] • A short README explaining how to rerun the workflow • A quick summary of success rate and any rows that could not be processed Please outline your estimated timeline and highlight any previous projects where you automated Google searches, large-scale scraping, or API-driven data extraction. I will review the approaches and pick the one that offers the most reliable, maintainable solution.
Project ID: 40437507
20 proposals
Remote project
Active 4 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs

Hi, I can build this workflow in Python with a focus on stability, scalability, and maintainability for large-volume processing (~20,000 companies). My approach: • Read company names from Excel using pandas/openpyxl • Use Playwright or Selenium (preferably Playwright for better stability) to search Google and identify official/contact pages • Send discovered URLs to the ScrapeGraphAI Extract endpoint using your API key, prompt, and schema • Extract/store emails and phone numbers into structured XLSX/CSV output • Add retry logic, logging, checkpoint saving, and failure tracking for long runs For CAPTCHA/rate-limit mitigation, I’d recommend: • Randomized delays and human-like pacing • Rotating proxies if needed • Session reuse/browser fingerprint handling • Optional CAPTCHA-solving integration • Incremental saves for safe recovery Deliverables: • Final Excel/CSV output • Well-commented Python source code • [login to view URL] • README with rerun instructions • Processing summary with success/failure stats I’ve worked on automation, browser scraping, Excel-based workflows, API integrations, and large-scale data extraction pipelines with a strong focus on reliability and unattended execution. Estimated timeline: 3–5 days depending on CAPTCHA frequency and testing.
₹6,000 INR in 7 days
1.8
1.8
20 freelancers are bidding on average ₹9,087 INR for this job

⭐⭐⭐⭐⭐ Create a Python Workflow to Extract Company Data from Google ❇️ Hi My Friend, I hope you're doing well. I've reviewed your project requirements and see you are looking for a Python workflow to automate data extraction from Google. You don’t need to look any further; Zohaib is here to help you! My team has completed 50+ similar projects in data extraction and automation. I will build a stable solution using Selenium or Playwright to scrape the data you need. I will read the company names from your Excel sheet, find the official websites, and push the URLs to ScrapeGraphAI’s Extract endpoint. ➡️ Why Me? I can easily do your project as I have 5 years of experience in Python automation, web scraping, and data extraction. My expertise includes handling APIs, data processing, and creating reliable scripts. I also have a strong grip on tools like Selenium and Playwright. ➡️ Let's have a quick chat to discuss your project in detail and let me show you samples of my previous work. Looking forward to discussing with you! ➡️ Skills & Experience: ✅ Python Programming ✅ Web Scraping ✅ Selenium ✅ Playwright ✅ API Integration ✅ Data Processing ✅ Excel File Handling ✅ Error Handling ✅ JSON Manipulation ✅ Data Validation ✅ Automation Scripting ✅ CAPTCHA Solutions Waiting for your response! Best Regards, Zohaib
₹3,700 INR in 2 days
8.1
8.1

Hi, I can fix your Python Automation for Google Search + ScrapeGraphAI Extraction I've solved this exact problem many times. Here is what I will do: Build a stable Python workflow to read 20,000 company names from Excel and search Google safely. Extract the official website/contact page URL, then send it to ScrapeGraphAI using your prompt, JSON schema, and API key. Save clean results into a new Excel/CSV with company, website, email, and phone, plus a summary of failed rows. I’ll also add Google CAPTCHA mitigation with smart delays, retry logic, proxy support, and optional captcha-solver integration. 10 days free support after delivery Milestone-based payment Reply "YES" and Best regards, syed ribal
₹12,500 INR in 6 days
2.0
2.0

Hello, I've carefully reviewed your project "Python Automation for Google Search + ScrapeGraphAI Extraction" and I'm excited about the opportunity to work with you. **Backend Development:** • Expert in FastAPI and Django REST Framework for high-performance APIs • JWT authentication, role-based access control, and API security best practices • Database design with PostgreSQL, MongoDB, and Redis caching • Async processing, message queues (Celery, RabbitMQ) **AI & Agent Development:** • LLM integration (OpenAI, Anthropic, Google Gemini) • Built autonomous AI agents using LangChain and custom frameworks • Experience with NVIDIA NeMo Guardrails for production AI safety • Agentic patterns: ReAct, Chain-of-Thought, multi-agent systems **My Approach:** 1. Understand requirements and propose technical architecture 2. Set up development environment with proper CI/CD 3. Implement features with regular updates and clear communication 4. Comprehensive testing (unit, integration, e2e) 5. Smooth deployment with documentation and handover I'm available to start immediately and can dedicate full-time effort to deliver quality results on schedule. I pride myself on clean code, on-time delivery, and excellent communication throughout the project. Looking forward to discussing your project in detail. Best regards, Gowtham
₹6,127.20 INR in 30 days
1.9
1.9

Processing 20,000 company searches reliably is less about simple scraping and more about building a stable automation pipeline that can handle Google rate limits, CAPTCHAs, retries, and structured API extraction without breaking mid-run. I can build a fully automated Python workflow that searches company names, detects official/contact pages, sends URLs to ScrapeGraphAI Extract API, and writes validated emails/mobile numbers back into Excel. I will use Python with Playwright or Selenium, along with queue-based processing, retry handling, and structured logging to ensure large-scale execution reliability. To reduce CAPTCHA/rate-limit issues at this scale, I recommend: - Randomized delays + request throttling - Rotating proxies for large batches - Optional CAPTCHA-solving integration if needed I have worked on: - Large-scale Python scraping systems - Automated Excel/API workflows - Selenium & Playwright browser automation - Structured data extraction pipelines Will you provide proxy infrastructure, or should I include a recommended setup? Do you want multi-threaded execution for faster processing, or maximum stability with controlled concurrency? I can first run a sample batch on 100 companies to validate extraction quality before full deployment. Since I am new to Freelancer.com, I am offering a competitive rate to build my profile reputation. I am happy to provide a quick mockup/sample before you award me the project.
₹3,000 INR in 3 days
0.0
0.0

Ahmedabad, India
Payment method verified
Member since Apr 30, 2026
₹1500-12500 INR
₹75000-150000 INR
$30-250 USD
₹100-400 INR / hour
$15-25 USD / hour
$250-750 USD
$8-15 CAD / hour
₹12500-37500 INR
$8-15 USD / hour
£2-5 GBP / hour
₹1500-12500 INR
₹37500-75000 INR
₹400-750 INR / hour
$10-60 USD
₹600-666 INR
₹600-1500 INR
$8-15 USD / hour
₹12500-37500 INR
$30-250 USD
$8-15 USD / hour
₹12500-37500 INR