Web Scraper project scope:
A semi-automatic web scraper that collects data until it hits a CAPTCHA challenge, pauses so the user can solve the CAPTCHA, and then resumes. The scraper:
- Executes locally on a PC running Windows or Mac OS X, with internet access;
- Takes a list of numbers provided by the user in an Excel or CSV file;
- Pulls all data entries associated with each number from a public website, and automatically stores all entries in an organized Microsoft Excel workbook saved on the local computer's hard drive;
- Presents each CAPTCHA challenge in a window for the user to solve, and resumes scraping as described above once the answer has been entered;
- Optionally, saves all PDFs or other document files linked to each data entry in a folder named after the entry number.
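
To illustrate the intended flow, here is a minimal Python sketch of the scrape, pause, and resume loop. It is only an illustration under stated assumptions, not a required design. The names `CaptchaRequired`, `read_numbers`, `scrape_all`, `document_folder`, `fetch_entries` and `ask_user` are all hypothetical. The real site access, the CAPTCHA window, and the Excel output (which would need a third-party library such as openpyxl) are left to the programmer.

```python
import csv
from pathlib import Path
from typing import Callable, Iterable, Optional


class CaptchaRequired(Exception):
    """Hypothetical signal that the site answered with a CAPTCHA challenge."""

    def __init__(self, image: bytes):
        super().__init__("CAPTCHA challenge encountered")
        self.image = image  # raw challenge image to show the user


def read_numbers(csv_lines: Iterable[str]) -> list[str]:
    """Read one number per row from the first column, skipping blank rows."""
    return [row[0].strip() for row in csv.reader(csv_lines) if row and row[0].strip()]


def scrape_all(
    numbers: Iterable[str],
    fetch_entries: Callable[[str, Optional[str]], list[dict]],
    ask_user: Callable[[bytes], str],
    max_attempts: int = 3,
) -> tuple[dict[str, list[dict]], list[str]]:
    """Fetch entries for every number, pausing for the user at each CAPTCHA.

    fetch_entries(number, captcha_answer) is an assumed callable that returns
    the entries for a number or raises CaptchaRequired. ask_user(image) is an
    assumed callable that shows the challenge and returns the typed answer.
    """
    results: dict[str, list[dict]] = {}
    failed: list[str] = []
    for number in numbers:
        answer: Optional[str] = None
        for _ in range(max_attempts):
            try:
                results[number] = fetch_entries(number, answer)
                break
            except CaptchaRequired as challenge:
                # Pause here until the user has entered the CAPTCHA, then retry.
                answer = ask_user(challenge.image)
        else:
            # Every attempt ended in a CAPTCHA; record the number and move on.
            failed.append(number)
    return results, failed


def document_folder(base_dir: str, entry_number: str) -> Path:
    """Folder for documents linked to an entry, named by the entry number."""
    return Path(base_dir) / str(entry_number)
```

In use, `read_numbers(open("numbers.csv", newline=""))` would supply the input list (the file name is a placeholder), and the `results` dictionary would then be written to the workbook, one row per entry.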
The programmer will provide documentation explaining the function of each part of the web scraper software, along with the source code, free of license restrictions. The scraper must run entirely on our PC; no part of it may run through your site or any other third-party site.
Please let us know if you have any questions, and good luck bidding.