I need a script written in PHP that grabs an Excel spread sheet from a website. Rather, I actually need two scripts for two distinctly different websites. The program needs run in Linux making use of a free OCR application, run on the server side (not through a browser), and the down loaded Excel file will be saved locally. The program should make use of an existing web scraping tool kit. The program needs to log in, POST four fields and save the Excel file that is returned as a link in the HTML.
The four fields will be statically defined as variables at the beginning of the script so that I can edit them. I do not want a graphical interface.
Access to the site requires a user name and password (account setup is easy and free). After logging in, the user must pass an “additional security” screen with a random digital image password that is typed in – which may require OCR. After successfully logging in, the site does NOT “time out” the user. Four fields are then identified – state, county, start date and end date. Hit submit, and the server prepares a list. I then make sure the “summary view” box is checked (this is a static setting), and then click the “XL” icon. A box comes up to save or open the file.
This site doesn’t have a log in. Starting at the home page, “advanced search” is selected. This step can probably be bypassed. A search screen opens with the same four fields to be chosen (there are other fields that I don’t use on both sites). This site also produces the random digital image that will need OCR. Enter the data for the same four fields and hit search (there are two search buttons with different functions – it matters that the correct one is clicked). Then click the link to open the Excel spreadsheet as on Site #1. This site lets you stay logged in indefinitely – but after about 10 searches, the security image process runs again. The number of searches between security checks must be random.
The file to be saved will overwrite the previous file and each site must have a separate file.
Contact me and I will send the website locations directly. You can create your own account.
14 freelancers are bidding on average $436 for this job
Hi ! I m interested in your project and I have experience in web scraping/crawling and web automation. I hope you will consider my bid. Plz chk my PMB. Thanks !!