I require a simple php web crawler script to be coded in php. The script has to have the ability to crawl just the select pages that I as admin choose and ignore the rest of the internet. The pages the script should crawl would need to be able to be set within a simple textarea in the admin section of the script.
When crawling these select pages it needs to search for certain keywords in the pages' source code and input them into a mysql database on the fly. When inputing data from the crawled pages into the database it has to validate before insertion to make sure no duplicate data already exists in the database.
A frontend to then search the database with search string queries and display the results on the page is also needed.
Knowledge of PHP regular expressions are a must for this project.
I will require the selected programmer to sign a non-disclosure agreement and a copyright agreement (which are attached to this post) before we start.