Screen scraping of 6 websites using open source
$100-500 USD
Teslim sırasında ödenir
I require a very simple application which scrap data from 6 different sites and create an xml output.
The program should use an open source scrapping tool which called
WebHarvest (you can find it in : [url removed, login to view])
What i need from you is a Web Harvest script files which creates variable contains the XML and a small java application which execute the script and print the XML (Example: [url removed, login to view]).
There should not be any code in the java main except running the script and sending parameters value and output the XML (all the logic and the creation of the XML will reside in the scripts)
There will be a total of 6 urls that we require web scraping. Here they are and the requirements. Each site would require its own script:
[url removed, login to view]
Takes a state as a search criteria. Returns pages of results. Each result should be converted (for all pages) should be converted into an xml file called [url removed, login to view] when the run is complete.
[url removed, login to view]
Takes a state as a search criteria. Returns results in a flash outputted view. Each result should be converted (for all pages) should be converted into an xml file called [url removed, login to view] when the run is complete.
[url removed, login to view]
Takes a state as a search criteria. Each result should be converted (for all pages) should be converted into an xml file called [url removed, login to view] when the run is complete.
[url removed, login to view]
[url removed, login to view] (list view)
Takes a zip code AND a price range. Each result should be converted (for all pages) should be converted into an xml file called [url removed, login to view] when the run is complete.
[url removed, login to view] with the real estate plugin
Takes a zip code AND a price range. Each result should be converted (for all pages) should be converted into an xml file called [url removed, login to view] when the run is complete.
Because all of these are real estate websites, you will be required to first do a post search on them in order to scrape the results. The post search query typically requires a zip code, state and/or city
The scripts should be able to be called via java code You will provide both the scripts and the java code
Proje NO: #3498408