Closed

Django/web2py price comparison website with Scrapy scraper

Hello

I need someone to develop a price comparison website in Django/web2py, using Scrapy (or something better) and Selenium. The code must be documented in English. Scrapy (or the alternative) should crawl a variable number of separate sites, using a number of "spiders", and pull out product details such as Product ID, Title, Price, Vendor, Description, Image, URL and Stock Position. This information should then be stored in a PostgreSQL database and displayed using web2py/Django. There should also be a way to change the product URLs to affiliate links.
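One way the affiliate-link requirement might be approached is to rewrite each stored product URL when it is displayed. The sketch below is only an illustration: the `AFFILIATE_PARAMS` table, the `aff_id` parameter and the `shop.example.com` host are hypothetical, and real affiliate programmes each define their own link format.

```python
from urllib.parse import parse_qsl, urlencode, urlparse, urlunparse

# Hypothetical per-vendor affiliate parameters (assumption, not from the brief).
AFFILIATE_PARAMS = {
    "shop.example.com": {"aff_id": "12345"},
}


def to_affiliate_url(url: str) -> str:
    """Return url with the vendor's affiliate parameters added, if configured."""
    parts = urlparse(url)
    extra = AFFILIATE_PARAMS.get(parts.netloc.lower())
    if not extra:
        return url  # unknown vendor: leave the link untouched
    # Note: dict() keeps only the last value of a repeated query key.
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    query.update(extra)
    return urlunparse(parts._replace(query=urlencode(query)))


print(to_affiliate_url("https://shop.example.com/item/42?colour=red"))
```

Keeping the original URL in the database and applying the rewrite at display time would let the affiliate settings change without re-scraping.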

This is an easy project for someone who has done this before. If you have examples of previous work, this will go in your favour, so please reference them. Additionally, if you have advice on a better architecture/solution, I am open to ideas.

Expected Features:

a) The Products Table in the server database to be automatically populated by the scraper. The required fields are Product ID, Title, Price, Vendor, Stock Position, Payment Options, Delivery Time

b) Easy extensibility (with some python coding) to add more sites in future.

c) To meet the above, the scraper is to be implemented as two modules: the "Scraper Module" and the "Parameters Module".

d) The "Scraper Module" would do the actual scraping of multiple sites (based on parameters read from the Parameters Module), and also automatically populate the Products Table in the database server. For sites with content rendered in JavaScript, Scrapy to be used with Selenium for effective scraping.

e) The "Parameters Module" would include a form through which scrape parameters can be entered, such as the primary URL, the scraping rules for each field to be scraped, the format of the data to be extracted, and whether to use a simple crawl (for sites without JavaScript) or a complex crawl (for sites with content rendered in JavaScript). These parameters would be stored in a table and read by the "Scraper Module" at run time.

f) The scraped URLs (reached from the primary URL) to be saved in a database table with a "processed" flag, so that they can be skipped if scraping needs to be resumed after an interruption.

g) Primary URLs also to be saved with the date of last successful scraping, to enable scheduling of periodic repeat scrapings.

h) While executing scraping, only those fields that have changed since last scrape are to be extracted and the original table entry for the product to be "updated", as required. In case of new products, the details to be "inserted" as a new row in the Products Table.

i) Scrapy to be used with Selenium for effective scraping of sites with heavy JavaScript content.

j) Performance must be adequate to enable scraping of the sites in order to generate the Products database

k) There should also be a way to change the URLs of the compared products within the website to affiliate links.
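The bookkeeping in items f) and h) could be sketched roughly as follows. This is a minimal illustration, not a proposed design: it uses Python's built-in `sqlite3` with an in-memory database as a stand-in for PostgreSQL, covers only a subset of the product fields, and all table, column and function names are hypothetical.

```python
import sqlite3

FIELDS = ("title", "price", "vendor", "stock")  # subset of fields, for brevity


def open_db() -> sqlite3.Connection:
    """Create an in-memory database with hypothetical table layouts."""
    conn = sqlite3.connect(":memory:")
    conn.row_factory = sqlite3.Row
    conn.executescript("""
        CREATE TABLE products (
            product_id TEXT PRIMARY KEY,
            title TEXT, price REAL, vendor TEXT, stock TEXT
        );
        CREATE TABLE scraped_urls (
            url TEXT PRIMARY KEY,
            primary_url TEXT NOT NULL,
            processed INTEGER NOT NULL DEFAULT 0
        );
    """)
    return conn


def save_product(conn: sqlite3.Connection, product: dict) -> list:
    """Insert a new product, or update only the fields whose values changed.

    Returns the names of the fields that were written.
    """
    row = conn.execute(
        "SELECT * FROM products WHERE product_id = ?", (product["product_id"],)
    ).fetchone()
    if row is None:
        conn.execute(
            "INSERT INTO products (product_id, title, price, vendor, stock) "
            "VALUES (:product_id, :title, :price, :vendor, :stock)",
            product,
        )
        return list(FIELDS)
    changed = [f for f in FIELDS if row[f] != product[f]]
    if changed:
        # Column names come from the fixed FIELDS tuple, never from user input.
        assignments = ", ".join(f"{f} = ?" for f in changed)
        conn.execute(
            f"UPDATE products SET {assignments} WHERE product_id = ?",
            [product[f] for f in changed] + [product["product_id"]],
        )
    return changed


def mark_processed(conn: sqlite3.Connection, url: str) -> None:
    """Set the processed flag so the URL is skipped when a run is resumed."""
    conn.execute("UPDATE scraped_urls SET processed = 1 WHERE url = ?", (url,))


def pending_urls(conn: sqlite3.Connection, primary_url: str) -> list:
    """Return the URLs under primary_url that still need to be scraped."""
    rows = conn.execute(
        "SELECT url FROM scraped_urls WHERE primary_url = ? AND processed = 0 "
        "ORDER BY url",
        (primary_url,),
    )
    return [r["url"] for r in rows]
```

A resumed run would then iterate over `pending_urls(...)` only, calling `save_product` and `mark_processed` for each page. The last-scrape date from item g) is not covered here.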

Expected Skills: Web Scraping, Scrapy, Selenium, Python, Data Mining, JavaScript, MySQL

Budget: USD 200 to USD 300

Skills: Django, MySQL, PostgreSQL, Python, Web Scraping


About the Employer:
(0 reviews) Croydon, United Kingdom

Project ID: #4437217

7 freelancers are bidding an average of $489 for this job

SigmaVisual

I can help in your project, please check PMB and our ratings/reviews to get idea of our experience. Please let me know if you have any queries.

$450 USD in 10 days
(86 reviews)
7.0
srinichal

I can deliver the project

$630 USD in 12 days
(47 reviews)
6.5
pablotorres

I can do it.

$300 USD in 30 days
(46 reviews)
5.0
raul27868

Hello, I can do this work for you and I'm ready to start. Please see pmb for more details. Regards Raul

$250 USD in 10 days
(13 reviews)
4.8
getveltrod

Hi, Veltrod Software Services is a global software consulting company specialized in providing mobile applications, social media frameworks and eCommerce solutions. Leveraging best-in-class people, processes, and t...

$721 USD in 25 days
(1 review)
0.0
KrunkSystems

Please check our PM

$525 USD in 3 days
(0 reviews)
0.0
rasilu

I'm a Python developer and have sound knowledge of Django and Google App Engine. I can deliver the project to a good coding standard and as a quality product.

$550 USD in 12 days
(0 reviews)
0.0