In Progress

Parsing Pages of an Old HTML Site into a Database

[url removed, login to view] is a site we are moving from static HTML to a database-driven site. I need all the essentials parsed out logically into a spreadsheet: the standard items such as title tags, meta tags, image names, alt text, and descriptions.

The site is 15 years old and several different developers have worked on it, so not every page has the same layout. 95% of the pages are product pages, and those are the ones I need parsed.
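
To make the request concrete, here is a minimal sketch in Python of the kind of extraction described above. It relies only on tags that do not depend on page layout (the title tag, named meta tags, and img attributes) and writes one spreadsheet row per page. The names `PageFieldParser`, `extract_row`, and `write_csv`, the column names, and the choice to take the description from the meta description tag are all assumptions for illustration; a description that lives in the page body would need per-layout rules that this sketch does not cover.

```python
import csv
from html.parser import HTMLParser
from pathlib import PurePosixPath
from urllib.parse import urlsplit

# Hypothetical column layout for the spreadsheet; adjust to the real requirements.
COLUMNS = ["title", "description", "keywords", "image_names", "alt_texts"]


class PageFieldParser(HTMLParser):
    """Collects the title, named meta tags, and image name/alt pairs from one page."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title = ""
        self.meta = {}      # meta name (lowercased) -> content
        self.images = []    # list of (file name, alt text)
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser already lowercases tag and attribute names.
        attrs = {name: (value or "") for name, value in attrs}
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and "name" in attrs:
            self.meta[attrs["name"].lower()] = attrs.get("content", "").strip()
        elif tag == "img" and attrs.get("src"):
            # Keep only the file name, dropping directories and query strings.
            name = PurePosixPath(urlsplit(attrs["src"]).path).name
            self.images.append((name, attrs.get("alt", "").strip()))

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_row(html):
    """Return one spreadsheet row (a dict keyed by COLUMNS) for a page's HTML."""
    parser = PageFieldParser()
    parser.feed(html)
    parser.close()
    return {
        "title": parser.title.strip(),
        # Assumption: "description" means the meta description tag.
        "description": parser.meta.get("description", ""),
        "keywords": parser.meta.get("keywords", ""),
        "image_names": " | ".join(name for name, _ in parser.images),
        "alt_texts": " | ".join(alt for _, alt in parser.images),
    }


def write_csv(rows, fileobj):
    """Write the rows to an open text file object as CSV with a header line."""
    writer = csv.DictWriter(fileobj, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(rows)
```

Calling `extract_row` on each page's HTML and passing the collected rows to `write_csv` with a file opened using `newline=""` would produce a CSV that any spreadsheet program can open. Pages whose layout hides the product description in the body would leave that column empty and need a manual or layout-specific pass.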

Please see the Project Clarification Board for updated information and details.

Skills: HTML, Perl, Python, Web Scraping, XML

About the Employer:
( 62 reviews ) Roseville, United States

Project ID: #743168