We need to extract all product information from an e-commerce store.
Log into the site and navigate through all the categories until you are certain that you have reached every product in the store.
Once all products are identified (and fetched), extract for each one its sku (stock-keeping unit), the html of the product's page, and any image(s) associated with that product.
We do not need to categorize each element from the product. The html page will work for us now (later we will see what kind of database model would meet our needs and eventually modify the code accordingly).
The images shall be stored in an 'images' folder, each named with the fetched sku followed by an underscore and the image's original file name.
The sku and the html shall be stored in a database with columns: `sku` (index) and `html`.
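The naming rule above could be sketched as follows. This is only an illustration: the function name `image_path` and the choice to take the original name from the last segment of the image URL are assumptions, not part of the brief.

```python
import os
from urllib.parse import urlparse, unquote


def image_path(sku: str, image_url: str, folder: str = "images") -> str:
    """Build the local path '<folder>/<sku>_<original name>' for an image.

    Assumption: the "original name" is the last path segment of the image
    URL, with any query string dropped and percent-escapes decoded.
    """
    original_name = os.path.basename(unquote(urlparse(image_url).path))
    return os.path.join(folder, f"{sku}_{original_name}")
```

For example, a product with sku `ABC123` and an image served from `/media/foto.jpg` would be saved as `ABC123_foto.jpg` inside the 'images' folder.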
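A minimal sketch of that table, assuming SQLite as the database (the brief does not name one) and treating "index" as a primary key on `sku`. The table name `products` and the helper names are hypothetical.

```python
import sqlite3


def open_store(path: str = "products.db") -> sqlite3.Connection:
    """Open the database and create the two-column table if it is missing."""
    conn = sqlite3.connect(path)
    # Assumption: "sku (index)" is read as a primary key, so each sku is unique.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS products ("
        "sku TEXT PRIMARY KEY, html TEXT NOT NULL)"
    )
    return conn


def save_product(conn: sqlite3.Connection, sku: str, html: str) -> None:
    """Insert a product page, overwriting the html if the sku already exists."""
    conn.execute(
        "INSERT OR REPLACE INTO products (sku, html) VALUES (?, ?)",
        (sku, html),
    )
    conn.commit()
```

Overwriting on a repeated sku keeps a re-fetch from failing on duplicates; if older versions of a page must be kept, a different schema would be needed.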
RESUMING LAST FETCH
When the script runs, it should first check whether the previous run ended correctly. If it did not (connection lost, maximum execution time reached, etc.), the script should resume the fetching process from where it stopped rather than start over.
The script should also check whether XX hours have passed since the last execution. If not, the script shall stop and display a warning message saying 'It's been YY hours since last fetch.' together with a 'Force fetch' link that runs the fetching process anyway.
Please refer to the attached images if you need further clarification.
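One possible way to make a run resumable is to record progress in a small state file after every product. The sketch below assumes a JSON file; the file name `fetch_state.json`, the `finished` and `done` keys, and the `fetch_one` callback are all hypothetical names chosen for illustration.

```python
import json
import os

STATE_FILE = "fetch_state.json"  # hypothetical file name


def load_state(path: str = STATE_FILE) -> dict:
    """Return the saved progress, or a 'finished' state if no file exists."""
    if os.path.exists(path):
        with open(path, encoding="utf-8") as f:
            return json.load(f)
    return {"finished": True, "done": []}


def save_state(state: dict, path: str = STATE_FILE) -> None:
    """Write the state via a temp file so a crash cannot leave it half-written."""
    tmp = path + ".tmp"
    with open(tmp, "w", encoding="utf-8") as f:
        json.dump(state, f)
    os.replace(tmp, path)


def run(product_urls, fetch_one, path: str = STATE_FILE) -> None:
    """Fetch every URL, skipping those completed by an interrupted run."""
    state = load_state(path)
    if state["finished"]:
        # The previous run ended correctly (or never happened): start fresh.
        state = {"finished": False, "done": []}
    done = set(state["done"])
    for url in product_urls:
        if url in done:
            continue  # already fetched before the interruption
        fetch_one(url)
        done.add(url)
        state["done"] = sorted(done)
        save_state(state, path)
    state["finished"] = True
    save_state(state, path)
```

If `fetch_one` raises partway through, the file still says `finished: false` and lists the completed URLs, so the next call to `run` picks up at the first URL that is not yet done.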
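The interval check could look roughly like this. The function name `check_interval` and its parameters are assumptions; `min_hours` stands in for the XX value, and `force` represents what the 'Force fetch' link would set (for instance through a query parameter, which is also an assumption).

```python
from datetime import datetime, timedelta
from typing import Optional, Tuple


def check_interval(
    last_fetch: Optional[datetime],
    now: datetime,
    min_hours: float,
    force: bool = False,
) -> Tuple[bool, str]:
    """Decide whether a new fetch may start.

    Returns (allowed, message). The message is empty when the fetch is
    allowed, otherwise it is the warning text to show next to the
    'Force fetch' link.
    """
    if force or last_fetch is None:
        return True, ""
    elapsed = now - last_fetch
    if elapsed >= timedelta(hours=min_hours):
        return True, ""
    hours = elapsed.total_seconds() / 3600
    return False, f"It's been {hours:.1f} hours since last fetch."
```

The caller would render the returned message and the link when `allowed` is false, and call the function again with `force=True` when the link is followed.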
Note: Pages will be in Spanish. If you need help translating any part of it, just let me know.
18 freelancers are bidding an average of $163 for this job
I can write such a script in Perl. I have sound experience developing such projects. If you are interested, contact me. The project can be done in 5-10 days.