I require a small piece of code to be written to the the following.
1) crawl a website based on a single domain variable.
2) store all the unique urls in a file CSV, SQL db (your pref)
2) take every web page on this site and calculate it's total page size including all objects.
e.g. include all html, imnages, swf, etc and all includes e.g. css and js.
you may use any language you feel appropriate, but it must be fast and light.
source code will be required for future mods.
pleas esubmit a basic description of how you would do this and a time scale for completion.
Bu iş için 14 freelancer ortalamada $180 teklif veriyor
I'm experienced Java programmer. I can write this application for you. Best regards, Sergey Rylkov.
Dear sir, check the PM for details.. i'm good in creating such kind of softwares like spiders/crawlers... have done such kind of software before... check the PM for details..
I participated a web crawler project in my summer internship. I can do it.
For a single domain spider with the additional features, plus a light-weight tool and rapid scan, I would read the html, java script and style sheets entirely (since all of them may contain links), and try to read the Daha Fazla