Parallel Python Code That Counts How Many Websites Have Canvas

I need a simple Python script that scrapes a list of websites in a csv file (e.g. top 500,000 Alexa sites attached), and checks if the website uses Canvas in the HTML (by checking for "<Canvas>") or in JavaScript (by checking for "createElement("canvas")" or "createElement('canvas')"). The code should output the number and percentage of websites using Canvas out of the list.

It is recommended that the code uses the Python Libraries “Requests” and/or "BeautifulSoup4" with a similar logic as the one I started writing (attached). The following points need to be satisfied:

• The code uses parallel computing for efficiency, so it doesn't run for so long

• The http header has to look like it came from a real browser, so websites don't block it

• The reading time of a website should not exceed 30 seconds, and should time out if no response for 30 seconds and go to the next website

• The script needs to count and print the number of successfully read and unread sites from the csv file of top sites (as the one I am attaching does for the unread). The unread sites could be because a website is no longer available or responsive, or any other reason

• The script needs to handle errors and doesn't crash

• The script has to print the duration of execution (how many hours, minutes or seconds)

• The script has to print the number and percentage of sites containing Canvas either in the HTML source code or JavaScript

It would be great if we can have a version that is not parallel to compare the performance, but not super important

Beceriler: Javascript, Python, Web Scraping

Daha fazlasını gör: python parallel processing example, illumina canvas, python parallel for loop multiprocessing, parallel python vs multiprocessing, python joblib parallel, how to parallelize python code, canvas: versatile and scalable detection of copy number variants, parallel programming with python pdf, need a python code that does, how to hire developer for python code in india, how many websites in lebanon, how many trusted freelance websites in india, how many point that are needed for graphic design, how many ecommerce websites in india, how many bloggers are there on freelance websites, designers that knows how to write content for websites, how many websites developed using java, how many shopify websites can you have, search word file python code, telit send gps python code

İşveren Hakkında:
( 0 değerlendirme ) Khobar, Saudi Arabia

Proje NO: #15614099



I am a python expert and i can do your work. i can start immediately. and complete your work on time.

1 gün içinde %selectedBids___i_sum_sub_4%%project_currencyDetails_sign_sub_5% USD
(32 Değerlendirme)

Bu iş için 2 freelancer ortalamada $38 teklif veriyor


Hello. After reviewing your post, I am very interested in that due to my experience. I’d like to be considered for your project position. Whether you need to satisfy above point Relevant Skills and Experience I can m Daha Fazla

1 gün içinde %bids___i_sum_sub_32%%project_currencyDetails_sign_sub_33% USD
(2 Değerlendirme)