Query the United States Patent and Trademark Office website for all patents that reference a particular patent number that I’ll provide. (This process is very straightforward and takes just seconds; I can provide full instructions.) The resulting list includes 1,314 results with 50 hits per page. Each hit is linked to the full text document for a specific patent.
What I need:
1) Someone to download the html code from the full text document for each referencing patent (i.e., each of the 1,314).
2) Once these pages of plain text html code are in hand, someone to parse the results into fields in an Excel file. There will be about 15 fields. Four of these fields (inventor, inventor location, patents referenced, and other references) will have up to 30 individual entries. I can provide full details on the specific fields that I need for each patent and guidance on the unique text that can be used as markers for finding each field within the full text document.
The deliverable is:
1) An Excel file with each of the 1314 results in its own row. The columns would be the specific fields scraped and parsed from the full text documents.
2) The code you used to do this. It must be well commented.
25 freelancers are bidding on average $71 for this job
Hi, I would like to write a little demo for you, I will do it in PHP, could you posible show me source link from you want to get content? Regards CruzDelSur
Dear Sir, We have relevant experience. Please see the PMB for complete description about this project. Here is our place holding bid for this project. Best Regards, Nadeem
Hi sir, I have pretty good knowledge and experience in parsing and validating documents of [url removed, login to view] html will be very much easire and faster in [url removed, login to view] forward to hear from u to start up this project