The task is to find dynamically generated PDF documents yourself and extracting their metadata. It's very easy and straightforward task without needing any special skills.
Just like most of the Word documents are made using Microsoft Word, most of the PDF documents are made using Adobe products. However these are mostly static documents like, books, brochures. Another example of a static document is that people sometimes convert their document from Word to PDF. In our study, we are NOT interested in these kind of documents.
The software used to create a PDF document is mentioned in the properties of the document (called Producer Line). What I want you to do in this task is to check the producer line of the PDF documents that you receive in your emails. I say emails, because nowadays companies send PDF documents via email and these are NOT static documents. They generate same looking document for every customer but with different data in it. These are the documents that I am interested in. Example document types might be, invoices, telephone bills, subscription documents, personal letters, quotes, certificates etc.
Note: I am not interested in any kind of personal data. Just to give an analogy here: you have your house and your house is made out of bricks. I ask you to check the brand of the bricks that the constructor used when building your house.
So, in short, you need to:
1. Check your emails containing PDF documents (in gmail you can search for "filename:pdf")
2. Download the PDF document to your PC.
3. Open the PDF document using Adobe Acrobat Reader (or whatever reader you use)
4. From the menu, select File -> Properties (or Ctrl + D)
5. Copy the full "PDF Producer" line
An example producer line can be:
"Adobe PDF Library 15.0"
"Adobe LiveCycle PDFG ES; modified using iText® 5.5.6 ©2000-2015 iText Group NV (AGPL-version)"
Your report should be a simple excel file that contains
- What kind of document is this? (for example credit card statement document, telephone bill, student score report, boarding pass etc.)
- The company that created the document (for example Wells Fargo Bank, or the name of the insurance company)
- The PDF Producer (as explained above)
You will be paid by the number of documents that are unique only. "Unique" means, different banks, different insurance companies, different airline companies, basically any company sending out PDF documents to people. Unique also means different document type in companies, for example bank statement and mortgage statement are two different document type, so these are counted as unique as well, even though they might be from the same bank. You will be paid for every unique PDF producer of document you present.
I have attached an example screenshot of the producer line of a document.
Bu iş için 11 freelancer ortalamada $125 teklif veriyor
hello, many projects on similiar subjects, u can check my profile for details, milestones will be discussed according to the requirements hope to get message from u . thanks & BR
Hi, I read your project description and I feel I may be the right person for this project but how can I find pdf documents. You say they sent be email, but how they would know my email. Could clarify more?
Hi there, Let's do this right now. I am an Algorithm Master. We can discuss more regarding the given project in the inbox. So, I will wait to hear from you. Thanks. Stay home, stay safe, and take care.
I am interested to join your work Pls give me opportunity to prove myself. I will complete your work at on time. Kindly consider me on your work