Parse 27 text or 27 html files (the text and html files are exactly the same in content; difference is formatting) into comma-separated values (CSV) files that are flat and ready for import into any database like PostgreSQL. The files I have contain newspaper articles from a database, with various components (date, title, publication, abstract, full text of the article, etc.). I need these components read into fields, and I need all the text cleaned (remove all non-ascii characters, escape quotes and other characters that may trow off a simple CSV import, etc.), and recorded as comma-separated values, with one article per line (including the full text as one field).
This should be a 30-minute job for anyone with parsing/scripting skills, because the files are well-formatted and standardized, just not in the format I need.
I attach two of the files as an example.
OK! I can finish this in an hour. So, please give me this work. I will start it immediately. Thanks.
Bu iş için 5 freelancer ortalamada $36 teklif veriyor
I can do it, I love programming. I have extensive knowledge of python programming language. If trust me you will not regret.