In this assignment, you are a given a dataset of approximately 20,000 news documents collected from a set of newsgroups (mailing lists). The set of documents (email messages) is partitioned almost evenly across 20 different topics such as sport, electronics, politics, etc. The documents of each newsgroup are stored in one directory. Each news document is stored in a text file in a semi-structured format.
Bu iş için 6 freelancer ortalamada $37 teklif veriyor
Hello, I m Tahsinul Alam, completed Masters in Software Engineering now working as one of the CEO at Workspace Infotech Ltd, software firm located in Dhaka,Bangladesh. Relevant Skills and Experience Big Data, Java P Daha Fazla
hi. i can do it Relevant Skills and Experience i know java and done some data processing with it before Proposed Milestones $30 USD - program is it to be done in plain java or some big data frameworks should be used Daha Fazla
professional experience of more then 3 years. I can help you