The short description of this project is:
Given a word or multi-word concept and a language, download the corresponding Wikipedia article in that language, if it exists.
Convert the download from HTML into plain text. Return an object with that text.
I recognize that a number of issues need clarification, such as:
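A minimal sketch of the download-and-convert step, assuming Java 11 or later. The class name WikiText, its method names, and the User-Agent string are hypothetical, and the regex-based tag stripping is a rough stand-in for a real HTML parser, so treat it as an illustration only.

```java
import java.io.IOException;
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;

final class WikiText {

    /** Builds the article URL for a language code ("en", "de") and a search term. */
    static String articleUrl(String lang, String term) {
        // Wikipedia titles use underscores where the search term has spaces.
        String title = term.trim().replace(' ', '_');
        // Percent-encode everything else (umlauts, parentheses, ...) as UTF-8.
        String encoded = URLEncoder.encode(title, StandardCharsets.UTF_8);
        return "https://" + lang + ".wikipedia.org/wiki/" + encoded;
    }

    /** Downloads the raw HTML of a page (performs a network request). */
    static String fetch(String url) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newBuilder()
                .followRedirects(HttpClient.Redirect.NORMAL)
                .build();
        HttpRequest request = HttpRequest.newBuilder(URI.create(url))
                .header("User-Agent", "WikiTextFetcher/0.1 (example)") // hypothetical value
                .GET()
                .build();
        return client.send(request, HttpResponse.BodyHandlers.ofString()).body();
    }

    /** Naive HTML-to-text conversion: drops scripts, styles and tags, decodes a few entities. */
    static String htmlToPlainText(String html) {
        String s = html.replaceAll("(?is)<(script|style)[^>]*>.*?</\\1>", " ");
        s = s.replaceAll("(?s)<[^>]+>", " ");
        s = s.replace("&nbsp;", " ")
             .replace("&lt;", "<")
             .replace("&gt;", ">")
             .replace("&quot;", "\"")
             .replace("&amp;", "&"); // decoded last to avoid double decoding
        return s.replaceAll("\\s+", " ").trim();
    }
}
```

A caller would chain the three steps, for example WikiText.htmlToPlainText(WikiText.fetch(WikiText.articleUrl("de", "Köln"))), and wrap the result in whatever return object the final design settles on.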
- what counts as a language?
- what counts as a valid search term?
- how to deal with various character sets?
In order to cut this short, you may allow for only two languages: English and German.
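One way to pin down the two-language restriction is a small enum. This is only a sketch: the name WikiLanguage, the accepted spellings, and the parse method are assumptions, not part of the requirements.

```java
import java.util.Locale;
import java.util.Optional;

enum WikiLanguage {
    ENGLISH("en"),
    GERMAN("de");

    final String code;

    WikiLanguage(String code) {
        this.code = code;
    }

    /** Host name of the Wikipedia edition for this language. */
    String host() {
        return code + ".wikipedia.org";
    }

    /** Accepts a language code or name; returns empty for unsupported input. */
    static Optional<WikiLanguage> parse(String input) {
        String s = input.trim().toLowerCase(Locale.ROOT);
        switch (s) {
            case "en":
            case "english":
                return Optional.of(ENGLISH);
            case "de":
            case "german":
            case "deutsch":
                return Optional.of(GERMAN);
            default:
                return Optional.empty();
        }
    }
}
```

Returning an empty Optional for anything else keeps the "what counts as a language?" question out of the rest of the program.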
A valid search term is any term for which Wikipedia directs you to a final site or to a site that presents several options. If there are several options, visit all of them, download them, and return several objects with the corresponding texts.
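The "several options" case could be handled by collecting the article links from the options page and then downloading each one. The sketch below covers only the link collection. The class name DisambiguationLinks is hypothetical, and the pattern assumes that article links look like href="/wiki/Title", with namespace pages (such as Help:) recognisable by a colon. A real options page also contains navigation links, so a working version would have to restrict the search to the content area.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class DisambiguationLinks {

    // Matches internal article links; skips anchors (#) and namespaced pages (:).
    private static final Pattern ARTICLE_LINK =
            Pattern.compile("href=\"(/wiki/[^\"#:]+)\"");

    /** Returns the absolute URLs of all distinct article links, in page order. */
    static List<String> extract(String html, String lang) {
        Set<String> urls = new LinkedHashSet<>(); // removes duplicates, keeps order
        Matcher m = ARTICLE_LINK.matcher(html);
        while (m.find()) {
            urls.add("https://" + lang + ".wikipedia.org" + m.group(1));
        }
        return new ArrayList<>(urls);
    }
}
```

Each returned URL can then be downloaded and converted in the same way as a single article, giving one result object per option.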
Bids promising somewhat different end-products are welcome! You may have something similar in store already, or you may have thought more deeply about some of these issues.
The program must be written in Java.
22 freelancers are bidding an average of $169 for this job
Hi, I have recently been working on the Salesforce platform and have solid knowledge of HTML, Oracle 9i database, JSP, Java, JS, and Dreamweaver. I have also worked on 3 or 4 site implementations.