In Progress

Word/Phrase Tokenizer

Java Tokenizer

In Java, write a tokenizer class that tokenizes a string into words, phrases (greedy style), or other tokens according to the convention used by BreakIterator (i.e. subclass BreakIterator). The return type is List. The dictionary of reference is [url removed, login to view]; initialize/cache memory with phrases (2 words or more) for better performance. Speed is extremely important. Please find the most efficient phrase search algorithm.

Test Input:

I like coffee table!

Test Output:

list("I", " ", "like", " ", "coffee table", "!")

I have attached code written to parse Chinese and found its greedy search algorithm to be usable. However, the code is buggy and does a lot of unwanted processing for Chinese characters. Please recommend a better algorithm in your bid message.
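
For bidders who want something concrete to discuss, here is a minimal sketch of one possible greedy longest-match approach. It is not the attached code and not necessarily what the employer has in mind. It assumes the phrase dictionary has already been loaded into memory as a list of strings, and it wraps java.text.BreakIterator instead of subclassing it, to keep the example short. The names PhraseTokenizer, Node, split and tokenize are hypothetical. Phrases are stored in a trie keyed by lower-cased base tokens, so each lookup step is a single hash-map access.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: greedy phrase tokenizer built on top of BreakIterator.
final class PhraseTokenizer {

    // Trie node: children are keyed by a lower-cased base token
    // (word, whitespace run, or punctuation as produced by BreakIterator).
    private static final class Node {
        final Map<String, Node> next = new HashMap<>();
        boolean terminal; // true if a dictionary phrase ends at this node
    }

    private final Node root = new Node();

    // Assumption: the phrase dictionary is supplied as in-memory strings.
    PhraseTokenizer(Iterable<String> phrases) {
        for (String phrase : phrases) {
            Node n = root;
            for (String tok : split(phrase)) {
                n = n.next.computeIfAbsent(tok.toLowerCase(Locale.ROOT), k -> new Node());
            }
            if (n != root) {
                n.terminal = true;
            }
        }
    }

    // Splits text into base tokens at the boundaries reported by the word BreakIterator.
    private static List<String> split(String text) {
        BreakIterator it = BreakIterator.getWordInstance(Locale.ENGLISH);
        it.setText(text);
        List<String> out = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            out.add(text.substring(start, end));
        }
        return out;
    }

    // At each position, emits the longest dictionary phrase starting there,
    // or the single base token if no phrase matches.
    List<String> tokenize(String text) {
        List<String> base = split(text);
        List<String> out = new ArrayList<>();
        int i = 0;
        while (i < base.size()) {
            Node n = root;
            int bestEnd = -1;
            for (int j = i; j < base.size(); j++) {
                n = n.next.get(base.get(j).toLowerCase(Locale.ROOT));
                if (n == null) {
                    break;
                }
                if (n.terminal) {
                    bestEnd = j + 1; // remember the longest match so far
                }
            }
            if (bestEnd > i) {
                out.add(String.join("", base.subList(i, bestEnd)));
                i = bestEnd;
            } else {
                out.add(base.get(i));
                i++;
            }
        }
        return out;
    }
}
```

With a dictionary containing only "coffee table", calling tokenize("I like coffee table!") on this sketch should reproduce the test output above. Matching is case-insensitive, and the whitespace inside a phrase must match the dictionary entry exactly; both are simplifications that a real implementation may want to revisit.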

Skills: Java

About the Employer:
(0 reviews) Alameda, United States

Project ID: #65205

Awarded to:

SachinBhatt

We can do this for you in a satisfactory manner.

$70 USD in 5 days
(5 reviews)
3.7

9 freelancers are bidding an average of $69 for this job

noiresol

Noiresol offers outstanding value-added professional services with a high degree of quality at very competitive prices. We have creative designers, experienced programmers and business analysts to give cutting edge t…

$70 USD in 1 day
(1 review)
4.0
justgreat

Placeholder bid. I do not understand how it can detect "coffee table" as different from "coffee", "table" unless all the phrases that are to be treated as one word (like this example) are fed to it as a database. Ple…

$100 USD in 2 days
(2 reviews)
2.4
IntSS

Instance Software Solutions (ISS) is a product-based company with rich experience in developing enterprise, distributed and web-based applications using PHP/MySQL/Java/HTML/JavaScript/JSP/Servlets/Struts/EJ…

$70 USD in 7 days
(0 reviews)
0.0
amounir86

I can do this for you very efficiently, but as was already asked: why is "coffee table" not "coffee", " ", "table"?

$50 USD in 5 days
(0 reviews)
0.0
brocker

Hi, I would not use StringTokenizer but a regular expression. Get in touch and we can continue this conversation. Regards, Brocker

$100 USD in 7 days
(0 reviews)
0.0
waswani

I can do this for you the way you want.

$50 USD in 2 days
(0 reviews)
0.0
JohnE70

The tokenizer shouldn't be a problem. I am assuming you are looking at some sort of translator program that breaks the text up into words and then returns a Chinese equivalent for each.

$70 USD in 1 day
(0 reviews)
0.0
harringf

I will code this for you and explain the code over the phone if you reside in the USA or Canada, or over Skype anywhere in the world. Let me know if you would like this to be done fast. Thank you.

$45 USD in 3 days
(0 reviews)
0.0