OCR Linux Script Modification

Adapt an OCR script (WeOCR) with spellcheck (Aspell) so that the spellcheck works with Japanese. At the moment, my site works with European languages.

There is no Japanese Aspell dictionary, so one will need to be developed.

The basic process is to insert a space at every boundary between kanji (Unicode range U+4E00–U+9FBF) and other characters. This keeps runs of kanji together in blocks and runs of other characters in separate blocks, which works around the fact that Japanese is written without spaces.
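
A minimal Python sketch of this step, assuming the OCR output is available as a Unicode string. The function name `segment` is hypothetical and not part of WeOCR, NHOCR or Aspell; the kanji range is the one given above.

```python
import re

# Kanji range as stated in the description: U+4E00 to U+9FBF.
KANJI = "\u4e00-\u9fbf"

# Zero-width positions where a kanji meets a non-kanji, non-space character
# (in either order). Existing whitespace is left alone.
_BOUNDARY = re.compile(
    rf"(?<=[{KANJI}])(?=[^{KANJI}\s])|(?<=[^{KANJI}\s])(?=[{KANJI}])"
)

def segment(text: str) -> str:
    """Insert one space at each kanji / non-kanji boundary."""
    return _BOUNDARY.sub(" ", text)

print(segment("私は日本語を勉強します"))
```

Each run of kanji and each run of other characters then becomes its own space-separated block, which is the form a word-based spellchecker expects.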

When the user removes the space between a kanji block and the characters connected to it, the search can look for the resulting strings of kanji and other characters.
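
One possible way to find those user-joined strings is sketched below in Python. It is an assumption about how the search could work, not a description of the existing script: the helper name `joined_tokens` is hypothetical, and it simply counts space-separated tokens that contain both kanji and other characters.

```python
import re
from collections import Counter

# Kanji range as stated in the description: U+4E00 to U+9FBF.
_HAS_KANJI = re.compile("[\u4e00-\u9fbf]")
_HAS_OTHER = re.compile("[^\u4e00-\u9fbf]")

def joined_tokens(text: str) -> Counter:
    """Count tokens in which the user has joined kanji to other characters."""
    tokens = text.split()  # splits on any whitespace
    return Counter(
        t for t in tokens
        if _HAS_KANJI.search(t) and _HAS_OTHER.search(t)
    )

edited = "私は 日本語 を 勉強します 勉強します"
for token, count in joined_tokens(edited).items():
    print(token, count)
```

Tokens collected this way could serve as candidate entries for the dictionary that has to be developed.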

Once editing has finished, the spaces can be removed in a batch.
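
The batch removal could look like the following Python sketch. The function name `unsegment` is hypothetical, and the choice to strip only single spaces that sit on a kanji / non-kanji boundary is an assumption, made so that ordinary spaces in non-Japanese text survive.

```python
import re

# Kanji range as stated in the description: U+4E00 to U+9FBF.
KANJI = "\u4e00-\u9fbf"

# A single space with a kanji on one side and a non-kanji,
# non-space character on the other.
_INSERTED = re.compile(
    rf"(?<=[{KANJI}]) (?=[^{KANJI}\s])|(?<=[^{KANJI}\s]) (?=[{KANJI}])"
)

def unsegment(text: str) -> str:
    """Remove the spaces that sit on kanji / non-kanji boundaries."""
    return _INSERTED.sub("", text)

print(unsegment("私 は 日本語 を 勉強 します"))
```

A space between two kanji, or between two non-kanji characters, is left in place by this sketch; if those should go as well, the pattern would need to be widened.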

Code used:

Complete zip package can be found at [url removed, login to view]

Other sources are Aspell, NHOCR and WeOCR.

Skills: HTML, Linux, Website Design

About the Employer:
(0 reviews) Kawasaki, Japan

Project ID: #748075