Closed

Text Corpus software

The goal is to build a database of lexical words from a corpus of texts divided into text types, and to rate each word by its probability of occurring within each text type.

This involves storing text files, counting each lexical word across the whole corpus and within each text type (novel, newspaper article, etc.), and then either reporting the text type in which the word is most likely to occur or giving its probability of occurrence in each text type.
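The counting step described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not a specification: the function names (`tokenize`, `build_counts`, `type_probabilities`), the tiny `STOPWORDS` set standing in for a real function-word list, and the sample texts are all hypothetical. It also assumes every text type is equally likely a priori, so that a large text type does not dominate simply because it contains more words.

```python
import re
from collections import Counter, defaultdict

# Hypothetical stand-in for a real function-word list.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "it"}

def tokenize(text):
    """Lowercase the text and return its lexical (non-stopword) tokens."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]

def build_counts(documents):
    """documents: iterable of (text_type, text) pairs.
    Returns per-type word counts and per-type total token counts."""
    word_counts = defaultdict(Counter)
    totals = Counter()
    for text_type, text in documents:
        tokens = tokenize(text)
        word_counts[text_type].update(tokens)
        totals[text_type] += len(tokens)
    return word_counts, totals

def type_probabilities(word, word_counts, totals):
    """Estimate P(text type | word), assuming equal prior weight per text type."""
    # Relative frequency of the word inside each text type.
    rates = {t: word_counts[t][word] / totals[t] for t in totals if totals[t]}
    norm = sum(rates.values())
    if norm == 0:
        return {}  # word never seen in the corpus
    return {t: r / norm for t, r in rates.items()}

# Made-up sample data, for illustration only.
docs = [
    ("novel", "The dragon slept in the cave"),
    ("newspaper", "The minister resigned and the minister apologised"),
    ("newspaper", "A dragon of a budget"),
]
word_counts, totals = build_counts(docs)
probs = type_probabilities("dragon", word_counts, totals)
most_likely = max(probs, key=probs.get)
```

Here `probs` holds one probability per text type for the word "dragon", and `most_likely` is the text type with the highest value. For a corpus of 10 to 100 million words the counts would more plausibly live in a database than in memory, but the arithmetic would be the same.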

The text corpus will initially be about 10 million words, but will grow to about 100 million, so only programs which are very fast will be useful.

Results need to be represented numerically and, if possible, graphically. Programmers need to be highly numerate, preferably with a reasonable (intermediate-level) knowledge of statistics.

Phase 2: the user inputs a text and the text is 'typed' according to the probability of each word in the text occurring in a particular text type.
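One common way to 'type' a text from per-word probabilities is a naive Bayes style score: sum the log probability of each word under every text type and pick the highest total. The sketch below is an assumption about how Phase 2 might work, not a method stated in this brief. The names `train` and `classify` and the sample texts are hypothetical, equal priors are assumed for all text types, add-one smoothing is used so that unseen words do not rule a type out, and function words are not filtered in order to keep the example short.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())

def train(documents):
    """documents: iterable of (text_type, text) pairs.
    Returns per-type word Counters and the overall vocabulary."""
    counts = {}
    for text_type, text in documents:
        counts.setdefault(text_type, Counter()).update(tokenize(text))
    vocab = {w for c in counts.values() for w in c}
    return counts, vocab

def classify(text, counts, vocab):
    """Return (best_type, scores); scores are summed log word probabilities."""
    tokens = tokenize(text)
    scores = {}
    for text_type, c in counts.items():
        total = sum(c.values())
        # Add-one smoothing; the extra +1 reserves mass for unknown words.
        denom = total + len(vocab) + 1
        scores[text_type] = sum(math.log((c[w] + 1) / denom) for w in tokens)
    return max(scores, key=scores.get), scores

# Made-up sample data, for illustration only.
corpus = [
    ("novel", "the dragon slept in the dark cave"),
    ("newspaper", "the minister resigned after the budget vote"),
]
counts, vocab = train(corpus)
label, scores = classify("the dragon left the cave", counts, vocab)
```

With this toy corpus, `label` comes out as "novel" because "dragon" and "cave" appear only in the novel text. Working in log space avoids numerical underflow when the input text is long.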

Phase 3: other classifications will eventually be important, e.g. the age, gender, and social background of a text's author. We will eventually need to profile each corpus author and then use the corpus to estimate the profile of the author of the user's text.

It would be ideal if the program could be both web-based and run on a user's PC.

You will need an FTP address to which I can upload the corpus of texts.


About the Employer:
(0 reviews) Shrewsbury, United Kingdom

Project ID: #22451

5 freelancers are bidding an average of $560 for this job

nidle

Dear Sirs, we are extremely interested in the project you proposed. We are an IT company specializing in web technologies and programming. Our specialists are ready to start working on the required software straigh…

$1000 USD in 45 days
(7 reviews)
1.6
gawab

I will be glad to work with you.

$300 USD in 10 days
(1 review)
0.0
donghuayi

I have a Master's degree in computational linguistics from the University of Southern California. I have done several research projects just like yours; not only were we able to place words with syntactical features, but we…

$500 USD in 10 days
(0 reviews)
0.0
ivt84

Dear Sir, I am very glad to read your requirements. I assure you that we can do this job with 100% perfection. Please believe us: we will give you real quality work for the cheapest prices. You can read our past…

$599 USD in 30 days
(1 review)
0.0
Randeep

We are a group of highly educated professionals with specializations in various fields. Although we are new to elance.com, we have done such projects before and we are confident of doing your project. We will always be…

$400 USD in 30 days
(2 reviews)
4.3