[Chinese NLP] Extract Chinese keywords from Chinese text (Simplified)

I will provide working code which is currently used to extract English keywords from English texts.

The working code does the following:

1. Tokenize the text into sentences.

2. Perform sentiment analysis on each sentence and assign the sentence score to each word.

3. Tokenize the sentence into words.

4. Find POS tags and filter out unwanted words (like Personal Nouns).

5. Lemmatize words.

6. Use masterlist to map words. (More on the masterlist below).

7. Calculate score for each word. Current formula is: square root of word frequency times the maximal positive sentiment times (1-exp(-rank/200)), where rank for word frequency on the internet.

8. Dictionary of dictionaries is returned containing all the extracted information.

The dependencies are:

a) NLTK with corpora and Vader

b) numpy

c) pandas

d) All NLTK dependencies are checked for before running, and downloaded/installed if needed

I need you to tweak the above code so it works with Chinese, e.g. use StanfordSegmenter for tokenizing, etc.

The masterlist (6) is used to map keywords to a main keyword. It uses synonyms to do this. For example, if the word money is found, it is mapped to the word wealth. If the word cash is found, it is also mapped to wealth. I will provide a masterlist for Chinese. You just need to plug it into the existing code.

I will also provide the word ranking list (7).

So I think the main task is just using the Chinese language libraries rather than the English language libraries.

Please test your work before giving it to me.

Any questions, please ask.

Thanks for reading.

Beceriler: Makine Öğrenimi, Doğal Dil, Python

Daha fazlasını gör: web transform simplified chinese traditional chinese, simplified chinese english free translation, extract pictures text pdf files, information extraction chinese, chinese dependency parser, chinese nlp tools, awesome chinese nlp, nltk chinese, chinese nlp python, chinese word segmentation python, named entity recognition chinese, extract words text file, extract keywords sites, keywords text, extract keywords html page, vba code extract email text field, extract keywords text, aspnet extract keywords text, extract keywords text database, vba macro write text simplified chinese

İşveren Hakkında:
( 25 değerlendirme ) Hong Kong, Macau

Proje NO: #18141725

Bu iş için 8 freelancer ortalamada $307 teklif veriyor


Hi, we believe we can take care of your requirements. We have worked on several projects on Python, Django including [login to view URL] which is a stock trading platform. Apart from this , we have worked on other Daha Fazla

$150 USD in 3 gün içinde
(16 Değerlendirme)

Dear Hiring Manager. I always hate a guy who overestimate himself. So I would like to stick to the facts for my ability on development ! With hands-on experience verifying my ability to develop special and kern Daha Fazla

$300 USD in 5 gün içinde
(8 Değerlendirme)

Hello, I'm NLP researcher and masters student in computer science, I would like to work into your project.

$1111 USD in 3 gün içinde
(1 Yorum)

Hello I am a Software Engineer. I specialize to do image processing projects such as Image Segmentation, Face Recognition, Object Detect(Track) and etc. And I am familiar with Tensorflow than Caffee. Of course I Daha Fazla

$300 USD in 3 gün içinde
(6 Değerlendirme)

I have being working in the analytics/data science field for 5+ years now . I am expert in R /Python . Worked on various supervised /Unsupervised techniques like linear/logistic regression , random forest , decision tr Daha Fazla

$222 USD in 5 gün içinde
(4 Değerlendirme)

I have decent experience in the field of natural language processing and have authored six research papers at top-tier conferences. Link to Github: [login to view URL] Link to CV: [login to view URL] I am pr Daha Fazla

$144 USD in 3 gün içinde
(0 Değerlendirme)

I am a native Chinese, and fimiliar with Python, Pandas, Numpy. I am familiar with Chinese NLP tools. This is my first work, so I only want to get a good rate. Thanks!

$30 USD in 5 gün içinde
(0 Değerlendirme)
$200 USD in 10 gün içinde
(0 Değerlendirme)