[Chinese NLP] Extract Chinese keywords from Chinese text (Simplified)

I will provide working code which is currently used to extract English keywords from English texts.

The working code does the following:

1. Tokenize the text into sentences.

2. Perform sentiment analysis on each sentence and assign the sentence score to each word.

3. Tokenize the sentence into words.

4. Find POS tags and filter out unwanted words (like Personal Nouns).

5. Lemmatize words.

6. Use masterlist to map words. (More on the masterlist below).

7. Calculate a score for each word. The current formula is: square root of word frequency times the maximal positive sentiment times (1 - exp(-rank/200)), where rank is the word's frequency rank on the internet.

8. Return a dictionary of dictionaries containing all the extracted information.
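
Steps 2, 7 and 8 could be sketched roughly as follows. This is only an illustration, not the existing code: the function names, the dictionary keys and the default_rank fallback are hypothetical, and it assumes the square root applies to the word frequency alone and that rank 1 means the most frequent word.

```python
import math
from collections import defaultdict


def keyword_score(frequency, max_positive, rank, scale=200.0):
    """sqrt(frequency) * max positive sentiment * (1 - exp(-rank / scale))."""
    return math.sqrt(frequency) * max_positive * (1.0 - math.exp(-rank / scale))


def score_words(scored_sentences, ranks, default_rank=10000):
    """Build a dictionary of dictionaries, one entry per word.

    scored_sentences: iterable of (tokens, positive_sentiment) pairs whose
    tokens were already filtered, lemmatized and mapped (steps 3 to 6).
    ranks: mapping from word to its frequency rank on the internet.
    default_rank: assumed fallback for words missing from the ranking list.
    """
    stats = defaultdict(lambda: {"frequency": 0, "max_positive": 0.0})
    for tokens, positive in scored_sentences:
        for word in tokens:
            entry = stats[word]
            entry["frequency"] += 1
            # Every word inherits the score of the sentence it appears in.
            entry["max_positive"] = max(entry["max_positive"], positive)

    result = {}
    for word, entry in stats.items():
        rank = ranks.get(word, default_rank)
        result[word] = {
            "frequency": entry["frequency"],
            "max_positive": entry["max_positive"],
            "rank": rank,
            "score": keyword_score(entry["frequency"], entry["max_positive"], rank),
        }
    return result
```

With this reading of the formula, a word at rank 0 scores zero, and the rank factor approaches 1 for rare words.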

The dependencies are:

a) NLTK with corpora and Vader

b) numpy

c) pandas

d) All NLTK dependencies are checked for before running, and downloaded/installed if needed
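
The check in (d) might look something like the sketch below. It relies on nltk.data.find raising LookupError for a missing resource and on nltk.download to fetch it. The helper name and the find/download parameters are hypothetical and exist only so the check can be tried without network access.

```python
def ensure_nltk_resources(resources, find=None, download=None):
    """Download every NLTK resource that is not already installed.

    resources: mapping from download id (e.g. "vader_lexicon") to the path
    used for the lookup (e.g. "sentiment/vader_lexicon.zip").
    find / download: default to nltk.data.find and nltk.download.
    Returns the list of ids that had to be downloaded.
    """
    if find is None or download is None:
        import nltk  # imported lazily so the helper can be tested without NLTK

        find = find or nltk.data.find
        download = download or nltk.download

    fetched = []
    for package, path in resources.items():
        try:
            find(path)
        except LookupError:
            # Resource is missing locally, so fetch it before running.
            download(package)
            fetched.append(package)
    return fetched
```

A call such as ensure_nltk_resources({"punkt": "tokenizers/punkt", "vader_lexicon": "sentiment/vader_lexicon.zip"}) would then run once at startup.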

I need you to tweak the above code so it works with Chinese, e.g. use StanfordSegmenter for tokenizing, etc.
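
To show why tokenizing is the part that changes most, here is a small stand-in for steps 1 and 3 on Chinese text. It is not StanfordSegmenter: it splits sentences on the marks 。！？ and segments words by forward maximum matching against a toy vocabulary, which is an assumption made purely for illustration. A trained segmenter would replace the segment function.

```python
import re


def split_sentences(text):
    """Split Chinese text into sentences ending in 。, ！ or ？."""
    parts = re.findall(r"[^。！？]+[。！？]?", text)
    return [part.strip() for part in parts if part.strip()]


def segment(sentence, vocabulary, max_len=4):
    """Forward maximum matching: take the longest known word at each position.

    Characters that start no known word (including punctuation) are emitted
    as single-character tokens.
    """
    words = []
    i = 0
    while i < len(sentence):
        for size in range(min(max_len, len(sentence) - i), 0, -1):
            candidate = sentence[i:i + size]
            if size == 1 or candidate in vocabulary:
                words.append(candidate)
                i += size
                break
    return words
```

Chinese has no spaces between words, so the English word tokenizer cannot simply be reused; the rest of the pipeline only needs a list of tokens per sentence.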

The masterlist (6) is used to map keywords to a main keyword. It uses synonyms to do this. For example, if the word money is found, it is mapped to the word wealth. If the word cash is found, it is also mapped to wealth. I will provide a masterlist for Chinese. You just need to plug it into the existing code.
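
The mapping could work along these lines. The masterlist format is an assumption here (a dictionary from main keyword to its synonyms), as are the function names and the choice to leave unlisted words unchanged.

```python
def build_lookup(masterlist):
    """Invert {main keyword: [synonyms]} into {word: main keyword}."""
    lookup = {}
    for main, synonyms in masterlist.items():
        lookup[main] = main
        for synonym in synonyms:
            lookup[synonym] = main
    return lookup


def map_keywords(words, lookup):
    """Replace each word by its main keyword; unlisted words are kept as-is."""
    return [lookup.get(word, word) for word in words]


# Example taken from the description: money and cash both map to wealth.
lookup = build_lookup({"wealth": ["money", "cash"]})
mapped = map_keywords(["money", "cash", "health"], lookup)
```

A Chinese masterlist in the same shape would plug in without changing the functions, since the lookup works on plain strings.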

I will also provide the word ranking list (7).

So I think the main task is just using the Chinese language libraries rather than the English language libraries.

Please test your work before giving it to me.

Any questions, please ask.

Thanks for reading.

Skills: Machine Learning, Natural Language, Python


About the employer:
(25 reviews) Hong Kong, Macau

Project ID: #18141725

8 freelancers are bidding an average of $307 for this job


Hi, we believe we can take care of your requirements. We have worked on several Python and Django projects, including [login to view URL], which is a stock trading platform. Apart from this, we have worked on other...

$150 USD in 3 days
(16 reviews)

Dear Hiring Manager. I always hate a guy who overestimates himself, so I would like to stick to the facts about my development ability! With hands-on experience verifying my ability to develop special and kern...

$300 USD in 5 days
(8 reviews)

Hello, I'm an NLP researcher and master's student in computer science. I would like to work on your project.

$1111 USD in 3 days
(1 review)

Hello, I am a software engineer. I specialize in image processing projects such as image segmentation, face recognition, object detection (tracking), etc. I am more familiar with TensorFlow than Caffe. Of course I...

$300 USD in 3 days
(6 reviews)

I have been working in the analytics/data science field for 5+ years now. I am an expert in R/Python. Worked on various supervised/unsupervised techniques like linear/logistic regression, random forest, decision tr...

$222 USD in 5 days
(4 reviews)

I have decent experience in the field of natural language processing and have authored six research papers at top-tier conferences. Link to GitHub: [login to view URL] Link to CV: [login to view URL] I am pr...

$144 USD in 3 days
(0 reviews)

I am a native Chinese and familiar with Python, Pandas, Numpy. I am familiar with Chinese NLP tools. This is my first work, so I only want to get a good rate. Thanks!

$30 USD in 5 days
(0 reviews)
$200 USD in 10 days
(0 reviews)