Completed

Simple Python Text Analysis Project

I have a .csv file. Each column represents certain information of a newspaper article (year, publisher, body of text). Each row represents one newspaper article. Based on the text of the "body" column, I want new columns to be created for all rows of newspaper articles: (1) total number of words, (2) number of times any country from a list of countries (excluding the U.S.) is found, (3) most frequent word (excluding stop words), (4) number of times the most frequent words appear, (5) second most frequent word (excluding stop words), (6) number of times the second most frequent words appear, (7) third most frequent word (excluding stop words), (8) number of times the third most frequent word appear, (9) fourth most frequent word (excluding stop words), (10) number of times the fourth most frequent word appear, (11) fifth most frequent word (excluding stop words), (12) number of times the fifth most frequent word appear

I am looking for (1) the .py program that would create the above columns to the given .csv file (can use .xlsx file if preferred) and (2) the output .csv (or .xlsx file) with the above columns added to the original file. Experience with Python (pandas, nltk) is necessary.

A link to a shared Dropbox folder with the .csv (and .xlsx) file will be provided when the project is awarded. The .csv file is 138MB with 42967 lines of data (i.e., 42967 newspaper articles).

This is a simple test run to another project, and satisfactory completion of this project will lead to a larger related project. Preferences will be given to those that can complete this mini project as soon as possible.

Evner: Databehandling, Python, Statistikker

Se mere: python text mining package, text mining using python tutorial, text mining projects, text mining in python code, content analysis in python, text mining projects in python, python text mining tools, text mining python pdf, write simple python game project, simple python project, create simple python project, simple java text output project, simple text analyzer project java, text classfication project python, simple captcha text project, requirement analysis project management website design, competitive research analysis project, stockmarket analysis project, company analysis project, simple rich text editor

Om arbejdsgiveren:
( 2 bedømmelser ) Jersey City, United States

Projekt ID: #16508498

Tildelt til:

Vadimwang

Having experience of python, I can do what you want. Python is my primary programming language. Let us discuss details in chat.

$30 USD på 1 dag
(30 bedømmelser)
4.3

6 freelancere byder i gennemsnit $30 på dette job

tausy

A proposal has not yet been provided

$35 USD in 3 dage
(4 bedømmelser)
2.4
mingtian815

hi i am very interested in your task i am python expert and have rich experience in data processing , handling excel , text analysis i can help you with good result and work for you as long term thanks

$25 USD på 1 dag
(6 bedømmelser)
2.4
WIFTCAP

Hi !! Nice to e-meet you Dear Client, Thank you for looking at our candidature. As per your requirement, you are looking for professional website developer for a simple website with social media integration. we can ful Flere

$35 USD in 3 dage
(0 bedømmelser)
0.0
$30 USD på 1 dag
(0 bedømmelser)
0.0
xcrapper142

Let me do this, I promise not to disappoint you. Relevant Skills and Experience Python, csv

$25 USD på 1 dag
(0 bedømmelser)
0.0