Peyton Data Mining

you are going to read some text files and classify them according to their labels. The Reuters corpus is one of the most famous datasets for text categorization tasks. We provide a subset of this dataset on Brightspace. You apply these files to make your classifier. There is more information about this dataset available on [login to view URL]

1- Download zip file and extract it. Consider this data is a subset of full Reuters corpus to make it possible for you to process without the need of a powerful server.

2- Each file contains some XML files. Explore XML files and find a list of all fields available there.

3- Write a function extract a Pandas's Dataframe containing: (1) headline, (2) text, (3) bip:topics,(4)

[login to view URL], (5) itemid, (6) XMLfilename

4- Write a python function to find all the possible values for bip:topics. Consider that each news can

belong to more than one topic.

5- Write a function to prepare your text data by methods such as removing stop words. You are allowed

to use the NLTK library.

6- Extract features from the text using any approach you like. Write a function that input the Dataframe

in step 3 and generates a new Dataframe of your features and labels.

7- Divide your data into a training and test set. You can use any method such as cross-validation. You

need to provide a reason why you decide so here.

8- Write a function to get the Dataframe of step 6 and a set of parameters to return a trained classifier

to classify all labels that you get in step 4.

9- Write a function to evaluate the quality of your classifier (like accuracy, F-score, AUC, ...). Explain why

you think this function is the best choice

9- Generate five different classifiers (Random Forest, Decision Tree, Linear Regression, Neural Network, and SVM) using step 8. Tune them up for the best parameters. Find the best classifier. Explain why.

Evner: Python, Datasøgning, Software Arkitektur, Databehandling, XML

Se mere: excel data mining project, build data mining project, data mining marketing research, data mining research companies, example data mining, purchase data mining contract information, data mining cleaning, dataset data mining association, screen scraping data mining, role database developers data mining, data mining using aspnet, datasets data mining association, medical billing service data mining, data mining websites excell, find data mining clients, data mining find jobs php, email find data mining, Find research on Image Processing/ Data Mining, find data mining expert

Om arbejdsgiveren:
( 0 bedømmelser ) Middle Sackville, Canada

Projekt ID: #21831994

Tildelt til:


Hello Dear...! Alert: I will give you 20% discount on my bid rate also give on my All Services. So grabs this special offer is limited. Let’s get to the point. I came to know that your Looking a developer which Flere

$131 CAD in 3 dage
(4 bedømmelser)

17 freelancere byder i gennemsnit $177 på dette job


Hi, I read your project description and I am interested in your job. As you can see my profile, I am a full-time developer and have just completed many projects. Specially, I have top skills for C/C++, C#, Java, Py Flere

$200 CAD in 2 dage
(69 bedømmelser)

Hello? How are you? I am excited to work with you on this project. I have done a lot of jobs with python like Django admin, Flask, python scrap, pysql, python tkinter GUI etc Here is on of my scrap with python wor Flere

$155 CAD in 3 dage
(130 bedømmelser)

Dear, As an expert in python, I have developed many scripts and applications using python, PyQt, wxpython, tkinter. I developed FlightPlanner and Wamdam database management using python and wxpython. My recent work: Flere

$140 CAD in 7 dage
(59 bedømmelser)

Hi, I have gone through your requirement to scrape lots of websites. I am EXPERT in building scraping tools /scripts. Hence, I can SURELY work on your project. I am having 4 YEARS of EXPERIENCE in developing PHP-PYTHON Flere

$108 CAD in 3 dage
(70 bedømmelser)

Hi, Nice to meet you! I have read your requirements carefully and I am very interesting for your project. I am confident of this project as I'm a professional Python,Data Mining expert with over 5 years of experience. Flere

$140 CAD in 7 dage
(22 bedømmelser)

Hi, I have worked with NLP for sentiment analysis. I used Pythonfor the development. I would like to work on your project. Let me know if you want to discuss further. Regards, Monir

$250 CAD in 14 dage
(9 bedømmelser)

Hi.I have checked your requirement and understand it well. I have many experience in **** python **** I am a full stack developer with enough experience and skills in Django & ReactJs & VueJs & ASP.NET & PHP & JAVA Flere

$140 CAD in 7 dage
(12 bedømmelser)

I am a professional data scientist from Scotland I have a vast amount of experience in data mining I am more than happy to go ahead and discuss your project with you please drop me a text here.

$277 CAD på 1 dag
(4 bedømmelser)

Hi, there! I saw your description carefully and I think it best fits on my skill set. I'm a Python expert, I have more experience in data processing. Scraping is my major skill and I can build your project using differ Flere

$150 CAD in 7 dage
(7 bedømmelser)

Hi Sir, Having Expertise in nature language processing, using python. also worked on different classification algorithm from machine learning and Deep learning. let's connect for further discussion. Thanks

$200 CAD in 2 dage
(1 bedømmelse)

i can do it in a couple of days, i would use cross-validation because it is the one that i normally use.

$100 CAD in 10 dage
(0 bedømmelser)

Certified in Java 1.2. I have been working with Java and JEE for 15 years. I have worked with several programming languages as: C, Python, Javascript, Visual Basic among others. I have experience doing compilers and in Flere

$250 CAD in 7 dage
(0 bedømmelser)

we have good team to do the project already we are doing python AUS projects on time delivery we can do python/r,data sciences

$140 CAD in 7 dage
(0 bedømmelser)

Hi! Your project is similar to the project done in chapters 9, 10, and 11 from "Data Science with Python and Dask". I already done that project, so I can work on your project with confidence. I have experience choosi Flere

$225 CAD in 7 dage
(0 bedømmelser)

I have gone through your job description carefully and I am very interested in your project. I am very professional in wordpress design, bugfix, PHP, javascript and I can manage your project perfectly. Thank you!

$155 CAD in 3 dage
(0 bedømmelser)

Hello, Checking your requirements we found ourselves fit to proceed with this project. I have some queries . It would be really great if we can get connected here to understand requirement and clarify everything in mor Flere

$250 CAD in 12 dage
(2 bedømmelser)