Data Science Project

Job Description:


Irrespective of whether or not data and images are stored/analyzed in a centralized

manner, variability in scanner models, acquisition protocols and reconstruction

settings are unavoidable in the current clinical practice. Yet radiomics are notoriously

sensitive to such protocol variations. Hence, there is a clear need for the harmonization

of features in order to allow consistent findings in radiomics multicenter studies.


The objective of this project is to develop different models to predict failure (endpoint)

of the radiomics signature based from MRI, PET and CT scans.


[login to view URL] contains 197 rows and 498 columns:

[login to view URL]: binary property to predict

You can split the dataset as you want to create the training/validation/test datasets


You have to deliver three different models:


§ Create an ensemble classification model (atleast 3 models of your choice).

§ Preprocess the data

o Check for null and missing values

o Check for normality, if not, normalized the data

o Get the correlation of the whole data expect the categorical variables

§ Split the data into training (80%) and testing (20%)

§ Print the AUC values during Training

§ Print the Top 20 important features during Training

§ Print the AUC values during Testing


§ Create a neural network-based classification model.

§ Create five hidden layers with 256, 128, 128, 64 and 64 neurons, respectively

with activation functions of Sigmoid

§ Create an output layer with ten neurons respectively with activation functions

of Softmax.

§ Every layer is followed by a dropout to avoid overfitting.

§ Copy the slide 15 backpropagation compiler approach.

§ Copy the slide 33 model compiler approach.

§ Train the model with epoch = 10, batch size = 128 and validation split = 0.15

(reference slide 33).

§ Evaluate the trained model using the testing dataset.

§ Get the model prediction using the testing dataset.


§ Without considering the binary output and categorical variables in the dataset,

compare the following clustering technique results:

o K-Means

o Hierarchical

o Model Based


To deliver:

§ 3 (Model1, Model2, and Model13) R Markdown

§ 3 (Model1, Model2, and Model13) PDF Files from an R Markdown Outputs

§ These should be pushed into your final github repository.

§ Name the repo as INFS692

Readme file

This md file must have the documentation about your application, models, packaging,

setup and dockerization


Use git to version your code and push it to any public repository and send/upload the

link in the My Drive before December 10, 2022 at 11:59PM.


You will be evaluated on:

§ Quality and structure of your codes

§ Models architecture

§ Git commits quality

§ Data preparation and preprocessing

§ Documentation

§ The quality of your answers

Færdigheder: Programmeringssproget R, Datavidenskab, Statistikker, Statistisk analyse, Statistical Modeling

Om klienten:
( 0 bedømmelser ) Zamboanga, Philippines

Projekt ID: #35266684

20 freelancere byder i gennemsnit $485 timen for dette job


Hi I am a very experienced statistician, data scientist and academic writer. I have completed several PhD level thesis projects involving advanced statistical analysis of data. I have worked with data from several comp Flere

$1700 USD in 25 dage
(153 bedømmelser)

DATA ANALYST Hello there, I am best in statistics, R programming analysis of data, SPSS,SAS Statistical/Data Analysis, Multivariate Statistical Analysis, Regression Analysis, STATA, MINITAB, R language, Factor Analysi Flere

$250 USD in 7 dage
(126 bedømmelser)

Hi, Statistics is my favorite subject and will be glad to help. I have skills in Data Processing, Statistics, R Programming Language Statistical Analysis

$250 USD in 5 dage
(71 bedømmelser)

Hello sir I m a skilled Data scientist with more than 9 years of professional experience in r programming language and python . I can start now Regards.

$395 USD in 3 dage
(25 bedømmelser)

Do kindly reach out to me over chat, i am a python ml specialist with over 8 years experience in industry. Would do my best to serve the requirements

$500 USD in 10 dage
(39 bedømmelser)

I am an expert statistician and data analyst with more than five years of experience. I have full command of Excel analysis, SPSS, STATA, R LANGUAGE, AND PYTHON. I am an expert in logistic regression analysis, deep lea Flere

$300 USD in 2 dage
(24 bedømmelser)

Hi, How are you? Very happy to bid your project because my skills are fitted in your project. I have 8 years experience in Python, R programming and data science. I am very familiar with classification. I will do my be Flere

$250 USD in 3 dage
(6 bedømmelser)

Data science expert. I can do it. As 9+ years experiences in these field. I can give good quality work. I have read the guidelines of your work.I believe that i can provide you the best quality works you are anticipati Flere

$700 USD in 7 dage
(20 bedømmelser)

Dear client! I am interested in your project Data Science Project I have completed similar papers in the past and can assure you of exceptional and original work within the agreed deadline. I have skills in R Program Flere

$250 USD in 3 dage
(10 bedømmelser)

STATISTICS, DATA SCIENCE, R EXPERT HERE!!! "Satisfy the client with my ability and passion" This is my slogan here. I hope you will be interested in me. Thanks.

$500 USD in 3 dage
(1 bedømmelse)

Professional data science area expert can help you >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>

$345 USD in 7 dage
(4 bedømmelser)

Hi there, I have read your project details. I can analyse your data with above-mentioned requirements.I can do this lower fares. Message me for further discussions. So I can start working on your project as soon as pos Flere

$250 USD in 7 dage
(2 bedømmelser)

Hi there, I have gone through your project details and would like to tell you that l have a great bunch of experience in SPSS Statistics, Data Mining, Statistical Analysis, Statistics and Python. For that I would requi Flere

$500 USD in 7 dage
(0 bedømmelser)

Hi there, Hope you are doing great. Firstly about the Data Science Project that i have a great experience in it. I am very confident to pull it off once awarded. I am a Full stack developer with a team of experienced d Flere

$700 USD in 20 dage
(0 bedømmelser)

Hi, We would like to grab this opportunity and will work till you get 100% satisfied with our work. We are an expert team which have many years of experience on Statistics, R Programming Language, Statistical Analysi Flere

$500 USD in 7 dage
(0 bedømmelser)

Hello I can do this. Please share the details of the task so that I can check and confirm accordingly.

$500 USD in 7 dage
(2 bedømmelser)

Expert HERE!!! Thanks for you job post! I just done a project such as yours so I can satisfy you perfectly with my skills and experiences. If you hire me, then you will never regret and I will do my best. I hope see yo Flere

$500 USD in 7 dage
(0 bedømmelser)

Hi, I am an economics graduate with a deep understanding of statistics principles and methodology. Along with this I am well-versed in Machine Learning algorithms which makes me an ideal partner for your project. Furt Flere

$500 USD in 7 dage
(0 bedømmelser)