ml-cvd-prediction

CS 7641 Final Project: ML for Cardiovascular Disease Prediction

Group 16: Suzan Manasreh, Natasha Mohanty, Kalp Vyas, Eric Chang, Chih-Chun Huang

Contents

Directory and File Structure

Top-level directory layout

.
├── csvs                                        
│   ├── Cardiovascular_Disease_Dataset.csv
│   ├── cleveland.csv
│   ├── cleveland_targets.csv
│   ├── full_dataset.csv
│   └── shuffled_data.csv
├── environment 
│   └── 7641_project_env.yml
├── public
│   ├── cm_nn_lda.png
│   ├── confusion_matrix.png
│   ├── confusionmatrixlda.png
│   ├── Confusionmatrixoriginal.png
│   ├── dbconfusionmatrix.png
│   ├── decisionboundary.png
│   ├── dt-3.png
|   ├── dt
|   ├── epsilon-elbow-estimation.png
|   ├── epsilon-estimation.png
│   ├── gmm_output.png
│   ├── gridresultlda.png
│   ├── gridresultoriginal.png
│   ├── hyperparam_nn.png
│   ├── kmeans-1.png
│   ├── kmeans-2.png
│   ├── kmeans-3.png
│   ├── lda-1.png
│   ├── nn-parameter.png
│   ├── output-withnoise.png
│   ├── output-withoutnoise.png
│   ├── rf-lda.png
│   ├── rf-nonlda.png
│   └── targetvs.png
├── src 
│   ├── cleveland.ipynb
│   ├── dbscan.ipynb
│   ├── featurereduction.ipynb
│   ├── kmeans.ipynb
│   ├── mendeley.ipynb
│   ├── supervised.ipynb
│   ├── supervisedfeaturereduction.ipynb
│   ├── Supervisedlearningtuning_nn.ipynb
│   ├── Supervisedlearningtuningsvm.ipynb
│   ├── unsupervised.ipynb                   
├── _config.yml
├── final_report.md
├── midterm_report.md
├── notes.md
├── proposal.md
└── README.md

/csvs/

This directory holds all the csv files used for the project.

/environment/

This directory holds the yaml files necessary for installing custom environments.

/public/

This directory holds all generated images that are referenced within the proposal and midterm report. The purpose of the images is to display the results from running multiple supervised and unsupervised algorithms.

/src/

This directory holds all the code responsible for different machine learning algorithms and generating analysis images stored within the /public/ directory. All algorithms use the /csvs/shuffled_data.csv file.