Titanic - Machine Learning from Disaster

Introduction

This is my first Machine Learning Project. This repository contains my work for the Kaggle Titanic survival prediction competition. The challenge is to use machine learning to create a model that predicts which passengers survived the Titanic shipwreck based on a range of features.

Portfolio Website with my BEST Deployed Models in the cloud

I also made a custom interactive website for this project, which describes the project, my techniques, and most importantly, I deployed my best models and allowed users to pass in values which are given to the model in the cloud and gives back predictions of survival to the user. Try it out at https://michael-ye-titanic-home.streamlit.app/

Project Description

The project employs a variety of machine learning models, including Random Forests, Gradient Boosted Trees, and Neural Networks, to predict survival. Techniques for data cleaning, feature engineering, and model tuning are thoroughly documented in the Jupyter notebooks.

File Descriptions

titanic-neural-network (1).ipynb: FULL MAIN jupyter notebook (including stuff from the CLEAN_titanic-RandomForest file) from start to finish for the Neural Network
CLEAN_titanic-neural-network.ipynb: CLEAN means that I removed all unncessary data preprocessing stuff and only included the code and markdown cells which I ended up using in my final data preprocessing. CLEAN also includes the most recent neural network models
titanic-random-forest-classifier.ipynb: FULL Jupyter notebook for the Random/Decision Forest Models (including all the stuff from the CLEAN_titanic-RandomForest)
CLEAN_titanic-RandomForest.ipynb: CLEAN means that I removed all unncessary data preprocessing stuff and only included the code and markdown cells which I ended up using in my final data preprocessing. CLEAN also includes the most recent random/decision forest models
CLEAN_titanic-GradientBoostedTreesModel.ipynb: Jupyter notebook for the Gradient Boosted Trees model.
model_plot.html: Visualization of model performance.
model1_RF.png: Image depicting the Random Forest model structure.
model3_GB_hyperparameters.json: Hyperparameters used for the Gradient Boosted model.
model3_GB_summary.txt: Summary of the Gradient Boosted model's performance.
model3_RF_hyperparameters.json: Hyperparameters used for the Random Forest model.

Installation

To set up this project, you will need Python 3.x and the following libraries: Pandas, NumPy, SciKit-Learn, Matplotlib, Seaborn, and TensorFlow. Installation can be done via pip:

pip install pandas numpy scikit-learn matplotlib seaborn tensorflow

Usage

Each model can be run by navigating to the respective Jupyter notebook and executing the cells in order. Ensure that you have Jupyter installed and run:

jupyter notebook

Contributing

Contributions are welcome. Please open an issue first to discuss what you would like to change or add.

Credits

Special thanks to Kaggle for hosting the dataset and challenge.

License

This project is open source and available under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
TFDF_Model3		TFDF_Model3
decision-forests-1.5.0		decision-forests-1.5.0
directory_for_model_10/keras_tuner_model10		directory_for_model_10/keras_tuner_model10
directory_for_model_11/keras_tuner_model11		directory_for_model_11/keras_tuner_model11
directory_for_model_12_new		directory_for_model_12_new
directory_for_model_13/keras_tuner_model13		directory_for_model_13/keras_tuner_model13
directory_for_model_14/keras_tuner_model14		directory_for_model_14/keras_tuner_model14
directory_for_model_8/keras_tuner_model8		directory_for_model_8/keras_tuner_model8
directory_for_model_9/keras_tuner_model9		directory_for_model_9/keras_tuner_model9
model3_GB		model3_GB
model3_RF		model3_RF
my_dir/keras_tuner_demo		my_dir/keras_tuner_demo
yggdrasil_decision_forests		yggdrasil_decision_forests
.gitignore		.gitignore
CLEAN_titanic-GradientBoostedTreesModel.ipynb		CLEAN_titanic-GradientBoostedTreesModel.ipynb
CLEAN_titanic-RandomForest.ipynb		CLEAN_titanic-RandomForest.ipynb
CLEAN_titanic-neural-network.ipynb		CLEAN_titanic-neural-network.ipynb
KAGGLE CLEAN_titanic-neural-network copy.ipynb		KAGGLE CLEAN_titanic-neural-network copy.ipynb
KAGGLE_CLEAN_titanic-RandomForest.ipynb		KAGGLE_CLEAN_titanic-RandomForest.ipynb
MODEL12_NeuralNetwork_notebook.ipynb		MODEL12_NeuralNetwork_notebook.ipynb
README.md		README.md
Titanic Score 80.143.png		Titanic Score 80.143.png
X_train_imputed.csv		X_train_imputed.csv
X_train_imputed_without_missing_Embarked.csv		X_train_imputed_without_missing_Embarked.csv
X_train_imputed_without_missing_Embarked2.csv		X_train_imputed_without_missing_Embarked2.csv
age.py		age.py
choose_features.py		choose_features.py
directory_for_model_12_newKaggle.zip		directory_for_model_12_newKaggle.zip
gender_submission.csv		gender_submission.csv
heatmap.png		heatmap.png
heatmap2.pdf		heatmap2.pdf
heatmap2.png		heatmap2.png
heatmap3.png		heatmap3.png
high score of 80.622.jpg		high score of 80.622.jpg
model.png		model.png
model10.png		model10.png
model11.png		model11.png
model12.png		model12.png
model13.png		model13.png
model14.png		model14.png
model1_RF.png		model1_RF.png
model3_GB_hyperparameters.json		model3_GB_hyperparameters.json
model3_GB_summary.txt		model3_GB_summary.txt
model3_RF_hyperparameters.json		model3_RF_hyperparameters.json
model3_RF_summary.txt		model3_RF_summary.txt
model9.png		model9.png
model_12_saved.h5		model_12_saved.h5
model_14_saved.h5		model_14_saved.h5
model_plot.html		model_plot.html
scaler2.pkl		scaler2.pkl
script.py		script.py
submission.csv		submission.csv
test.csv		test.csv
test_data_imputed.csv		test_data_imputed.csv
testingthis.py		testingthis.py
titanic-neural-network (1).ipynb		titanic-neural-network (1).ipynb
titanic-random-forest-classifier.ipynb		titanic-random-forest-classifier.ipynb
titanic.zip		titanic.zip
train.csv		train.csv
tuner.zip		tuner.zip
tuner2.zip		tuner2.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Titanic - Machine Learning from Disaster

Introduction

Portfolio Website with my BEST Deployed Models in the cloud

Project Description

File Descriptions

Installation

Usage

Contributing

Credits

License

About

Releases

Packages

Languages

23yem/Final-Titanic-Machine-Learning

Folders and files

Latest commit

History

Repository files navigation

Titanic - Machine Learning from Disaster

Introduction

Portfolio Website with my BEST Deployed Models in the cloud

Project Description

File Descriptions

Installation

Usage

Contributing

Credits

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages