Kevin Fu: UC Berkeley, MS Data Science - All Projects

W210 Capstone: StockpickAI

One of the first ML-based stock prediction models geared towards the Retail Investor. Monthly recommendation of 5 stocks that are most likely to outperform the S&P 500 over the next calendar year.
Hal Varian Award Showcase Finalist
Results/Model: 55-60% Accuracy: XGBoost / +19.9% annualized vs. +8.9% S&P 500 (2004-2022)
Tools: Python, SQL / Amazon AWS (EC2, S3) / XGBoost, Random Forest, K Means Clustering

Predict flight delays to decrease explicit (financial) and implicit (time) costs for both airlines and consumers alike
Results/Model: 84% Recall: Random Forest (% of delayed flights that are correctly classified as delayed)
Tools: Python / Databricks / Random Forest, Gradient Boosted Trees, Logistic Regression

Accurately classify slang-heavy social media posts (using NLP techniques) in order to attract higher ad spending
Classification of deslanged social media posts likely to outperform those with slang/acronyms
Results/Model: 86% F1 Score: BERT
Tools: Python / GCP / Deep Learning (Transformers: BERT & T5, Recurrent Neural Networks), Naive Bayes

Created end-to-end computer vision solution that classifies 20 animal types with unstructured data
Combination of low amount of training data and compute resources yielded a low scoring model
Results/Model: 32% F1 Score: Support Vector Machines (SVM)
Tools: Python / Deep Learning (Convolutional Neural Networks), SVM, PCA, T-SNE, Sobel, Harris Corners

Determine if race or hospital preparedness was more important with respect to COVID mortality rates to help combat “fake news”.
Results: State hospital preparedness held higher significance
Tools: R / Linear & Logistic Regression

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
Other MIDS Projects		Other MIDS Projects
W203 Race vs. Hospital Preparedness on COVID Mortality Rates		W203 Race vs. Hospital Preparedness on COVID Mortality Rates
W210 StockPickAI		W210 StockPickAI
W261 Flight Delay Prediction		W261 Flight Delay Prediction
W266 Subreddit Classification		W266 Subreddit Classification
W281 Image Classification		W281 Image Classification
README.md		README.md