This project is based on the data-driven approach of the Oakland Athletics baseball team, as featured in the book "Moneyball" by Michael Lewis. The analysis aims to identify cost-effective player replacements using sabermetrics, focusing on batting average, on-base percentage, and slugging percentage. The goal is to help the Athletics maintain a competitive edge with limited resources by finding undervalued players who can match or exceed the contributions of key players lost in the offseason.
- R: For data manipulation, statistical analysis, and visualization
- Data Visualization: ggplot2 and other R libraries for clear insights
Data/
: ContainsBatting.csv
andSalaries.csv
, with player performance and salary information.Code/
: IncludesMoneyball.R
with all the R code for data cleaning, analysis, and visualization.
- Data-Driven Strategy: Emphasis on sabermetrics allows for objective player evaluation, focusing on undervalued statistics like on-base percentage and slugging percentage.
- Budget Efficiency: Highlights players who can replace key contributors (e.g., Jason Giambi, Johnny Damon) within the constraints of a small-market team budget.
- Long-Term Impact: The analysis provides insights into maintaining competitiveness through efficient resource allocation, which has implications for sports management beyond baseball.
- Clone this repository.
- Run the R script (
Moneyball.R
) in RStudio or any compatible R environment. - Ensure necessary R packages (like
dplyr
,ggplot2
) are installed. - Review the analysis results, visualizations, and player recommendations.
Connect with me on LinkedIn to discuss this project or view my other work in data analysis and sports analytics.