Analytics & Strategy

Portfolio

Concise project decks with clear takeaways—preview instantly or download a polished PDF to dive deeper.

Technical Projects

R Reports
Predictive Analytics

Predicting Flight Delays — Logistic Regression on 1M+ U.S. Flights

Cleaned and feature-engineered a dataset of 1M+ U.S. flights (Jan–Feb 2024), conducted EDA across airports and days of the week, then trained a logistic regression model to predict delay probability. Origin airport was the dominant predictor: JFK, LGA, and EWR exceeded 80% delay rates, while distance and departure time had minimal influence. The model achieved 65.2% accuracy on a held-out test set.

  • Skills: Logistic regression, EDA, train-test split, confusion matrix
  • Tools: R (tidyverse, ggplot2, caret, lubridate)
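For readers unfamiliar with the method, here is a minimal Python sketch of how a logistic model turns features into a delay probability and how accuracy is read off a confusion matrix. It is not the project's R code: the function names, the 0.5 threshold, and the sample labels and probabilities are all assumptions made up for illustration.

```python
import math


def delay_probability(intercept, coefficients, features):
    """Logistic model: P(delay) = 1 / (1 + exp(-(b0 + sum(b_i * x_i))))."""
    z = intercept + sum(b * x for b, x in zip(coefficients, features))
    return 1.0 / (1.0 + math.exp(-z))


def confusion_matrix(actual, probabilities, threshold=0.5):
    """Count true/false positives and negatives at a probability cutoff."""
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for y, p in zip(actual, probabilities):
        predicted = 1 if p >= threshold else 0
        if predicted == 1 and y == 1:
            counts["tp"] += 1
        elif predicted == 1 and y == 0:
            counts["fp"] += 1
        elif predicted == 0 and y == 0:
            counts["tn"] += 1
        else:
            counts["fn"] += 1
    return counts


def accuracy(counts):
    """Share of all predictions that matched the actual label."""
    return (counts["tp"] + counts["tn"]) / sum(counts.values())


# Made-up labels and probabilities, for illustration only.
labels = [1, 0, 1, 0]
probs = [0.9, 0.2, 0.4, 0.7]
cm = confusion_matrix(labels, probs)
print(cm, accuracy(cm))
```

Accuracy alone can hide class imbalance, which is why the confusion matrix is kept alongside it.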
Download PDF
Reports Tableau
Wine Analytics

Gallo Winery — Preliminary Wine Analysis

Analyzed 13,050 wine records to understand how type and geography affect price and quality. Sparkling wines performed best, Rosé worst. Used t-tests to validate regional differences and guide sourcing decisions.

  • Skills: Descriptive analytics, two-sample t-tests
  • Tools: Tableau storytelling, Python/R/SQL stack
Download PDF
Excel Reports Tableau
Pricing Strategy

Gallo Winery — Wine Pricing Model & Tableau Dashboard

Built regression models linking price to rating, vintage, and popularity. Found sparkling wines command premium prices. Created interactive Tableau dashboard to support pricing strategy.

  • Skills: Correlation, regression diagnostics
  • Tools: Tableau filters, Python/R/SQL
Download PDF
Excel Reports
Customer Analytics

Credit Card Behavior — Taiwan (≈25k customers)

Tested for gender and seasonal differences in payment patterns. Men carried $125 higher balances on average, and September payments exceeded April payments by $15. Results shaped marketing and retention strategies.

  • Skills: Two-sample & paired t-tests
  • Tools: Confidence intervals, experiment-to-action translation
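The two test types named above can be sketched in a few lines of Python. This is an illustrative sketch only, not the original Excel workflow: it assumes the unequal-variance (Welch) form for the two-sample test, which the source does not specify, and the function names are hypothetical.

```python
import math
import statistics


def welch_t(sample_a, sample_b):
    """Two-sample t statistic for independent groups (unequal variances)."""
    mean_diff = statistics.mean(sample_a) - statistics.mean(sample_b)
    standard_error = math.sqrt(
        statistics.variance(sample_a) / len(sample_a)
        + statistics.variance(sample_b) / len(sample_b)
    )
    if standard_error == 0:
        raise ValueError("both samples have zero variance")
    return mean_diff / standard_error


def paired_t(first, second):
    """Paired t statistic: the same units measured twice (second minus first)."""
    if len(first) != len(second):
        raise ValueError("paired samples must have equal length")
    diffs = [b - a for a, b in zip(first, second)]
    spread = statistics.stdev(diffs)
    if spread == 0:
        raise ValueError("all paired differences are identical")
    return statistics.mean(diffs) / (spread / math.sqrt(len(diffs)))
```

The independent test suits a comparison between two separate groups of customers, while the paired test suits the same customers observed in two different months.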
Download PDF
Excel Reports
Retail Pricing

Walmart vs. Amazon/Macy's/Overstock — Women's Shoes

Compared 26k shoe prices across retailers. ANOVA confirmed Walmart consistently undercuts competitors. Recommended targeted price adjustments and aggressive price-match messaging.

  • Skills: ANOVA, pairwise confidence intervals
  • Tools: Large-scale data cleaning & merchant segmentation
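As a rough illustration of the ANOVA step, the sketch below computes a one-way F statistic in plain Python. It is an assumption-laden example, not the project's Excel analysis: the function name is hypothetical and the three groups stand in for retailers with made-up values.

```python
def one_way_anova_f(groups):
    """F statistic: between-group mean square over within-group mean square."""
    n_total = sum(len(g) for g in groups)
    k = len(groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    group_means = [sum(g) / len(g) for g in groups]

    # Variation of group means around the grand mean, weighted by group size.
    ss_between = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    # Variation of individual values around their own group mean.
    ss_within = sum(
        (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
    )

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)
    return ms_between / ms_within


# Made-up values for three hypothetical retailers, for illustration only.
f_stat = one_way_anova_f([[1, 2, 3], [2, 3, 4], [3, 4, 5]])
print(f_stat)
```

A large F only says that at least one group mean differs; the pairwise confidence intervals listed under Skills are what identify which retailer is cheaper.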
Download PDF
Excel Reports
Financial Modeling

EPS Drivers — NYSE (1,501 companies)

Modeled earnings per share using EBT, operating margin, and pre-tax margin. EBT proved the strongest predictor. Identified unexpected negative margin relationships worth investigating.

  • Skills: Multiple regression, significance testing
  • Tools: Model diagnostics, financial storytelling
Download PDF
Excel Reports
Lending Strategy

Debt Predictors — SoFi (≈2,000 clients)

Predicted client debt levels using demographics and financial behavior. Identified key risk factors and multicollinearity issues. Enabled better targeting for consolidation offers.

  • Skills: Multiple regression, diagnostics
  • Tools: Residual analysis, predictive screening
Download PDF
R Reports
Data Wrangling

U.S. College Tuition — State & Regional Trends

Tidied and reshaped 50-state average tuition data (2004–2016) into long format, joined with U.S. Census regions, and produced five base visualizations revealing the Northeast's consistent cost premium over other regions.

  • Skills: Data tidying, pivot_longer, table joins
  • Tools: R (tidyverse, ggplot2)
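The wide-to-long reshape and region join can be sketched as follows. This is a plain-Python analogue of what `pivot_longer` and a table join do, not the project's R code. The column names and tuition figures are placeholders invented for illustration; only the two state-to-region assignments follow the U.S. Census regions.

```python
def pivot_longer(rows, id_col, names_to, values_to):
    """Turn one row per state (a column per year) into one row per state-year."""
    long_rows = []
    for row in rows:
        for column, value in row.items():
            if column == id_col:
                continue
            long_rows.append(
                {id_col: row[id_col], names_to: column, values_to: value}
            )
    return long_rows


def left_join(rows, lookup, key, new_col):
    """Attach a lookup value to every row; missing keys get None."""
    return [{**row, new_col: lookup.get(row[key])} for row in rows]


# Placeholder numbers, not real tuition data.
wide = [
    {"state": "Vermont", "2004": 100, "2005": 110},
    {"state": "Alabama", "2004": 50, "2005": 55},
]
regions = {"Vermont": "Northeast", "Alabama": "South"}

long_rows = pivot_longer(wide, "state", "year", "tuition")
joined = left_join(long_rows, regions, "state", "region")
```

Long format puts year in a single column, which is what makes grouping and plotting by region and year straightforward.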
Download PDF
R Reports
Data Visualization

Tuition Visualization — Storytelling & Annotation

Refined five tuition visualizations with titles, axis labels, reference lines, direct data labels, and on-plot annotations to transform raw charts into publication-ready figures that communicate regional cost trends clearly.

  • Skills: Visual storytelling, annotation, ggplot2 theming
  • Tools: R (ggplot2, tidyverse)
Download PDF
R Reports
Sports Analytics

Olympic Medal Trends — Regional Analysis (1990–2016)

Joined 271k Olympic athlete records with NOC region data to analyze medal counts across regions. Built multi-series line plots and faceted medal-type breakdowns for the top 6 regions over 26 years.

  • Skills: Table joins, group summaries, faceted visualization
  • Tools: R (tidyverse, ggplot2)
Download PDF
R Reports
Geospatial Analytics

State Milk Production & Population — Correlation Study

Analyzed U.S. milk production (1970–2017) using regional scatter plots and a choropleth map by state. Measured a moderate positive correlation (r = 0.63) between state population and milk output, with noted data limitations.

  • Skills: Choropleth mapping, Pearson correlation, trend analysis
  • Tools: R (usmap, ggplot2, tidyverse)
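For reference, the Pearson coefficient reported above can be computed as in this minimal Python sketch. The function name is hypothetical and the snippet is not the project's R code; it only shows the standard formula, covariance divided by the product of the two spreads.

```python
import math


def pearson_r(x, y):
    """Pearson correlation: co-variation of x and y scaled to [-1, 1]."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    co_variation = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return co_variation / (spread_x * spread_y)
```

A value near 0.6 indicates that larger states tend to produce more milk, but with plenty of scatter around that trend.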
Download PDF
Excel Reports
Salary Analytics

Nonlinear Salary Growth — University Graduate Study

Tested four nonlinear regression models (Quadratic, Lin-Log, Log-Lin, Log-Log) across 140 state universities to predict mid-career salary from starting salary. The Log-Log model outperformed all others with an adjusted R² of 0.7649 and the lowest standard error.

  • Skills: Nonlinear regression, model comparison, adjusted R²
  • Tools: Excel regression analysis, statistical reporting
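A Log-Log model regresses ln(mid-career salary) on ln(starting salary), so the slope reads as an elasticity. The Python sketch below shows one way to fit it and compute adjusted R² for a single predictor. It is an illustration under assumptions, not the project's Excel workbook: the function name is hypothetical and the input data is synthetic, generated from an exact power law.

```python
import math


def fit_log_log(x, y):
    """Fit ln(y) = intercept + slope * ln(x) by ordinary least squares."""
    log_x = [math.log(v) for v in x]
    log_y = [math.log(v) for v in y]
    n = len(log_x)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n

    sxx = sum((a - mean_x) ** 2 for a in log_x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(log_x, log_y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    ss_res = sum(
        (b - (intercept + slope * a)) ** 2 for a, b in zip(log_x, log_y)
    )
    ss_tot = sum((b - mean_y) ** 2 for b in log_y)
    r_squared = 1 - ss_res / ss_tot
    # Adjusted R^2 with one predictor: penalize by (n - 1) / (n - 2).
    adjusted_r_squared = 1 - (1 - r_squared) * (n - 1) / (n - 2)
    return slope, intercept, adjusted_r_squared


# Synthetic data following y = 2 * x^1.5 exactly, for illustration only.
xs = [1, 2, 4, 8]
ys = [2 * v ** 1.5 for v in xs]
slope, intercept, adj_r2 = fit_log_log(xs, ys)
```

Note that adjusted R² here is measured on the log scale, so comparing it with models whose dependent variable is in levels calls for some care.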
Download PDF
Excel Reports
Credit Analytics

Credit Limits & Default Risk — Demographic Regression Study

Built two regression models on ~24,000 credit card customers to examine how education and demographics influence credit limits and default probability. Identified repayment behavior and age as key risk signals, and recommended collecting additional financial variables for stronger predictive power.

  • Skills: Multiple regression, predictive inference, model limitations
  • Tools: Excel regression diagnostics, statistical reporting
Download PDF