Earthquake Rescue Project

In 2010 there was a devastating earthquake in the Caribbean. As part of the rescue effort, a team of data scientists analyzed pixel data of satellite photos for evidence of blue tarps (indicating people in need of assistance).

For a class project I used this problem with the original data to train a variety of classifiers and evaluate performance at different thresholds, utilizing R and R markdown. This project included:

  • Extensive EDA

  • 10 fold cross validation with the same data split for all models

  • Heavily skewed data with around a 3% true positive rate

  • ROC and Precision Recall curves to determine ideal thresholds

  • Messy, real-world holdout data used for evaluation

  • Analytical conclusions based on results of the data

Models considered in the evaluation:

  • Logistic regression

  • LDA

  • QDA

  • KNN

  • Elastic Net (ridge/lasso)

  • Random Forest

  • SVM

The clear front runners were logistic regression, LDA, and QDA.

The button below takes you to the full report. To view the full code don’t hesitate to reach out to jordan.hiatt11@gmail.com.

Previous
Previous

PySpark with Reddit Data

Next
Next

Visualizations with D3.js - US Map with County Data