Austin Animal Center Data Explorer

2022-05-15 89 words 2 minutes

Contents

Increasing the shelter's adoption rate with data insights and machine learning

Language: Python

🎉 The App won 3rd place in the Plotly Summer App Challenge 🎉

1 Disclaimer

The Austin Animal Center (AAC) Data Explorer is a web app developed for the Plotly Summer Community App Challenge. The challenge has not been initiated by the AAC and the resulting app is not a product of the AAC. It is Open Source and is not intended to be commercialized.

2 Data Sourcing

The data used to build the app come from Kaggle. The data relative to the shelter intakes were directly sourced from the AAC API.

2 Data Explorer

The *Explore Data page* displays key data and figures about the shelter’s intakes and outcomes. Thanks to [Plotly](https://plotly.com/), every graph is highly interactive and allows the following features:

zoom in and out;

data point hover;

legend selection;

The map on the right panel of the page was built using [mapbox](https://www.mapbox.com/) and also allows a maximum interactivity.

2 Predictions

The Predict page allows the app user to enter a series of parameters about a new animal intake. The “predict” button will then plug-in those parameters into a trained machine-learning model to determine whether the animal will be adopted or not (see model specifications below).

The model used to determine whether an animal will be adopted or not is a Gradient Boosting Classifier - an ensemble classification model. The best hyper-parameters were determined using a RandomizedSearchCV:

learning_rate=0.1

max_depth=20

min_samples_leaf=1

min_samples_split=2

n_estimators=100

tol=0.0001

subsample=0.5

validation_fraction=0.1

The best model has an overall accuracy of 86% (i.e. the model predicts the right outcome 86% of the time). Below is the “confusion matrix” displaying the model’s overall performance, where the label 0 is “will not be adopted” and the label 1 is “will be adopted”:

The worst errors occur when an animal is falsely predicted to be adopted while in reality it will most likely not be adopted. This type of error is reflected in the upper right corner and represents 7.8% of all predictions.