Data on the Edge: Handling Outliers

Data on the Edge: Handling Outliers

Before we tackle how to handle them, let’s quickly define what an outlier is.  An outlier is any data point that is distinctly different from the rest of your data points. When you’re looking at a variable that is relatively normally distributed, you can think of...
Brushing Up on R-Squared

Brushing Up on R-Squared

When was the last time you took a course in statistics? For many of us, it’s been a few years… at least. When talking to customers about some of the statistical concepts that factor into predictive models, I’ve found that while many topics are “kind of familiar”, most...