Population & Sample Variance

A practical example of why we divide by (n-1) when calculating the sample variance

Detecting and Fixing Outliers in Data

Using a Selective Median Filter to clean erroneous data

Remittance and the exchange rate

Remittance is surprisingly not dependent on the prevailing exchange rate

Base Rate Fallacy

Interpreting commonly quoted statistical results through understanding of base and false discovery rates.

Fertility & Infant Mortality

An analysis to determine the relationship between fertility and infant mortality rates

Are Movies Getting Better or Worse?

A fun little data analysis exercise using IMDB movie rating database

Demystifying the Data Scientist

Is the time ripe for a full stack data scientist?

Introduction to R

Learn the basics of using R to perform simple data analysis

Monte Carlo simulation in Scala

Calculating the value of Pi using Monte Carlo simulation

Curve smoothing in R

Common techniques for smoothing curves and how to do that in R