A wild dataset has appeared! Now what?

Where do we start when we stumble across a dataset we don’t know much about? Lets say one where we don’t necessarily understand the underlying generative process for some or all of the variables. Lets assume for now we’re sure there aren’t one off interventions or level shifts in the data, and we don’t know anything about […]

Examining Your Presence on Twitter with Python

My Evil The Following with absoluteBLACK’s direct mount oval ring. The purpose of this post is to show how a sponsorship/marketing manager might track their athletes or brand ambassadors. The code we’re writing below can be used for many other applications such as tracking general trends across locales or HR insidiously monitoring if employees are […]

Lending Club Data Analysis Revisited with Python

2.5 years ago I analyzed Lending Club’s issued loans data (yikes! I was using R back then!) . It was the most visited blog post on my site in 2013 through 2014. Today it’s still number 5. Reddit picked up my simple “35-hour work week with Python” post which is now #1: Lending Club is […]