Geolocation Using R and ggmap

Given a vector of addresses, how do you find their longitude and latitude using R? I regularly work with spatial data, and most of it needs to be converted from addresses to longitude/latitude before it can be used for analysis. Using the “ggmap” library and its “geocode” function, we can make a call to Google’s API which returns […]

Two key color functions in R

R has several nice built-in color palettes, including topo.colors(), heat.colors() and rainbow(), which are easy to use and let you control the transparency (alpha). The built-in palettes are generally good for rapid prototyping, but I like to customize my plots with alternate color palettes. I have found that […]

Animations Using R

R does not have an easy way to generate high-quality short animations (videos in mp4 or avi formats). In this post I show you, in 6 easy steps using R, how to make a short (30-second), high-quality .mp4 animated video of a rotating volcano that you can share with your friends. To start you’ll […]

Airline Sentiment Analysis Part II Plots

In the previous post we used Twitter to mine people’s sentiments about different airlines and used the binomial test to check whether the responses were well founded or random. In this post we continue our analysis, displaying our results through waffle graphs, word matrices and wordclouds. The wordclouds were generated with the data from […]

Airline Sentiment Analysis!

In this post we draw on Twitter data through R to gauge how the public feels about airlines (airline sentiment analysis) in fewer than 100 lines of code (including commentary)! What’s Twitter? Twitter is a social media platform that allows users to post messages of up to 140 characters (Tweets) to express their feelings, thoughts and […]

My Brother’s Keeper Data Jam

What is My Brother’s Keeper / Data Jam? In August I attended the My Brother’s Keeper “data jam”. MBK is an initiative started by President Obama to address persistent opportunity gaps faced by boys and young men of color and ensure that all young people can reach their full potential. The motivation behind the data […]

Data Science: 5 Principles

In this blog we explore 5 key principles of good data science. While big data is a term that has been thrown around a lot lately, it is data science that adds value to big data. In particular, data science can be thought of as an increasingly important field that is 1/3 statistics skills, 1/3 […]

3 Limitations of Big Data

In years past, data acquisition was difficult. With the advent of cheap storage technologies, high-speed internet and the proliferation of digital geo-encoded devices, we are now literally drowning in data. While experts such as Hao Wang, vice president of information services and chief information officer of the Research Foundation for SUNY, tout Big Data as being able […]