Playing with 80 Million Amazon Product Review Ratings Using Apache Spark | Max Woolf's Blog

With Apache Spark, manipulating actually big data is just as easy as analyzing a dataset with only a few records.