Archive

Everything I've written.

56 articles on data engineering, distributed systems, and the tools around them.

2019

Should you use Parquet? Hadoop Dec 20, 2019 Beginners Guide to Columnar File Formats in Spark and Hadoop Hadoop Nov 21, 2019

2017

A Quick Guide to Concurrency in Scala Scala Mar 28, 2017 4 Fun and Useful Things to Know about Scala's apply() functions Scala Mar 6, 2017 10+ Great Books and Resources for Learning and Perfecting Scala Scala Feb 14, 2017 10+ Great Books for Apache Spark Spark Jan 13, 2017

2016

An Introduction to Hadoop and Spark Storage Formats (or File Formats) Hadoop Sep 1, 2016 Is it 'MapReduce' or 'Map Reduce'? Hadoop Jun 29, 2016 Type-Safe Scalding MapReduce Tutorial - Joining and Summarizing Data Hadoop Mar 2, 2016 Hadoop MapReduce Advanced Python Join Tutorial with Example Code Hadoop Feb 9, 2016 5 Industry Veterans Pick Their Favorite MapReduce Frameworks Hadoop Jan 5, 2016

2015

Apache Spark Java Tutorial [Code Walkthrough With Examples] Spark Dec 28, 2015 Apache Spark Scala Tutorial [Code Walkthrough With Examples] Spark Dec 14, 2015 Apache Hive vs MySQL - What are the key differences? Hadoop Dec 8, 2015 Scalding Hadoop MapReduce Tutorial [Code Walkthrough With Examples] Hadoop Oct 20, 2015 Apache Hive Customization Tutorial Series Hadoop Jul 27, 2015 Cascading MapReduce Hadoop Tutorial with Examples Hadoop Jun 25, 2015

2014

8 SQL-on-Hadoop frameworks worth checking out Hadoop Jun 8, 2014 Data Warehousing and Hadoop 2014 Survey Personal Jan 21, 2014

2013

Reading data from HDFS programatically using java (and scala) Hadoop Dec 28, 2013 Hadoop Python MapReduce Tutorial for Beginners Hadoop Nov 17, 2013 Hadoop MapReduce Scoobi Tutorial with Examples Hadoop Nov 3, 2013 Hadoop Hive UDF Tutorial - Extending Hive with Custom Functions Hadoop Aug 10, 2013 15+ Great Books for Hadoop Hadoop May 31, 2013 A Beginners Guide to Hadoop Hadoop Apr 17, 2013 Real World Hadoop - Implementing a Left Outer Join in Pig Hadoop Apr 7, 2013 Simple Check-in now has more features Personal Feb 27, 2013 Real World Hadoop - Implementing a Left Outer Join in Hive Hadoop Feb 20, 2013 An Excellent Hadoop Resouce - The Hadoop Weekly Newsletter Hadoop Feb 14, 2013 Real World Hadoop Implementing A Left Outer Join In Hadoop Map Reduce Unclassified Feb 9, 2013 Auto indenting text in Sublime Text 2 Programming Jan 26, 2013 Hadoop MapReduce Framework Tutorials with Examples Hadoop Jan 5, 2013

2012

Do you spend hours formatting word documents? Office Dec 7, 2012 How to Format References in APA Style using Microsoft Word 2011 for Mac OSX Office Nov 9, 2012 How to do 'find and replace' within currently selected text in Microsoft Word 2011 for Mac OSX Office Nov 9, 2012 Adding scaladocs to a Scala function in Sublime Text 2 Programming Oct 31, 2012 Game Programming in Scala with Libgdx and Box2D Programming Oct 22, 2012 Simple Check-in 'Unknown Error' bug is fixed! Android Apr 28, 2012 One touch check-ins for foursquare Android Apr 14, 2012 Scrolling past the end of a file in Sublime Text 2 Programming Mar 18, 2012 Tips for using CDH's Hadoop Distribution with Amazon's S3 Hadoop Mar 9, 2012

2011

How to have page numbers start on page 3 using MS Word 2011 for Mac Office Dec 16, 2011 GeoPop - My First Android Game Android Nov 26, 2011 [link] Beginning Android Game Development Book Android Oct 27, 2011 Adding JARS to hive without using ADD JAR Hadoop Oct 20, 2011 The start of an online puzzle community? Personal Oct 4, 2011 Some fun Hadoop and Hive Bugs Hadoop Sep 20, 2011 A Warning About Apache Thrift's Asynchronous Clients Programming Apr 19, 2011 It has been a good week Personal Feb 28, 2011 Kindle store prices Personal Feb 25, 2011

2010

NullPointerException when performing a join with Hadoop Hive 0.50 Hadoop Dec 29, 2010 Remember stuff easier Personal Nov 7, 2010 Mini Squadron - my favorite android game of the moment Android Oct 9, 2010 Ruby JSON Serialization sucks for Time objects Programming Sep 28, 2010

2009

Operation Algorithm: Quicksort in C and Scheme Programming Nov 10, 2009

2008

Be Careful of Data Obtained From Bloomberg using RTD or BDP Functions Office Dec 3, 2008