Browsing All posts tagged under »hive«

A JSON read/write SerDe for Hive

July 11, 2011 by


Today I finished coding another SerDe for Hive which, with my employer’s permission, I published on github here: Since the code is still fresh in my mind, I thought I’d write another article on how to write a SerDe, since the official documentation on how to do it it scarce and you’d have to […]

Writing a Hive SerDe for LWES event files

October 27, 2009 by


I am currently working to set up an OLAP data warehouse using Hive on top of Hadoop. We have a considerable amount of data that comes from the ad servers on which we need to perform various kinds of analysis. Writing a map-reduce job is not difficult in principle – it’s just time consuming and […]