You are here

hive

A JSON read/write SerDe for Hive

Today I finished coding another SerDe for Hive which, with my employer's permission, I published on github here: https://github.com/rcongiu/Hive-JSON-Serde.git.

Since the code is still fresh in my mind, I thought I'd write another article on how to write a SerDe, since the official documentation on how to do it it scarce and you'd have to read the hive code directly like I had to do.

Writing a SerDe in Hive for Lwes event files

I am currently working to set up an OLAP data warehouse using Hive on top of Hadoop. We have a considerable amount of data that comes from the ad servers on which we need to perform various kinds of analysis.

Writing a map-reduce job is not difficult in principle – it's just time consuming and requires the skills of a trained java engineer, which wouldn't be needed were we using SQL. That's where hive comes in: it allows us to query an hadoop data store using a flavor of SQL.

 

Subscribe to RSS - hive