Friday, January 24, 2014

Parquet: efficient column store on Hadoop

I got this from my collaborator Joey Gonzalez. Cloudera is backing up Apache Parquet, an efficient column store on top of Hadoop. Which is an open source version of Google Dremel.

If you like to hear more about Cloudera's vision about new trends data science, you should attend our 3rd GraphLab Conference to hear Josh Wills, director of data science @ Cloudera.

No comments:

Post a Comment