Google
Current tag: gzip
Comparison of decompress ways in Hadoop
Written on:May 7, 2009
0

I have recently made some programming using Hadoop, which is a framework for massive data processing over many numbers of servers. Hadoop reads input data from (large) files and performs MapReduce data reductions. One method of reading is input is decompressing GZIP:ed files. Java has built-in support for reading gzipped stream using GZIPInputStream. However, Hadoop ships with its own implementation that uses native libraries for efficiency. I was curious about…
Read more...

