Group: Apache ORC
Sort: popular | newest
A shim layer for supporting various versions of Hadoop dynamically. This module uses a higher version of Hadoop so that we can create shims that let us use new features of Hadoop without having a hard dependency on the latest version.
Last Release on Nov 25, 2019
ORC is a self-describing type-aware columnar file format designed for Hadoop workloads. It is optimized for large streaming reads, but with integrated support for finding required rows quickly. Storing data in a columnar format lets the reader read, decompress, and process only the values that are required for the current query.
Last Release on Nov 26, 2019