Apache Durid is a real-time database to power modern analytics application.
The Apache Knox™ Gateway is an Application Gateway for interacting with the REST APIs and UIs of Apache Hadoop deployments.
Apache Calcite is a dynamic data management framework.
Llama is a Yarn Application Master that mediates the management and monitoring of cluster resources between Impala and Yarn.
Apache Kudu is an open source distributed data storage engine that makes fast analytics on fast and changing data easy.
Kite is a set of libraries, tools, examples, and documentation focused on making it easier to build systems on top of the Hadoop ecosystem.
Apache Impala is the open source, native analytic database for Apache Hadoop.
Apache Avro™ is a data serialization system.
Community governance is listed in the repository.
Apache DataFu is a collection of libraries for working with large-scale data in Hadoop.
Apache Giraph is an iterative graph processing system built for high scalability.
Quantcast File System (QFS) is a high-performance, fault-tolerant, distributed file system developed to support MapReduce processing, or other applications reading and writing large files sequentially.
The goal of the Apache Mahout™ project is to build an environment for quickly creating scalable, performant machine learning applications.
Apache Ignite In-Memory Computing, Database and Caching Platform
The Apache Hive (TM) data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL.
A software platform for processing vast amounts of data