Storage

Apache Hadoop HDFS is a distributed file system for storing large volumes of data. Apache Kudu completes Apache Hadoop’s storage layer, enabling fast analytics on fast data.

HDFS Overview
Provides an overview of Apache Hadoop HDFS, its benefits, and the key components.