Apache Impala Reference
Performance Considerations
Performance Best Practices
Query Join Performance
Table and Column Statistics
Generating Table and Column Statistics
Runtime Filtering
Min/Max Filtering
Bloom Filtering
Late Materialization of Columns
Partitioning
Partition Pruning for Queries
Understanding Performance using EXPLAIN Plan
Understanding Performance using SUMMARY Report
Understanding Performance using Query Profile
Scalability Considerations
Scaling Limits and Guidelines
Hadoop File Formats Support
Using Text Data Files
Using Parquet Data Files
Using ORC Data Files
Using Avro Data Files
Using RCFile Data Files
Using SequenceFile Data Files
Ports Used by Impala
Transactions
Configure Impala Daemon to spill to HDFS