Lecture

PAX: Hybrid Solution for Efficient Data Storage

Description

This lecture introduces PAX (Partition Attributes Across), a hybrid storage layout that decomposes each slotted page into mini-pages, one per attribute, to improve cache-friendliness and reduce memory-access delays when a query touches only a few attributes. It discusses how PAX can replace NSM in place, since it changes only the layout within a page and not which records the page holds, its adoption by major database systems, and its benefits for analytical queries. The lecture also covers Parquet, a columnar storage format from the Hadoop ecosystem, which enables efficient data processing by letting queries read only the relevant columns and which supports nested data structures.
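
The per-attribute mini-page idea described above can be illustrated with a small sketch. This is not the lecture's code or any database system's implementation: the function names (`to_pax_page`, `read_record`, `scan_attribute`), the schema, and the sample rows are all hypothetical, and Python lists stand in for the fixed-size byte regions a real page would use.

```python
from typing import Any, Dict, List, Sequence, Tuple

Row = Tuple[Any, ...]
PaxPage = Dict[str, List[Any]]


def to_pax_page(rows: Sequence[Row], schema: Sequence[str]) -> PaxPage:
    """Regroup the rows of one page into one mini-page per attribute.

    The page still holds exactly the same records as the row-oriented
    (NSM) page; only the arrangement inside the page changes.
    """
    minipages: PaxPage = {name: [] for name in schema}
    for row in rows:
        for name, value in zip(schema, row):
            minipages[name].append(value)
    return minipages


def read_record(page: PaxPage, schema: Sequence[str], slot: int) -> Row:
    """Rebuild a full record by taking the same slot from every mini-page."""
    return tuple(page[name][slot] for name in schema)


def scan_attribute(page: PaxPage, name: str) -> List[Any]:
    """Scan a single attribute: only its mini-page is touched."""
    return page[name]


# Hypothetical sample data for illustration only.
schema = ("id", "name", "age")
rows = [(1, "Ann", 31), (2, "Bob", 45), (3, "Cy", 27)]

page = to_pax_page(rows, schema)
ages = scan_attribute(page, "age")
second = read_record(page, schema, 1)
```

In this sketch an analytical query such as an average over `age` walks one contiguous list instead of stepping over `id` and `name` in every record, which is the cache benefit the description refers to, while `read_record` shows that a full record is still recoverable from within the same page.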
