Mar 11 – 12, 2024
Universitätshauptgebäude
Europe/Berlin timezone

Bridging the gap between data lakes and RDBMSs - Efficient query processing with Parquet

Not scheduled
20m
HS 024 (Universitätshauptgebäude)

HS 024

Universitätshauptgebäude

Fürstengraben 1 07743 Jena
Poster Poster Poster

Speaker

Alice Rey

Description

In the age of massive data, time-intensive loading phases make databases less viable for data exploration tasks.
Still, the highly optimized query engines of database systems are greatly beneficial for the performance of data analysis tasks.
With our research, we want to bridge this gap and provide paramount analytical performance without the need of static data loading.
Our approach enables the integration of Parquet files --- one of the most used columnar file formats in data lakes --- into the data processing pipeline of a database system in a convenient way. We allow end-users to benefit from the database system performance without a costly and time-consuming loading phase.

Type of Poster A solution

Primary author

Presentation materials