HackerNoon Mobile

Better reading experience on the app
Your Definitive Guide to Lakehouse Architecture with Iceberg and MinIOby@minio
7,316 reads

Your Definitive Guide to Lakehouse Architecture with Iceberg and MinIO

August 8th 2023
22m
by @minio 7,316 reads
tldt arrow
EN
Read on Terminal Reader
Read this story w/o Javascript

Too Long; Didn't Read

Apache Iceberg seems to have taken the data world by storm. Initially incubated at Netflix by Ryan Blue, it was eventually transmitted to the Apache Software Foundation where it currently resides. At its core it is an open table format for at-scale analytic data sets (think hundreds of TBs to hundreds of PBs). It is a multi-engine compatible format. What that means is that Spark, Trino, Flink, Presto, Hive, and Impala can all operate independently and simultaneously on the data set. It supports the lingua franca of data analysis, SQL, as well as key features like full schema evolution, hidden partitioning, time travel, and rollback and data compaction. This post focuses on how Iceberg and MinIO complement each other and how various analytic frameworks (Spark, Flink, Trino, Dremio, and Snowflake) can leverage the two.
featured image - Your Definitive Guide to Lakehouse Architecture with Iceberg and MinIO
MinIO HackerNoon profile picture

@minio

MinIO


Receive Stories from @minio


Credibility

react to story with heart

RELATED STORIES

L O A D I N G
. . . comments & more!