How do I stay up to date with the latest features in Data Lakehouse Stack?

Use feature.delivery to track releases from 18 Data Lakehouse Stack repositories in one chronological view. Simply select the repositories you want to monitor and get automatic updates when new features are released.

What's new in Data Lakehouse Stack?

Stay informed about the latest Data Lakehouse Stack updates by monitoring releases from key repositories including Delta Lake, Apache Iceberg, Apache Hudi and more.

How to track latest features in Data Lakehouse Stack?

feature.delivery consolidates releases from multiple GitHub repositories into a single timeline, making it easy to track new features, bug fixes, and updates across the entire Data Lakehouse Stack ecosystem.

https://feature.delivery

?l=

Use this link to track latest updates across the 18 repositories in Data Lakehouse Stack

Staying up-to-date with latest features of the
Data Lakehouse Stack in 2026

How does it work?

feature.delivery is a free, web-based platform that helps developers track the latest releases from multiple GitHub repositories — all in one streamlined, chronological view. By centralizing release information across tools, libraries, and frameworks, feature.delivery makes it easier than ever to stay on top of the updates throughout your development stack.

Checkout this 1 minute intro video to see it in action

The Data Lakehouse Stack (Databricks, Apache Iceberg, Delta Lake) combines the scalability and low-cost storage of data lakes with the data management and reliability features of data warehouses. This stack enables unified analytics, real-time streaming, and seamless machine learning workflows, all while maintaining open standards and interoperability. The stack provides ACID transactions, schema enforcement, and compatibility with popular data processing engines, making it the go-to architecture for modern data engineering and analytics.

Here's a breakdown of the Data Lakehouse Stack into different categories

Core Table Formats

Core table formats ensure the foundational storage structure for the lakehouse, supporting ACID transactions, schema evolution, and time travel. These libraries are essential for reliable and performant data lake operations.

Delta Lake

delta-io/delta

Apache Iceberg

apache/iceberg

Apache Hudi

apache/hudi

Data Processing Engines

Data processing engines provide the compute layer for executing complex queries, ETL pipelines, and real-time analytics on lakehouse data.

Apache Spark

apache/spark

Trino

trinodb/trino

Presto

prestodb/presto

Connectors and Integrations

Connectors bridge the core table formats with various data processing engines, cloud platforms, and BI tools, ensuring interoperability and flexible analytics.

delta-rs

delta-io/delta-rs

iceberg-spark

apache/iceberg/tree/master/spark

trino-iceberg

trinodb/trino/tree/master/plugin/trino-iceberg

Data Governance and Catalogs

Governance and catalog tools provide metadata management, data discovery, and fine-grained access control, ensuring data is secure and easily discoverable.

Hive Metastore

apache/hive

Apache Atlas

apache/atlas

Amundsen

amundsen-io/amundsen

Streaming and Real-Time Processing

Streaming frameworks bring real-time data ingestion, transformation, and analytics to the lakehouse, supporting up-to-date insights.

Apache Flink

apache/flink

Spark Structured Streaming

apache/spark/tree/master/sql/core/src/main/scala/org/apache/spark/sql/streaming

Data Versioning and Lineage

Tools for data version control and lineage tracking ensure reproducibility, auditing, and compliance within the lakehouse.

lakeFS

treeverse/lakeFS

DataHub

acryldata/datahub

Orchestration and Workflow Management

Workflow orchestration tools manage complex ETL pipelines and ensure dependable, automated data movement in the lakehouse.

Apache Airflow

apache/airflow

Dagster

dagster-io/dagster

Dive deeper into the Data Lakehouse Stack by exploring these open source repositories on GitHub. Click on each URL to view the latest releases, community updates, and feature enhancements. Stay current and unlock the full potential of your data lakehouse architecture today!