Explore the Data Lakehouse Stack, featuring Databricks, Apache Iceberg, and Delta Lake, for next-generation analytics. This stack delivers reliability, scalability, open standards, and seamless integration with the modern data ecosystem. Learn about the core libraries, connectors, data governance tools, and various open source repositories that fuel innovation in the lakehouse architecture. Stay ahead with robust ACID transaction support, schema evolution, and compatibility with Apache Spark, Presto, Trino, and more.
feature.delivery is a free, web-based platform that enables developers to monitor and consolidate software releases from multiple GitHub repositories into a single, streamlined chronological view. By centralizing release information across various tools, libraries, and services, feature.delivery simplifies the process of staying informed about the latest updates in a development stack. Stay ahead of the curve with feature.delivery, the free online tool designed to help developers effortlessly track and consolidate the latest releases from multiple GitHub repositories in one clean, chronological view. Whether you're managing a complex development stack or simply want to stay up to date with your favorite open-source projects, feature.delivery streamlines release tracking so you never miss an important update. By keeping up with the latest changes, developers can quickly adopt new features, enhance performance, and maintain a competitive edge in today's fast-moving tech landscape. Say goodbye to manual tracking and hello to smarter, faster development with feature.delivery.
how do I stay up to date with the latest features of the Data Lakehouse Stack?
how to keep up with the latest features in Data Lakehouse Stack?
what's new in Data Lakehouse Stack?
how to track latest features in Data Lakehouse Stack?

Staying up-to-date with latest features of the
Data Lakehouse Stack in 2026

How does it work?

feature.delivery is a free, web-based platform that helps developers track the latest releases from multiple GitHub repositories — all in one streamlined, chronological view. By centralizing release information across tools, libraries, and frameworks, feature.delivery makes it easier than ever to stay on top of the updates throughout your development stack.

Checkout this 1 minute intro video to see it in action

The Data Lakehouse Stack (Databricks, Apache Iceberg, Delta Lake) combines the scalability and low-cost storage of data lakes with the data management and reliability features of data warehouses. This stack enables unified analytics, real-time streaming, and seamless machine learning workflows, all while maintaining open standards and interoperability. The stack provides ACID transactions, schema enforcement, and compatibility with popular data processing engines, making it the go-to architecture for modern data engineering and analytics.

Here's a breakdown of the Data Lakehouse Stack into different categories

Core Table Formats

Core table formats ensure the foundational storage structure for the lakehouse, supporting ACID transactions, schema evolution, and time travel. These libraries are essential for reliable and performant data lake operations.

Delta Lake

delta-io/delta
An open-source storage layer that brings ACID transactions to Apache Spark and big data workloads.
what's new in Delta Lake?
how to track latest features in Delta Lake?
new updates in Delta Lake?
new features in Delta Lake?

Apache Iceberg

apache/iceberg
A high-performance format for huge analytic tables, enabling reliable data lake operations at scale.
what's new in Apache Iceberg?
how to track latest features in Apache Iceberg?
new updates in Apache Iceberg?
new features in Apache Iceberg?

Apache Hudi

apache/hudi
A transactional data lake framework that greatly simplifies incremental data processing and data pipeline development.
what's new in Apache Hudi?
how to track latest features in Apache Hudi?
new updates in Apache Hudi?
new features in Apache Hudi?

Data Processing Engines

Data processing engines provide the compute layer for executing complex queries, ETL pipelines, and real-time analytics on lakehouse data.

Apache Spark

apache/spark
Unified analytics engine for large-scale data processing, with built-in modules for SQL, streaming, and machine learning.
what's new in Apache Spark?
how to track latest features in Apache Spark?
new updates in Apache Spark?
new features in Apache Spark?

Trino

trinodb/trino
Distributed SQL query engine for big data, optimized for low-latency analytical queries across multiple data sources.
what's new in Trino?
how to track latest features in Trino?
new updates in Trino?
new features in Trino?

Presto

prestodb/presto
Distributed SQL query engine for running interactive analytic queries against data sources of all sizes.
what's new in Presto?
how to track latest features in Presto?
new updates in Presto?
new features in Presto?

Connectors and Integrations

Connectors bridge the core table formats with various data processing engines, cloud platforms, and BI tools, ensuring interoperability and flexible analytics.

delta-rs

delta-io/delta-rs
A Rust implementation of the Delta Lake protocol, providing bindings for Python, JVM, and other languages.
what's new in delta-rs?
how to track latest features in delta-rs?
new updates in delta-rs?
new features in delta-rs?

iceberg-spark

apache/iceberg/tree/master/spark
Apache Iceberg's Spark integration module for seamless data format compatibility.
what's new in iceberg-spark?
how to track latest features in iceberg-spark?
new updates in iceberg-spark?
new features in iceberg-spark?

trino-iceberg

trinodb/trino/tree/master/plugin/trino-iceberg
Trino's connector for Apache Iceberg, enabling high-performance SQL queries on Iceberg tables.
what's new in trino-iceberg?
how to track latest features in trino-iceberg?
new updates in trino-iceberg?
new features in trino-iceberg?

Data Governance and Catalogs

Governance and catalog tools provide metadata management, data discovery, and fine-grained access control, ensuring data is secure and easily discoverable.

Hive Metastore

apache/hive
Centralized metadata repository for managing table schemas and partition metadata.
what's new in Hive Metastore?
how to track latest features in Hive Metastore?
new updates in Hive Metastore?
new features in Hive Metastore?

Apache Atlas

apache/atlas
Open source metadata and governance platform for data cataloging, lineage, and policy enforcement.
what's new in Apache Atlas?
how to track latest features in Apache Atlas?
new updates in Apache Atlas?
new features in Apache Atlas?

Amundsen

amundsen-io/amundsen
Data discovery and metadata platform for improving productivity and data governance.
what's new in Amundsen?
how to track latest features in Amundsen?
new updates in Amundsen?
new features in Amundsen?

Streaming and Real-Time Processing

Streaming frameworks bring real-time data ingestion, transformation, and analytics to the lakehouse, supporting up-to-date insights.

Apache Flink

apache/flink
Stream processing framework for high-throughput, low-latency data analytics.
what's new in Apache Flink?
how to track latest features in Apache Flink?
new updates in Apache Flink?
new features in Apache Flink?

Spark Structured Streaming

apache/spark/tree/master/sql/core/src/main/scala/org/apache/spark/sql/streaming
Built-in streaming analytics engine in Apache Spark, supporting end-to-end exactly-once semantics.
what's new in Spark Structured Streaming?
how to track latest features in Spark Structured Streaming?
new updates in Spark Structured Streaming?
new features in Spark Structured Streaming?

Data Versioning and Lineage

Tools for data version control and lineage tracking ensure reproducibility, auditing, and compliance within the lakehouse.

lakeFS

treeverse/lakeFS
Git-like data version control for data lakes, enabling safe experimentation and rollbacks.
what's new in lakeFS?
how to track latest features in lakeFS?
new updates in lakeFS?
new features in lakeFS?

DataHub

acryldata/datahub
A modern data catalog that provides metadata management, lineage, and data discovery.
what's new in DataHub?
how to track latest features in DataHub?
new updates in DataHub?
new features in DataHub?

Orchestration and Workflow Management

Workflow orchestration tools manage complex ETL pipelines and ensure dependable, automated data movement in the lakehouse.

Apache Airflow

apache/airflow
Platform for programmatically authoring, scheduling, and monitoring workflows.
what's new in Apache Airflow?
how to track latest features in Apache Airflow?
new updates in Apache Airflow?
new features in Apache Airflow?

Dagster

dagster-io/dagster
Data orchestrator for machine learning, analytics, and ETL.
what's new in Dagster?
how to track latest features in Dagster?
new updates in Dagster?
new features in Dagster?

Dive deeper into the Data Lakehouse Stack by exploring these open source repositories on GitHub. Click on each URL to view the latest releases, community updates, and feature enhancements. Stay current and unlock the full potential of your data lakehouse architecture today!