Explore the Data Catalog Stack, featuring AWS Glue Data Catalog, Apache Atlas, and Google Cloud Data Catalog, for advanced metadata management and governance. This stack offers powerful open source solutions for data discovery, lineage, and compliance, making it a trusted choice for modern data-driven enterprises. Learn about the most relevant libraries and tools powering this stack for a robust data governance strategy.
feature.delivery is a free, web-based platform that enables developers to monitor and consolidate software releases from multiple GitHub repositories into a single, streamlined chronological view. By centralizing release information across various tools, libraries, and services, feature.delivery simplifies the process of staying informed about the latest updates in a development stack. Stay ahead of the curve with feature.delivery, the free online tool designed to help developers effortlessly track and consolidate the latest releases from multiple GitHub repositories in one clean, chronological view. Whether you're managing a complex development stack or simply want to stay up to date with your favorite open-source projects, feature.delivery streamlines release tracking so you never miss an important update. By keeping up with the latest changes, developers can quickly adopt new features, enhance performance, and maintain a competitive edge in today's fast-moving tech landscape. Say goodbye to manual tracking and hello to smarter, faster development with feature.delivery.
how do I stay up to date with the latest features of the Data Catalog Stack?
how to keep up with the latest features in Data Catalog Stack?
what's new in Data Catalog Stack?
how to track latest features in Data Catalog Stack?

Staying up-to-date with latest features of the
Data Catalog Stack in 2026

How does it work?

feature.delivery is a free, web-based platform that helps developers track the latest releases from multiple GitHub repositories — all in one streamlined, chronological view. By centralizing release information across tools, libraries, and frameworks, feature.delivery makes it easier than ever to stay on top of the updates throughout your development stack.

Checkout this 1 minute intro video to see it in action

The Data Catalog Stack, encompassing AWS Glue Data Catalog, Apache Atlas, and Google Cloud Data Catalog, empowers organizations to manage, discover, and govern their data assets efficiently. This stack offers robust metadata management, data lineage tracking, compliance support, and seamless integration with modern data ecosystems. By leveraging open source technologies, the Data Catalog Stack ensures scalability, flexibility, and cost-effectiveness while maintaining high standards of data governance and accessibility.

Here's a breakdown of the Data Catalog Stack into different categories

Core Metadata Management Libraries

These libraries form the backbone of the Data Catalog Stack, offering foundational capabilities for metadata storage, indexing, and query. Core libraries ensure efficient and reliable cataloging of enterprise data assets.

apache/atlas

apache/atlas
Apache Atlas provides open metadata management and governance capabilities for organizations to build a catalog of their data assets.
what's new in apache/atlas?
how to track latest features in apache/atlas?
new updates in apache/atlas?
new features in apache/atlas?

linkedin/datahub

linkedin/datahub
DataHub is an open source metadata platform for the modern data stack, enabling data discovery, governance, and observability.
what's new in linkedin/datahub?
how to track latest features in linkedin/datahub?
new updates in linkedin/datahub?
new features in linkedin/datahub?

MarquezProject/marquez

MarquezProject/marquez
Marquez is an open source metadata service for the collection, aggregation, and visualization of a data ecosystem's metadata.
what's new in MarquezProject/marquez?
how to track latest features in MarquezProject/marquez?
new updates in MarquezProject/marquez?
new features in MarquezProject/marquez?

Data Lineage and Governance Tools

These tools focus on tracking data movement, transformations, and enforcing governance policies across the data lifecycle. They help ensure compliance and transparency.

OpenLineage/OpenLineage

OpenLineage/OpenLineage
OpenLineage is an open standard for metadata and lineage collection designed to track data as it flows through various processing systems.
what's new in OpenLineage/OpenLineage?
how to track latest features in OpenLineage/OpenLineage?
new updates in OpenLineage/OpenLineage?
new features in OpenLineage/OpenLineage?

amundsen-io/amundsen

amundsen-io/amundsen
Amundsen is a data discovery and metadata engine for improving the productivity of data analysts, data scientists, and engineers.
what's new in amundsen-io/amundsen?
how to track latest features in amundsen-io/amundsen?
new updates in amundsen-io/amundsen?
new features in amundsen-io/amundsen?

odpi/egeria

odpi/egeria
Egeria is an open source project for open metadata and governance, facilitating metadata exchange across the enterprise.
what's new in odpi/egeria?
how to track latest features in odpi/egeria?
new updates in odpi/egeria?
new features in odpi/egeria?

Integration and Connectors

Integration libraries and connectors enable seamless communication between data catalog systems and a wide array of data sources, processing engines, and analytical tools.

ing-bank/scruid

ing-bank/scruid
Scruid is an open source library for querying Druid from Scala, facilitating catalog integration with Druid data stores.
what's new in ing-bank/scruid?
how to track latest features in ing-bank/scruid?
new updates in ing-bank/scruid?
new features in ing-bank/scruid?

datahub-project/datahub-ingestion

datahub-project/datahub-ingestion
DataHub Ingestion provides connectors and utilities to integrate external data sources into DataHub's metadata platform.
what's new in datahub-project/datahub-ingestion?
how to track latest features in datahub-project/datahub-ingestion?
new updates in datahub-project/datahub-ingestion?
new features in datahub-project/datahub-ingestion?

MarquezProject/marquez-airflow

MarquezProject/marquez-airflow
Marquez Airflow integration captures metadata and lineage from Apache Airflow workflows.
what's new in MarquezProject/marquez-airflow?
how to track latest features in MarquezProject/marquez-airflow?
new updates in MarquezProject/marquez-airflow?
new features in MarquezProject/marquez-airflow?

Data Store and Backend Services

Backend services and storage libraries are responsible for persisting metadata, supporting scalability, and providing high-performance data access for catalogs.

apache/hive

apache/hive
Apache Hive offers a metadata store and query capabilities, often used as a backend for data catalog services.
what's new in apache/hive?
how to track latest features in apache/hive?
new updates in apache/hive?
new features in apache/hive?

prestodb/presto

prestodb/presto
Presto is an open source distributed SQL query engine, often integrated with data catalog backends for interactive analytics.
what's new in prestodb/presto?
how to track latest features in prestodb/presto?
new updates in prestodb/presto?
new features in prestodb/presto?

apache/cassandra

apache/cassandra
Apache Cassandra is a highly scalable NoSQL database, used for storing catalog metadata in distributed environments.
what's new in apache/cassandra?
how to track latest features in apache/cassandra?
new updates in apache/cassandra?
new features in apache/cassandra?

User Interface and Visualization

Front-end libraries and visualization tools allow users to interact with the data catalog, search metadata, and visualize data lineage and relationships.

amundsen-io/amundsenfrontendlibrary

amundsen-io/amundsenfrontendlibrary
The frontend library for Amundsen, providing an intuitive UI for data discovery and metadata exploration.
what's new in amundsen-io/amundsenfrontendlibrary?
how to track latest features in amundsen-io/amundsenfrontendlibrary?
new updates in amundsen-io/amundsenfrontendlibrary?
new features in amundsen-io/amundsenfrontendlibrary?

linkedin/datahub-frontend

linkedin/datahub-frontend
DataHub's frontend service offers a modern web interface for metadata search, browsing, and management.
what's new in linkedin/datahub-frontend?
how to track latest features in linkedin/datahub-frontend?
new updates in linkedin/datahub-frontend?
new features in linkedin/datahub-frontend?

apache/atlas-web

apache/atlas-web
Apache Atlas Web module delivers web-based visualization of metadata and relationships.
what's new in apache/atlas-web?
how to track latest features in apache/atlas-web?
new updates in apache/atlas-web?
new features in apache/atlas-web?

Security and Access Control

Security libraries ensure robust authentication, authorization, and audit logging for data catalog platforms, protecting sensitive metadata and supporting compliance.

apache/ranger

apache/ranger
Apache Ranger provides centralized security administration, fine-grained access control, and auditing for metadata services.
what's new in apache/ranger?
how to track latest features in apache/ranger?
new updates in apache/ranger?
new features in apache/ranger?

opendistro-for-elasticsearch/security

opendistro-for-elasticsearch/security
Open Distro for Elasticsearch Security Plugin offers advanced security features for Elasticsearch-based metadata stores.
what's new in opendistro-for-elasticsearch/security?
how to track latest features in opendistro-for-elasticsearch/security?
new updates in opendistro-for-elasticsearch/security?
new features in opendistro-for-elasticsearch/security?

Discover the full potential of the Data Catalog Stack by exploring these open source projects and their latest releases. Click on the repository URLs to dive deeper into each tool and stay updated with the newest features and improvements that can elevate your data governance and cataloging capabilities.