Explore the comprehensive Data Quality Stack, including Great Expectations, Deequ, and Apache Griffin, to optimize your data integrity and governance. Learn how this stack facilitates robust data validation, profiling, and monitoring with a vibrant ecosystem of open source libraries. Enhance your data pipelines and ensure high-quality analytics with the best-in-class data quality tools.
feature.delivery is a free, web-based platform that enables developers to monitor and consolidate software releases from multiple GitHub repositories into a single, streamlined chronological view. By centralizing release information across various tools, libraries, and services, feature.delivery simplifies the process of staying informed about the latest updates in a development stack. Stay ahead of the curve with feature.delivery, the free online tool designed to help developers effortlessly track and consolidate the latest releases from multiple GitHub repositories in one clean, chronological view. Whether you're managing a complex development stack or simply want to stay up to date with your favorite open-source projects, feature.delivery streamlines release tracking so you never miss an important update. By keeping up with the latest changes, developers can quickly adopt new features, enhance performance, and maintain a competitive edge in today's fast-moving tech landscape. Say goodbye to manual tracking and hello to smarter, faster development with feature.delivery.
how do I stay up to date with the latest features of the Data Quality Stack?
how to keep up with the latest features in Data Quality Stack?
what's new in Data Quality Stack?
how to track latest features in Data Quality Stack?

Staying up-to-date with latest features of the
Data Quality Stack in 2025

How does it work?

feature.delivery is a free, web-based platform that helps developers track the latest releases from multiple GitHub repositories — all in one streamlined, chronological view. By centralizing release information across tools, libraries, and frameworks, feature.delivery makes it easier than ever to stay on top of the updates throughout your development stack.

Checkout this 1 minute intro video to see it in action

The Data Quality Stack, featuring Great Expectations, Deequ, and Apache Griffin, is a powerful suite of open source tools designed to ensure data integrity, reliability, and trustworthiness across data pipelines. Leveraging this stack helps organizations automate data validation, profiling, and cleansing, thereby reducing errors, improving compliance, and boosting confidence in data-driven decision-making. With a rich ecosystem and active community, the Data Quality Stack stands out as a leading solution for maintaining high standards in data engineering workflows.

Here's a breakdown of the Data Quality Stack into different categories

Core Data Quality Libraries

These libraries form the backbone of the Data Quality Stack, providing essential frameworks for data validation, profiling, and governance. They are widely adopted for their scalability, flexibility, and integration capabilities across data environments.

Great Expectations

great-expectations/great_expectations
A leading open-source library for data validation, documentation, and profiling, supporting a wide range of data sources.
what's new in Great Expectations?
how to track latest features in Great Expectations?
new updates in Great Expectations?
new features in Great Expectations?

Deequ

awslabs/deequ
A scalable library built on Apache Spark for defining and running data quality checks on large datasets.
what's new in Deequ?
how to track latest features in Deequ?
new updates in Deequ?
new features in Deequ?

Apache Griffin

apache/griffin
An open source data quality solution for big data, offering both batch and streaming data quality measurement.
what's new in Apache Griffin?
how to track latest features in Apache Griffin?
new updates in Apache Griffin?
new features in Apache Griffin?

Profiling and Monitoring Tools

These tools provide advanced capabilities for data profiling, monitoring, and anomaly detection, essential for maintaining ongoing data quality in dynamic environments.

Soda Core

sodadata/soda-core
A modern open source data monitoring tool for detecting, analyzing, and resolving data quality issues.
what's new in Soda Core?
how to track latest features in Soda Core?
new updates in Soda Core?
new features in Soda Core?

OpenLineage

OpenLineage/OpenLineage
A standard for metadata and data lineage collection, enabling data quality tracking across pipelines.
what's new in OpenLineage?
how to track latest features in OpenLineage?
new updates in OpenLineage?
new features in OpenLineage?

Marquez

MarquezProject/marquez
An open source metadata service for the collection, aggregation, and visualization of data lineage.
what's new in Marquez?
how to track latest features in Marquez?
new updates in Marquez?
new features in Marquez?

Data Profiling and Exploration

Libraries focused on automated data profiling and exploration to understand data distributions, outliers, and data schema before implementing data quality checks.

pandas-profiling

pandas-profiling/pandas-profiling
Generates extensive profiling reports from pandas DataFrames, aiding in initial data quality assessment.
what's new in pandas-profiling?
how to track latest features in pandas-profiling?
new updates in pandas-profiling?
new features in pandas-profiling?

ydata-profiling

ydataai/ydata-profiling
Successor to pandas-profiling, providing enhanced data profiling and report generation for dataframes.
what's new in ydata-profiling?
how to track latest features in ydata-profiling?
new updates in ydata-profiling?
new features in ydata-profiling?

Data Quality Orchestration & Workflow

These tools support the orchestration and automation of data quality checks as part of broader ETL and data pipeline workflows.

Dagster

dagster-io/dagster
An orchestration platform for data assets, making it easy to include data quality checks in pipeline workflows.
what's new in Dagster?
how to track latest features in Dagster?
new updates in Dagster?
new features in Dagster?

Airflow

apache/airflow
A platform to programmatically author, schedule, and monitor workflows, often used to schedule data quality checks.
what's new in Airflow?
how to track latest features in Airflow?
new updates in Airflow?
new features in Airflow?

Prefect

PrefectHQ/prefect
A workflow management system that makes it easy to build, run, and monitor data quality tasks.
what's new in Prefect?
how to track latest features in Prefect?
new updates in Prefect?
new features in Prefect?

Data Quality for Data Lakes and Warehouses

Specialized libraries and plugins for implementing data quality checks within popular data lake and data warehouse environments.

great_expectations-expectations

great-expectations/awesome-expectations
A repository of community-contributed Great Expectations expectations for various data sources.
what's new in great_expectations-expectations?
how to track latest features in great_expectations-expectations?
new updates in great_expectations-expectations?
new features in great_expectations-expectations?

dbt-expectations

calogica/dbt-expectations
A dbt package for data testing using Great Expectations-style checks within your dbt workflows.
what's new in dbt-expectations?
how to track latest features in dbt-expectations?
new updates in dbt-expectations?
new features in dbt-expectations?

great-expectations-dbt

great-expectations/great_expectations-dbt
Community plugin to integrate Great Expectations with dbt for seamless data quality checks.
what's new in great-expectations-dbt?
how to track latest features in great-expectations-dbt?
new updates in great-expectations-dbt?
new features in great-expectations-dbt?

Data Governance and Metadata

Tools that enhance data governance by providing metadata management, data cataloging, and lineage for better quality control.

Amundsen

amundsen-io/amundsen
A data discovery and metadata platform for improving the productivity of data analysts, data scientists, and engineers.
what's new in Amundsen?
how to track latest features in Amundsen?
new updates in Amundsen?
new features in Amundsen?

DataHub

datahub-project/datahub
An open-source metadata platform for the modern data stack, enabling rich metadata management and lineage.
what's new in DataHub?
how to track latest features in DataHub?
new updates in DataHub?
new features in DataHub?

Discover the full potential of the Data Quality Stack by exploring each repository’s release page. Click on the URLs to see the latest features, updates, and integrations that can enhance your data quality initiatives.