Using the Airflow ShortCircuitOperator to Stop Bad Data from Reaching ETL Pipelines

I'm a huge fan of Apache Airflow and how the open-source tool enables data engineers to scale data pipelines by more precisely orchestrating workloads. 

But what happens when Airflow testing doesn’t catch all of your bad data? What if “unknown unknown” data quality issues fall through the cracks and affect your Airflow jobs? 

CategoriesUncategorized