Airflow
Apache Airflow is an open-source platform for orchestrating data workflows, created at Airbnb in 2014 and later handed over to the Apache Foundation.
Apache Airflow is an open-source platform for orchestrating data workflows, created at Airbnb in 2014 and later handed over to the Apache Foundation.
It lets you define pipelines (extraction, transformation, loading, ML, reporting…) as DAGs (Directed Acyclic Graphs) in Python, then schedule, execute, monitor and manage their dependencies and error recovery.
It is one of the references for modern data engineering, competing with Dagster, Prefect, Argo Workflows and managed services (MWAA on AWS, Cloud Composer on GCP).
