/

data-integration

data-engineering
elt
etl
python
data
data-orchestrator
data-pipelines
data-science
workflow
scheduler
pipeline
orchestration
machine-learning
apache
data-analysis
data-pipeline
bigquery
change-data-capture
data-collection
sql
airflow
apache-airflow
dag
automation
snowflake
java
mssql
redshift
dagster
analytics
metadata
mlops

airbytehq/airbyte
502日前13.2k

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

apache/seatunnel
501日前7.0k

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

mage-ai/mage-ai
501日前6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

cloudquery/cloudquery
503日前5.4k

The open source high performance data integration platform built for developers.

kestra-io/kestra
501日前5.4k

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.