Summary
A multi-region orchestration platform built on Kubernetes, Airflow, Meltano, and DBT, powering compliant and scalable data workflows across 12 Snowflake accounts with robust observability and infrastructure-as-code practices.
A Multi-Cluster, Multi-Cloud Orchestration ☁️
I’ve worked on scalable, resilient ETL processes using Apache Airflow, Meltano, and DBT, deployed across a distributed Kubernetes (AKS) architecture built for flexibility, compliance, and performance.
Context & Challenge 🧩
The platform spans five Kubernetes clusters (3 US, 2 EU) with production, beta, and staging environments. It supports dynamic ETL pipelines that adapt per environment—varying DAGs, parameters, secrets, and configs—while orchestrating data across 12 Snowflake accounts in the US, Canada, and Europe under strict compliance (e.g., GDPR).

Key Contributions 🧑💻
I contributed to and maintained various parts of the orchestration platform, supporting its reliability, scalability, and compliance across multiple environments:
- System logs to Datadog.
- Airflow logs persisted on Azure Blob for debugging.
Outcomes & Learnings 🚀
This project brought real-world scale and complexity to my data engineering experience.
It strengthened my ability to balance performance, reliability, and compliance in large-scale data platforms.