Built and optimized cloud data workflows, backend systems, an internal Azure analytics dashboard, and automated test coverage for cloud consumption, telemetry, and billing analytics.
- Migrated legacy Scala logic to PySpark in Azure Synapse, rewriting 20+ notebooks and data pipelines for Fortune 500 consumption and billing analytics.
- Cut runtime of 3 production Spark pipelines by 75% through optimized joins, partitioning, cluster scaling, and Managed Identity authentication.
- Built ETL workflows using Cosmos DB, Cosmos SCOPE, and Kusto/KQL to ingest, validate, and onboard 80+ telemetry metrics.
- Owned and enhanced 150+ Playwright E2E tests across Product, Consumption, and Quality areas for an Azure Usage dashboard.