Data Engineer Β· Microsoft Stack Β· Big Data Β· Cloud-Native ETL
Building scalable, reliable data platforms with Azure, Fabric, and Spark.
Data Engineer with 5+ years of experience building scalable, reliable cloud data platforms using Microsoft Azure, Fabric, and Apache Spark. I specialize in turning fragmented, inconsistent data into trustworthy systems where small data quality issues don't become business problems.
My focus:
- Cloud Data Platforms β Microsoft Azure, Fabric, medallion architecture, cloud-native design
- Big Data Processing β Apache Spark, distributed computing, performance optimization
- Analytics Engineering β Dimensional modeling, analytics-ready data warehouses, Power BI
- Data Quality & Reliability β Validation, monitoring, observability, production patterns
π Vancouver, Canada Β· π¨π¦ Open to Data Engineer / Analytics Engineer roles Β· Public sector focus (TransLink, health authorities, government)
Cloud & Big Data Platforms
Data Warehousing & Analytics
Orchestration & Processing
Infrastructure & Tools
Architecture & Patterns
Medallion Architecture Β· Dimensional Modeling Β· Real-time Pipelines Β· Cloud-Native Design Β· Data Quality Validation Β· Observability
Enterprise-scale medallion lakehouse on Microsoft Fabric. Multi-region retail data platform (27 countries, 10B+ annual transactions).
- Incremental processing, data quality validation, analytics-ready models
- Unified customer / product / sales dimensional models
- Power BI integration for real-time reporting
Stack: Microsoft Fabric Β· OneLake Β· PySpark Β· Power BI Β· Big Data at scale
End-to-end data warehouse on real TransLink GTFS transit data using medallion architecture.
- Bronze β Silver β Gold layers with embedded data-quality checks
- Handled domain edge cases (GTFS times beyond 24:00)
- Dimensional models for ridership analysis and operational insights
- Public sector project (TransLink / transit authority relevance)
Stack: Python Β· SQL Server Β· Medallion Β· Dimensional Modeling
Production-grade ETL pipeline demonstrating enterprise reliability patterns.
- Apache Airflow orchestration, Spark distributed processing, AWS infrastructure
- Idempotency, failure handling, comprehensive monitoring and observability
- Real-world migration case study: legacy batch jobs β cloud-native DAGs
Stack: Apache Airflow Β· Spark Β· AWS Β· Production Patterns
Medallion-based lakehouse using Delta Lake + Unity Catalog.
- Governed data access, scalable PySpark transformations, reusable logic
Stack: Databricks Β· Delta Lake Β· Unity Catalog Β· PySpark
Health data integration using FHIR standards.
- Multi-source healthcare data consolidation, compliance-focused design
- Public sector angle (health authority data solutions)
Stack: FHIR Β· Healthcare APIs Β· Python Β· Data Integration
β Medallion Architecture β Bronze/Silver/Gold layering, clean separation of concerns
β Big Data at Scale β PySpark, distributed processing, performance optimization
β Cloud-Native Platforms β Microsoft Azure, Fabric, modern data lakehouse design
β Data Quality β Validation gates, anomaly detection, observability
β Analytics Engineering β Dimensional models, fact/dimension tables, Power BI
β Production Reliability β Failure handling, retries, monitoring, operational maturity
β Public Sector Data β TransLink, healthcare, government-relevant skills
- β Microsoft Certified: Azure Data Fundamentals (DP-900)
- π In progress: Microsoft Fabric Data Engineer (DP-700)
- π Building hands-on lakehouse projects on Microsoft Fabric & Databricks
Data Engineer / Analytics Engineer roles focused on:
- β Microsoft Azure / Fabric cloud platforms
- β Big data processing (Spark, distributed systems)
- β Medallion / lakehouse architectures
- β Analytics-ready data warehouse design
- β Public sector (TransLink, BC Public Service, health authorities, municipalities)
π Vancouver, Canada Β· Open to on-site / hybrid / remote
I'm open to collaborating on data platform projects, exploring new roles, or discussing pipelines and data architecture.
LinkedIn Β· Portfolio Β· Email
Build systems that remain reliable as complexity grows.



