Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings
View JeffWilliams2's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report JeffWilliams2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
JeffWilliams2/ReadMe.md

About Me

Data Engineer designing and operating production-grade batch and streaming pipelines. Experienced in building end-to-end data platforms across ingestion, orchestration, transformation, and analytics using Spark, Airflow, dbt, PostgreSQL, Snowflake, and AWS, with a focus on data modeling and reliability.

Tech Stack

Python PostgreSQL R NumPy Pandas Apache Spark Apache Airflow Docker Kubernetes AWS Streamlit Figma

Featured Projects

Banking CDC Pipeline Netflix dbt Snowflake Pipeline

AWS EMR Spark Setup Travel Recommendations AWS

Stock Sector Streamlit App Airbnb Trend Tableau

S&P 500 Comparison App DNA Nucleotide Counting App

Books Reading:

  • Data Warehouse Toolkit
  • Fundementals of Data Engineering
  • Designing Data-Intensive Applications

Pinned Loading

  1. PortfolioProjects PortfolioProjects Public

    This repository contains a collection of portfolio projects.

    Jupyter Notebook 1

  2. realtime-banking-cdc-pipeline realtime-banking-cdc-pipeline Public

    Real-Time Banking CDC Pipeline: PostgreSQL → Debezium → Kafka → Snowflake with DBT transformations

    Python

  3. fed_speech_recognition fed_speech_recognition Public

    Analyze market movements by correlating speech transcripts with real-time price data using OpenAI's Whisper model.

    Jupyter Notebook

  4. netflix-dbt-snowflake netflix-dbt-snowflake Public

    Data pipeline using dbt and snowflake for movie analytics with dimensional modeling and testing.

  5. aws-emr-spark-demo aws-emr-spark-demo Public

    Spark data processing on AWS EMR: PySpark transformations, SQL aggregations.

    Python

  6. travel-destination-generator travel-destination-generator Public

    Leverages AWS Bedrock and Claude 3 Sonnet to analyze your interests and generate travel recommendations.

    TypeScript

Morty Proxy This is a proxified and sanitized view of the page, visit original site.