Portfolio

A snippet of personal (and some work-related) things I’ve been working on.

I’m fortunate that most of my day job is open source, you can check out my daily contributions on my github.


Machine Learning

Austin Airport TSA Wait Time Forecasting

I created and maintain a bluesky bot that posts a daily forecast for how busy the airport will be using a simple ARIMA model trained on historical TSA checkpoint volume data. As part of this work, I set up an ETL that processes hundreds of thousands of pages of PDFs posted to the TSA website.

Data flow diagram of the forecast bot Data flow diagram of the forecast bot

Example post: Relatively normal wait times expected tomorrow: Higher than 26% of days. Example prediction


Data Engineering

Austin Road Conditions

I created a bluesky bot that posts road condition updates using real-time automated sensor data.

Every 5 minutes, the script checks for changes to the road grip at available sensor locations around Austin from this open data portal dataset.

If there is a change in road conditions, a new post is created.

The code is open source and in theory could be deployed for other cities if the data were made available.

Example post: POOR roadway grip reported at FM 2222 RD / LAKEWOOD DR, was previously FAIR. Current roadway condition is Snow. Example post

Voter Turnout Scraping

For 2024’s early voting period I set up an ETL script that scraped live voter turnout data and plotted it alongside a comparison to the the 2020 election.

The archived post along with more visualizations is available here.

2024 live voter turnout comparison 2024 live voter turnout comparison


Data Visualization

2024 Elections Visualizations

One topic I frequently visualize is elections. My atx-elections-data repo contains several examples of code I have written to visualize elections mostly in Texas.

Precinct-level election shifts Precinct-level election shifts

Austin MetroBike Trips Visualization

Using a tool called flowmap.blue I was able to quickly visualize multiple years of docked bicycle data. It is featured on flowmap.blue’s examples page.

Created with: Python, Google sheets

A map displaying the trip density of bicycle trips between kiosks in Austin, Texas. Link to the interactive


Skills Summary

Programming:

  • Python (expert)
  • R (intermediate)
  • Javascript (intermediate)

Data Engineering:

  • Extract transform load (ETL) scripting with Python and dbt
  • Building and deploying Docker containers
  • SQL (Postgres, Oracle) for database administration and data extraction/transformation
  • Cloud orchestration with Prefect, on-premises orchestration with Apache Airflow
  • Amazon Web Services (AWS): S3, EC2
  • Google Cloud Service (GCS): BigQuery, Cloud Functions, Cloud Storage

Data Science/Machine Learning

  • Machine learning: Pytorch, XGBoost, Scikit learn
  • Experience applying deep learning, PCA, and supervised learning to real world problems

Business Intelligence

  • Power BI (expert)
  • Hex (expert)
  • Tableau (intermediate)
  • MicroStrategy (intermediate)
  • Geospatial analysis and mapping with ArcGIS Online, geopandas, postGIS

Education & Certifications

Education:

  • Master’s of Science in Data Science, The University of Texas at Austin. Dec 2024
  • Bachelor’s of Science in Aerospace Engineering, The University of Texas at Austin. Dec 2018

Certifications:

Master’s Coursework: