Category: Data Engineering
-

Building a DevSecOps-Ready GCP ETL Pipeline for Multi-Year Google Sheet Survey Analysis
A behind-the-scenes look at how I engineered a secure, reproducible, analytics-ready GCP ETL pipeline that automates the ingestion and transformation of multi-language Google Sheet survey data; blending YAML preprocessing, Google API integration, data auditing, and DevSecOps principles into one cohesive system.
-

From Chaos to Clarity: A Lightweight Multilanguage ETL Pipeline for Excel <-> SharePoint Integration
A practical, human-centered data migration framework that blends Python, Bash, and JavaScript to automate the transition across SaaS platforms. Designed to make complex data accessible, secure, and easy to navigate for everyone.
-
Automating NYC InfoHub Data Collection with an Intelligent Excel Web Scraper
A look at how intelligent web scraping, concurrency, and security checks turn NYC InfoHub’s sprawling maze of Excel datasets into a fast, verifiable, and human-accessible data repository.
-
Why Microsoft VBA Still Matters: Automating Data Workflows in Excel with Visual Basic
Discover why Microsoft VBA still matters in 2025. Learn how simple Excel macros can automate data workflows, streamline reports, and empower nonprofits and education teams with real-world Visual Basic examples.
-
Regression Analysis for SARS-CoV-2 RNA Concentrations in NYC Wastewater
An analysis of the trends of SARS-CoV-2 RNA concentrations in NYC wastewater during the height of the COVID-19 pandemic that uses API Building, SQLite for database creation, manipulation, and storage, Pandas & Matplotlib for data visualization and analysis, and scikit-learn for Linear Regression.