Chieh-An Chang
HomeAboutExperienceCredentialsProjectsResumeContact

Chieh-An (Andy) Chang

Data Science & Data Engineering co-op candidate building analytics pipelines, machine learning models, and AI applications from messy data to deployable systems.

Projects

Case studies across data science, ML, engineering, AI, analytics, and quant work.

Filter the gallery by domain, then open each case study for the problem, data, approach, results, technology stack, and lessons learned.

AI system

PythonLangChainLangGraph
AI

Human-in-the-Loop Email Agent via LangChain

State-aware AI email assistant with gated tool execution

An AI safety-focused email agent using LangChain, LangGraph, prompt middleware, and human-in-the-loop controls to prevent unauthorized email actions.

AI SafetyAgentsHITLLangGraph

AI system

PythonLangChainLangGraph
AI

Multi-Agent Event Coordinator

LangGraph workflow for vendor sourcing, constraints, and text-to-SQL

A multi-agent coordinator that automates event-planning workflows with LangChain, LangGraph, LangSmith tracing, MCP tools, Tavily Search, and a self-correcting SQLite text-to-SQL agent.

Multi-AgentLLMOpsMCPText-to-SQL

AI system

PythonGoogle GeminiLangChain
AI

YouTube Summary RAG Video Analyzer

Gemini and FAISS app for transcript-grounded video Q&A

A RAG video analysis tool that chunks YouTube transcripts, embeds retrieval context with FAISS, and serves interactive summaries and question answering through Streamlit.

RAGLLMsVector SearchStreamlit

Data Science system

PythonPandasGeoPandas
Data Science

BC PM2.5 Short-Term Forecasting Report

ETL and statistical learning workflow for air-quality prediction

A forecasting project that automates public data retrieval, compresses large environmental datasets, and models short-term PM2.5 air pollution outcomes.

ForecastingETLAir QualityGeospatial

Analytics system

Power BIDAXPower Query
Analytics

Data Career Analytics Dashboard

Power BI dashboard across 479K job-market records

A Power BI analytics dashboard that visualizes data job trends, skill demand, and role-level market patterns across monthly CSV datasets.

DashboardJob MarketETLDAX

Machine Learning system

PythonScikit-learnPandas
Machine Learning

Customer Segmentation and Strategic Recommendation

K-Means segmentation for investment propensity strategy

A customer analytics project that uses multivariate EDA and K-Means clustering to identify customer segments and recommend targeted investment strategies.

ClusteringK-MeansCustomer AnalyticsStrategy

Machine Learning system

PythonPandasScikit-learn
Machine Learning

High-Dimensional Data Imputation via Group Lasso

Dimension reduction and imputation for noisy high-dimensional data

A machine learning project comparing Group Lasso and KNN-style imputation under missing-not-at-random settings and high-dimensional feature spaces.

ImputationGroup LassoMissing DataFeature Selection

Analytics system

PythonPandasA/B Testing
Analytics

Cancellation Policy for Ridesharing

A/B testing and root-cause analysis on large-scale ridesharing data

An analytics project that designs and evaluates a cancellation policy using A/B testing, root-cause analysis, and statistical validation.

A/B TestingRoot Cause AnalysisPolicyRidesharing

Data Science system

PythonPandasScikit-learn
Data Science

Predicting Falcon 9 Reusability

Classification pipeline for first-stage landing outcomes

A data science workflow predicting whether a Falcon 9 first stage will land successfully, using feature engineering, exploratory analysis, and model comparison.

ClassificationFeature EngineeringEDASpaceX

Machine Learning system

PythonPyTorchPandas
Machine Learning

Music Generation with Deep Learning

LSTM-based sequence model for generated music notes

A PyTorch deep learning project that encodes note sequences and trains an LSTM model for music generation.

Deep LearningLSTMMusic GenerationSequence Modeling

Quant system

RMCMCBayesian Inference
Quant

Modeling Equity Market Trends through GBM

Bayesian GBM and HMC for JNJ stock analysis

A quantitative research project under Professor Jazi using Bayesian Geometric Brownian Motion and Hamiltonian Monte Carlo to estimate and forecast Johnson & Johnson stock behavior.

GBMHMCForecastingJohnson & Johnson

Analytics system

RRegression AnalysisANOVA
Analytics

Student Learning Preference Analysis

Survey-based analysis of traditional and AI-enabled study tools

A survey research project under Professor Labadi analyzing how students use traditional resources, online platforms, and LLM tools for learning.

EducationGenerative AISurveyStatistical Testing