Chieh-An Chang
HomeAboutExperienceCredentialsProjectsResumeContact

Chieh-An (Andy) Chang

Data Science & Data Engineering co-op candidate building analytics pipelines, machine learning models, and AI applications from messy data to deployable systems.

Resume

Chieh-An (Andy) Chang

Data Science & Data Engineering co-op candidate building analytics pipelines, machine learning models, and AI applications from messy data to deployable systems.

Download Resume PDF

Education

Sep 2025 - Apr 2027 (Expected)

Master of Data Science and Artificial Intelligence (Co-op)

University of Waterloo

  • CGPA: 4.00/4.00 (91.0/100); nominated for the Vector Institute Scholarship.
  • Relevant coursework: Data Visualization, Machine Learning, Data Engineering, Distributed Systems, Big Data, and Vector Database.

Sep 2019 - Jun 2025

Honours Bachelor of Science, Double Major in Computer Science and Statistics

University of Toronto

  • Graduated with High Distinction; CGPA: 3.81/4.00 with an 89.8/100 average in 300/400-level coursework.
  • Dean's List Scholar in 2021, 2022, 2024, and 2025.
  • Shifted from Computer Science to a Computer Science and Statistics double major to connect AI engineering, data science, machine learning, and probabilistic inference.

Experience

May 2026 - Aug 2026 (4 mos)

Data Engineer

Investment Management Corporation of Ontario (IMCO) - Toronto, ON

Azure Data Factory, SQL Server Management Studio, Databricks, Snowflake, Power BI

  • Automated front-office data pipelines via metadata-driven orchestration, ensuring data availability by integrating SSMS and ADF.
  • Secured data reliability by engineering Medallion architectures via Azure Data Lake Store and Databricks, removing landing errors.
  • Empowered Total Portfolio and Capital Markets (TPCM) decisions by routing Databricks outputs seamlessly into Snowflake.
  • Accelerated project delivery, meeting Agile Sprint objectives, by managing CI/CD via Azure DevOps, Git, and Liquibase.

Jan 2024 - Aug 2025

Teaching Assistant

University of Toronto

  • Mentored 240+ students through tutorials on SQL, statistical inference, probability, Bayesian statistics, and database fundamentals.
  • Improved student satisfaction with 90%+ positive feedback by explaining research ideas and technical concepts through 35+ office hours.
  • Supported fair assessment for 2,000+ students with consistent rubric-based marking and 100% on-time delivery.

Project Experience

AI

Human-in-the-Loop Email Agent via LangChain

An AI safety-focused email agent using LangChain, LangGraph, prompt middleware, and human-in-the-loop controls to prevent unauthorized email actions.

PythonLangChainLangGraphOpenAI APIMiddlewareTool Calling

AI

Multi-Agent Event Coordinator

A multi-agent coordinator that automates event-planning workflows with LangChain, LangGraph, LangSmith tracing, MCP tools, Tavily Search, and a self-correcting SQLite text-to-SQL agent.

PythonLangChainLangGraphLangSmithMCPText-to-SQL

AI

YouTube Summary RAG Video Analyzer

A RAG video analysis tool that chunks YouTube transcripts, embeds retrieval context with FAISS, and serves interactive summaries and question answering through Streamlit.

PythonGoogle GeminiLangChainFAISSStreamlit

Data Science

BC PM2.5 Short-Term Forecasting Report

A forecasting project that automates public data retrieval, compresses large environmental datasets, and models short-term PM2.5 air pollution outcomes.

Contact

Email
c84chang@uwaterloo.ca
Location
Toronto, ON, Canada

Skills

Python - AdvancedSQL - AdvancedDatabricks / Snowflake - AdvancedMicrosoft Azure - IntermediatePandas / GeoPandas - AdvancedScikit-learn - AdvancedLangChain / LangGraph - IntermediateLLMs / RAG / Vector Search - IntermediatePower BI / DAX - IntermediateR / Statistical Modeling - AdvancedSciPy / Statsmodels - IntermediatePyTorch - IntermediateMatplotlib / Seaborn / Plotly - IntermediateReact / Next.js - IntermediateSupabase / Postgres - IntermediateDocker / AWS / Vercel - Intermediate

Certifications

AWS Certified Cloud Practitioner

Amazon Web Services (AWS) - Expected Jun 2026

Databricks Certified Data Engineer Associate

Databricks - Expected May 2026

SQL (Advanced, Intermediate, Basic)

PythonPandasGeoPandasScikit-learnSciPyStatsmodels

Analytics

Data Career Analytics Dashboard

A Power BI analytics dashboard that visualizes data job trends, skill demand, and role-level market patterns across monthly CSV datasets.

Power BIDAXPower QueryM LanguageExcel/CSV

Machine Learning

Customer Segmentation and Strategic Recommendation

A customer analytics project that uses multivariate EDA and K-Means clustering to identify customer segments and recommend targeted investment strategies.

PythonScikit-learnPandasNumPyMatplotlibSeaborn

HackerRank - Issued Nov 2025

Python for Data Science, AI & Development

IBM - Issued Jul 2025

Competitions

GenAI Genesis 2026

UTMIST & Google Developer Group - Mar 2026

AI & Data Science for Good Hackathon

Waterloo.AI - Mar 2026

Statistical Modelling (Supervised Learning)

University of Toronto Kaggle competition - Nov 2024 - Dec 2024