harish@portfolio:~$
Harish Gandhi
whoami

Harish
Gandhi

Data Engineer at TextNow, based in Toronto, Canada. IIT Chicago alum (M.S. Computer Science, 3.72 GPA) with hands-on experience across healthcare and analytics domains. I specialize in designing and scaling ETL pipelines, data automation, and machine learning integrations — bridging the gap between business needs and technical execution. Winner of the TextNow AI Hackathon 2025. PyData Chicago speaker & NumFOCUS contributor.

Projects

A selection of projects spanning machine learning, NLP, data engineering, and voice interfaces.

AI Support Assistant

Winner of the TextNow AI Hackathon 2025. Built an AI-powered support assistant that classifies user questions against Slack history, suggests relevant past answers, and escalates unmatched queries via chatbot to Jira with contextual ticket creation.

AI NLP Slack API Jira Chatbot Hackathon Winner
Calorie Count Policy

Calorie Count Policy

Analyzed the impact of mandatory calorie labeling on restaurant menu choices using causal inference and statistical modeling. Presented at PyData Chicago.

Python Causal Inference Statistics
View Notebook
Recruiter Analytics

Recruiter Analytics

Built a web analytics dashboard that tracks recruiter engagement patterns on LinkedIn profiles using web scraping and data visualization pipelines.

Web Scraping Analytics Visualization
View on GitHub
Crime Prediction

Crime Prediction

Applied supervised ML models (Random Forest, SVM, XGBoost) on Chicago crime dataset to predict crime type and location, achieving 85%+ classification accuracy.

Machine Learning Scikit-learn XGBoost
View Notebook
Tirukkural Alexa Skill

Tirukkural Alexa Skill

Designed and published an Amazon Alexa skill that recites couplets from the ancient Tamil classic Tirukkural, with contextual explanations and keyword search.

Alexa SDK Node.js NLP
Learn More
NewsOptimism

NewsOptimism

NLP pipeline that crawls major news outlets, applies sentiment analysis and topic modeling, then surfaces a daily "optimism score" with positive story highlights.

NLP Sentiment Analysis NLTK
Live Demo
Patient Activity Tracker

Patient Activity Tracker

Computer vision system for monitoring patient activity and posture in clinical settings, using image classification and time-series anomaly detection.

Computer Vision TensorFlow Healthcare
Learn More

Skills

Languages & Scripting
Python SQL Scala Shell Script
Data Engineering & Orchestration
Apache Airflow dbt Azure Data Factory Databricks Spark ETL Pipelines
Cloud & Big Data
Snowflake AWS (S3, EC2, Lambda, MKS, MWAA) Azure (Cosmos DB, Data Lake, Synapse) Kafka
Containerization & CI/CD
Docker Kubernetes Jenkins Git
Databases
SQL Server PostgreSQL MongoDB
Visualization & Tools
Redash Tableau JupyterHub

Specialties: ETL Development • Data Modeling • Data Quality & Accuracy • API Integration • Data Pipeline Optimization • Machine Learning Deployment

"If you hang around the barbershop long enough… sooner or later you're gonna get a haircut." — Denzel Washington

Education

Illinois Institute of Technology, Chicago

Master of Science in Computer Science

2016 – 2018  |  GPA: 3.72 / 4.0

Concentration in Data Science, Software Engineering & Database Systems

Data Science Software Engineering Database Systems Machine Learning

Certifications

Google Cloud / Coursera
ML with TensorFlow on GCP Specialization Art and Science of Machine Learning
Stanford University
Mining Massive Datasets Statistical Learning
deeplearning.ai / Coursera
Neural Networks and Deep Learning

Speaking & Community

PyData Chicago

Speaker — June 2018. Presented text mining and causal inference research on restaurant calorie labeling policy.

NumFOCUS

Guest Speaker — June 2018 – Present. Contributing to open-source scientific computing community outreach.

TextNow AI Hackathon 2025

Winner — 2025. Built an AI support assistant that classifies questions against Slack history, suggests relevant answers, and escalates to Jira with contextual ticket creation.

Volunteering

SpirIT-ED Manager & LEAP Coordinator (2011–2013). Led community engagement and educational outreach initiatives.

Contact

I'm always open to discussing data science projects, research collaborations, or just connecting. Reach out through any of the channels below.