Hi!

I’m currently a Data Scientist at Lyft, and recently graduated from a Master’s degree in Data Science at the University of Washington. I’m interested in the intersection of machine learning, impactful analytics, and product strategy.

Project Portfolio @ UW

Federated Semi-supervised and Transfer Learning [Report] [Slides] [GitHub]

  • Empirical investigation of semi-supervised image classification tasks within federated learning framework, in sponsorship with Brendan McMahan’s group at Google AI.
  • Keywords: TensorFlow, Federated Learning, Privacy, Machine Learning, Image Classification, Semi-supervised Learning

Can you convince me? [Report] [GitHub]

  • Machine learning analysis of likelihood of argumentative persuasion leveraging graph-based user similarity, on users of r/ChangeMyView with large-scale dataset of over 1.6B comments.
  • Keywords: Python, Apache Spark, Google BigQuery, SimRank, Graph Analysis

ArtNet [Report] [GitHub]

  • Implementation and enhancement of Neural Style Transfer technique using Convolutional Neural Networks and PyTorch.
  • Keywords: PyTorch, Google Colab, Deep Learning, Neural Style Transfer

Statistical Machine Learning [GitHub]

  • Implementation of machine learning and gradient descent algorithms from scratch using Python.
  • Keywords: Linear Regression, Logistic Regression, Gradient Descent, Kernel Support Vector Machine, Principal Component Analysis, Cross Validation, Multi-class Classification, L2- and L1-regularization

Machine Learning for Big Data [GitHub]

  • Implementation of large scale machine learning algorithms from scratch using Python and Spark.
  • Keywords: Spark, Market Basket Analysis, Recommendation Systems, Locality-Sensitive Hashing, Principal Component Analysis, k-means Clustering, PageRank and HITS, Dense Communities in Networks, Decision Tree Learning, Data Streaming Algorithms

Early Childhood Education Factor Analysis [Report] [GitHub]

  • Hypothesis testing of relationship between childhood behavioral and socioeconomic factors and academic performance.
  • Keywords: R, Linear Regression, Hypothesis Testing

Yelpagram [Tableau]

  • Data visualization prototype for Yelp influencer platform, aimed at building user trust, combatting fake reviews, and consolidating information to reduce cognitive load.
  • Keywords: Tableau, Data Visualization, Human-Centered Design

Education

  • M.S. Data Science, University of Washington
  • B.S. Bioengineering, University of Pennsylvania

Professional Experience

  • Data Scientist @ Lyft
  • Product Analyst Intern @ Google
  • Data Scientist @ Memorial Sloan Kettering Cancer Center
  • Actuarial Consultant @ Aon