Hi!
I’m currently a Data Scientist at Lyft, and recently graduated from a Master’s degree in Data Science at the University of Washington. I’m interested in the intersection of machine learning, impactful analytics, and product strategy.
Project Portfolio @ UW
Federated Semi-supervised and Transfer Learning [Report] [Slides] [GitHub]
- Empirical investigation of semi-supervised image classification tasks within federated learning framework, in sponsorship with Brendan McMahan’s group at Google AI.
Keywords: TensorFlow, Federated Learning, Privacy, Machine Learning, Image Classification, Semi-supervised Learning
Can you convince me? [Report] [GitHub]
- Machine learning analysis of likelihood of argumentative persuasion leveraging graph-based user similarity, on users of r/ChangeMyView with large-scale dataset of over 1.6B comments.
Keywords: Python, Apache Spark, Google BigQuery, SimRank, Graph Analysis
- Implementation and enhancement of Neural Style Transfer technique using Convolutional Neural Networks and PyTorch.
Keywords: PyTorch, Google Colab, Deep Learning, Neural Style Transfer
Statistical Machine Learning [GitHub]
- Implementation of machine learning and gradient descent algorithms from scratch using Python.
Keywords: Linear Regression, Logistic Regression, Gradient Descent, Kernel Support Vector Machine, Principal Component Analysis, Cross Validation, Multi-class Classification, L2- and L1-regularization
Machine Learning for Big Data [GitHub]
- Implementation of large scale machine learning algorithms from scratch using Python and Spark.
Keywords: Spark, Market Basket Analysis, Recommendation Systems, Locality-Sensitive Hashing, Principal Component Analysis, k-means Clustering, PageRank and HITS, Dense Communities in Networks, Decision Tree Learning, Data Streaming Algorithms
Early Childhood Education Factor Analysis [Report] [GitHub]
- Hypothesis testing of relationship between childhood behavioral and socioeconomic factors and academic performance.
Keywords: R, Linear Regression, Hypothesis Testing
Yelpagram [Tableau]
- Data visualization prototype for Yelp influencer platform, aimed at building user trust, combatting fake reviews, and consolidating information to reduce cognitive load.
Keywords: Tableau, Data Visualization, Human-Centered Design
Education
- M.S. Data Science, University of Washington
- B.S. Bioengineering, University of Pennsylvania
Professional Experience
- Data Scientist @ Lyft
- Product Analyst Intern @ Google
- Data Scientist @ Memorial Sloan Kettering Cancer Center
- Actuarial Consultant @ Aon
