preloader

I’m a

Data Scientist Software Engineer Database Engineer Adventerous Hiker Active Photographer Sourdough Enthuasist Graduate Student
Dive in

KNOW MORE
ABOUT ME

Hello! I recently graduated with a Master’s degree at the University of Southern California and I’m actively looking for a Data Scientist position! I have experience in an analytical role leveraging “off the shelf” models to build a transparent solution that is understood by both a technical and non-technical audience.
DOWNLOAD MY RESUME      
about-me

EXPERIENCE

  • Data Scientist

    Genentech May 2021-Present

    • Design and recommend the best principle investigator and clinical site that will enroll the required diverse patients

    • Improve Search Engine for Medical Content Inquiries

  • Course Producer / Grader

    University of Southern California Jan 2020-Dec 2020

    • Guided over 93 students to obtain practical experience in applying machine learning to video games.

    • Co-managed 36 student projects over 2 semesters; secured and setup GCP and AWS environments for 18 teams.

    • Surveyed students on staff objectives for published paper; 78% gained applicable practical experience from the class.

    • Published research paper on arXiv and presented paper at EAAI-21.

    Check out the repositories: https://github.com/csci-599-applied-ml-for-games, https://github.com/CS527Applied-Machine-Learning-for-Games

    Read the paper
  • Directed Research Developer

    University of Southern California Sep 2019–Dec 2019

    • Restructured web app for analyzing and comparing multiple software systems development histories.

    • Incorporated 2 Plotly features for visualizing how and to what extent developers influence software quality.

    • Resolved 5 issues on Plotly displays resulting in a clearer visualization of history analysis by queried elements.

  • Data Science Intern

    WarnerBros. Entertainment Jun 2019–Aug 2019

    • Prototyped image recognition model on AWS to identify DC characters; identified Batman with 80% accuracy.

    • Examined over 30,000 DC images by designing Python ETL pipeline for automated collection and augmenting.

    • Reduced training time for a generalized model by 33% using transfer learning.

    • Demoed findings in presentation to WB data team and DCU senior manager by testing 2 DC characters with 74% accuracy.

  • SE - Web Development

    San Jose State University Sep 2017-May 2018

    • Accelerated task completion in over 60 yearly projects by overhauling ticketing system in Google Scripts.

    • Improved existing SQL queries leading to easier storage and faster retrieval of user data.

    • Visualized attendance and subscription data in over 30 performances into actionable insights.

LANGUAGES

Python 95%
SQL 95%
Java 90%
R 85%
JavaScript 80%
NoSQL 80%
Google Script 80%
Scala 75%

EDUCATION

  • Master’s Degree in Computer Science - Data Science

    University of Southern California Aug 2018-Dec 2020

  • Bacheleor’s Degree in Computer Science

    San Jose State University Aug 2014-May 2018

    Minor in Mathematics; Cum Laude

TECHNOLOGY STACK

TensorFlow
TensorFlow
Keras
Keras
PyTorch
PyTorch
Spark
Spark
Hadoop
Hadoop
Matplotlib
Matplotlib
Plotly
Plotly
Scikit-Learn
Scikit-Learn
Numpy
Numpy
Pandas
Pandas
Tidyverse
Tidyverse
seaborn
seaborn
Docker
Docker
AWS
AWS
GCP
GCP
Linux
Linux
Ubuntu
Ubuntu
Firebase
Firebase
Github
Github
Bitbucket
Bitbucket
MongoDB
MongoDB
MySQL
MySQL
WordPress
WordPress
Tableau
Tableau

TECHNICAL KNOWLEDGE

K-Nearest Neighbors
K-Nearest Neighbors
Decision Trees
Decision Trees
Random Forests
Random Forests
SVM
SVM
Naive Bayes
Naive Bayes
A-priori
A-priori
Logistic Regression
Logistic Regression
Time Series
Time Series
Data Mining
Data Mining
Information Retrieval
Information Retrieval
Data Visualization
Data Visualization
NLP
NLP
Computer Vision
Computer Vision
CNN’s
CNN’s
A2C
A2C
Reinforcement Learning
Reinforcement Learning
PPO
PPO
Recommendation Systems
Recommendation Systems
LSTMs
LSTMs
Deep Learning
Deep Learning
GANs
GANs
Map/Reduce
Map/Reduce
Statistical Analysis
Statistical Analysis
Predictive Forecasting
Predictive Forecasting
Search Engines
Search Engines
Web Crawlers
Web Crawlers
Page Rank
Page Rank
HDFS
HDFS
Query Execution
Query Execution
Graph Traversal
Graph Traversal

Portfolio - Under Construction!

LA Neighborhood Scores
LA Neighborhood Scores

Data Science

Time Series Analysis

Big Data Analytics

Databases

Cloud Services

Machine Learning

25 Feb, 21
Predicting Statistical Causes of Wildfires
Predicting Statistical Causes of Wildfires

Data Science

Time Series Analysis

Machine Learning

Databases

12 Feb, 21
IR and Web Search Engines
IR and Web Search Engines

Information Retrieval

Cloud Services

Data Science

12 May, 20
LoL Win Predector
LoL Win Predector

Data Science

12 May, 20
Pommerman
Pommerman

Machine Learning

Deep Learning

Reinforcement Learning

Cloud Services

26 Feb, 20
Amazon Recommender
Amazon Recommender

Natural Language Processing

Machine Learning

Data Science

12 Nov, 18

Fun Facts

Years of Experience

0

Years of Experience

Miles Hiked

0

Miles Hiked

FINISHED PROJECTS

0

FINISHED PROJECTS

CUPS OF COFFEE

0

CUPS OF COFFEE

CUPS OF TEA

0

CUPS OF TEA

EAGLE SCOUT AWARD

0

EAGLE SCOUT AWARD