Brandon I. King

Example Data Science Projects

NBA Data Analysis

Implemented analysis and data visualization concepts to display NBA player statistics on a web application using Heroku and Python libraries

Cannabis Analysis

Automated a data warehouse to store cannabis product data and visualize product analysis using MySQL, Mode Analytics, Tableau, and Python libraries

Detect Credit Card Fraud

Implemented machine learning algorithms (Logistic Regression, Decision Tree, Neural Networks, and Gradient Boosting) to detect credit card fraud using Python libraries (numpy, pandas, and scikit-learn)

Predicting Sales

Forecasted monthly sales with a Long Short-Term Memory (LSTM) network using Python libraries (keras and scikit-learn)

Chatbot

Chatbot that uses deep learning techniques (Natural Language Processing) to interact with customers via a graphical chat interface, built with Python libraries (keras, numpy, nltk, and tkinter)

Database Systems Project

Created a robust database system using SQL to provide a command-line user interface for information storage and retrieval

Decision Tree in R

Implemented the Decision Tree algorithm in R, using the Gini index and information gain as split criteria to predict outcomes
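The two split criteria named above can be sketched in a few lines. The project itself was written in R; this is only an illustrative Python sketch, and the function names (`gini`, `entropy`, `information_gain`) and the toy labels are assumptions made for the example, not code from the project.

```python
from collections import Counter
from math import log2


def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def entropy(labels):
    """Shannon entropy (in bits) of the class distribution."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((count / n) * log2(count / n) for count in Counter(labels).values())


def information_gain(parent, children):
    """Parent entropy minus the size-weighted entropy of the child nodes."""
    n = len(parent)
    weighted = sum(len(child) / n * entropy(child) for child in children)
    return entropy(parent) - weighted


# Hypothetical toy data: a balanced parent node and one candidate split.
parent = ["yes"] * 5 + ["no"] * 5
left = ["yes"] * 4 + ["no"]
right = ["yes"] + ["no"] * 4

parent_gini = gini(parent)
gain = information_gain(parent, [left, right])
```

A decision tree would evaluate candidate splits like this one and keep the split with the lowest weighted impurity (Gini) or the highest information gain.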

Beat the Bookie

Applied predictive modeling algorithms (decision tree) in R to improve the odds of making a profit on small bet lines

Research

Presentations & Publications

Recent Research: Applying Machine Learning Methods for Insight into Textile Recycling Behavior

Abstract:

The purpose of this study was to investigate supervised machine learning models’ performance to determine the critical factors for textile recycling behavior (recycle textiles or do not recycle textiles). Secondary data from a survey given to 1,054 participants were analyzed. Six parameters were varied: feature scaling, cross-validation techniques, sampling techniques, number of folds, hyperparameters, and feature importance. Five algorithms were compared: decision tree, linear support vector classifier (linear SVC), K-nearest neighbor (KNN), gradient boosting decision trees (GBDT), and random forest. The hyperparameters used were the measure of impurity for decision tree and random forest, the number of nearest neighbors for KNN, and the learning rate for GBDT. The best performing model based on the F1 score was random forest on oversampled data. Feature importance ranked zip code, gender, and ethnicity as the top three features. Zip code may rank highly because of its high cardinality. With permutation feature importance, the top three features were type of dwelling, gender, and ethnicity. Implications for textile and apparel survey researchers are given.

Side Coding Projects

Things I love to do in my spare time!

Cannabis

Cannabis Analysis

Database and Data Visualization for Cannabis Products

NBA

NBA Player Analysis

Simple web application that performs web scraping and data visualization of NBA player statistics

Sudoku

Sudoku

Sudoku game solver that uses the backtracking algorithm, built with the Python library pygame
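For readers unfamiliar with backtracking, here is a minimal sketch of the idea: fill the first empty cell with a digit that breaks no rule, recurse, and undo the choice if the recursion hits a dead end. It leaves out the pygame interface entirely, and the names `is_valid`, `solve`, and `puzzle` are assumptions for illustration rather than the project's actual code.

```python
def is_valid(board, row, col, digit):
    """Check that digit does not already appear in the row, column, or 3x3 box."""
    if digit in board[row]:
        return False
    if any(board[r][col] == digit for r in range(9)):
        return False
    box_row, box_col = 3 * (row // 3), 3 * (col // 3)
    for r in range(box_row, box_row + 3):
        for c in range(box_col, box_col + 3):
            if board[r][c] == digit:
                return False
    return True


def solve(board):
    """Solve the 9x9 board in place; 0 marks an empty cell. Returns True on success."""
    for row in range(9):
        for col in range(9):
            if board[row][col] == 0:
                for digit in range(1, 10):
                    if is_valid(board, row, col, digit):
                        board[row][col] = digit
                        if solve(board):
                            return True
                        board[row][col] = 0  # undo the choice and try the next digit
                return False  # no digit fits here, so backtrack
    return True  # no empty cells remain


# Example puzzle chosen for illustration (0 = empty cell).
puzzle = [
    [5, 3, 0, 0, 7, 0, 0, 0, 0],
    [6, 0, 0, 1, 9, 5, 0, 0, 0],
    [0, 9, 8, 0, 0, 0, 0, 6, 0],
    [8, 0, 0, 0, 6, 0, 0, 0, 3],
    [4, 0, 0, 8, 0, 3, 0, 0, 1],
    [7, 0, 0, 0, 2, 0, 0, 0, 6],
    [0, 6, 0, 0, 0, 0, 2, 8, 0],
    [0, 0, 0, 4, 1, 9, 0, 0, 5],
    [0, 0, 0, 0, 8, 0, 0, 7, 9],
]
solved = solve(puzzle)
```

The board is modified in place, so after a successful call `puzzle` holds the completed grid.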

Certs

Programming languages: Python, Java, C
Quantitative and statistical analysis tools: R, VBA, JMP, MATLAB, SAS
Database tools: SQL
Data visualization tools: Tableau
Soft skills: Team Oriented, Visionary Thinker, Dependable, Organizer, Flexible

Achievements

Machine Learning Cert

July 2020

GradCert in Data Science Foundations

May 2020

Ph.D. in Textile Technology and Management

December 2021

Master of Operations Research

December 2017

B.S. in Applied Mathematics

May 2015

Photos

Tableau

Sales

Chat

Sudoku

SQL

Questions about Projects?

Please feel free to reach out to me at bking2415@gmail.com

#BlackLivesMatter