Projects

Disaster Response
github

The project has three major parts: an ETL pipeline, an ML pipeline, and a web application that demonstrates the model's efficacy in real time on user input. The pipelines used the Python libraries pandas, NumPy, scikit-learn, NLTK, and SQLAlchemy. The front end used Flask and Plotly.js.

Predicting Heart Disease
github

Built a predictive model for heart disease using labeled data. The data were analyzed, visualized, and modeled using the Python libraries pandas, seaborn, and scikit-learn. The findings were summarized in a Medium blog post.

2019 World Series Prediction
github

Implemented an ML model to predict the outcome of the 2019 World Series. The back end was built with the Python libraries pandas, NumPy, and scikit-learn.


SC Real Estate Analysis
github

Built a Python application to identify the best areas for property investment in South Carolina, using available APIs and web scraping to collect rental and property values. Results were plotted as a zoomable heat map with Matplotlib.

Bacterial Analysis
github

Built a web app that visualizes data on bacteria found in belly-button swabs. Data analysis was performed using the Python libraries pandas, NumPy, and SQLAlchemy; the web app was built using Plotly.

Image Classifier
github

Built an ML model using PyTorch to recognize different species of flowers. The model used a pre-trained network and was trained and validated on labeled data.


Academic Trends
github

Built a D3.js visualization of data obtained from the US Census and the Behavioral Risk Factor Surveillance System. The data were cleaned using pandas.

Latitude Analysis
github

Built a web page that uses CSS and Bootstrap to display data obtained from OpenWeatherMap, analyzed with pandas, and visualized with Matplotlib.

Finding Donors
github

Built a predictive model for finding appropriate donors using labeled data. The data were analyzed, visualized, and modeled using the Python libraries pandas, Matplotlib, and scikit-learn.


UCB MiniProjects
github

A collection of MiniProjects that focus on data analysis and visualization.