Projects
Here I list noteworthy projects that I have built (or added to) either alone or with a team.
4️⃣ Anscombe’s Quartet
- Details the importance of data visualization by an exploratory analysis of Anscombe’s Quartet in R.
- Written in R Markdown
- Report
📊 tab-err PyPi (Open Source Contribution)
- Working at the BHT’s Calgo Lab with Sebastian Jaeger I added to the tab-err, realistic errors in tabular data, package on PyPi.
- Wrote basic tests, developed a high level api, and modified some othe error types as well as work on bug fixes in python.
- PyPi
- GitHub
📖 Editing/ Errata Addition of Machine Learning Kurz & Gut
- I read the book Machine Learning Kurz und Gut in German and pointed out a few errors to the author.
- Shows some proficiency in German written communication.
- Github
🌐 K-Means Clustering of Cities
- Implementation of the K-Means algorithm to cluster high population cities based on geographic coordinates.
- I wrote this in R Markdown for fun.
- Report
🏦 Messenger Bot
- In this project, I setup messaging with the Telegram API, GitHub actions, and yfinance to ping my phone each day before the US Stock Exchange opens with information about the S&P 500 and crypto.
- A future direction I am interested in is having LLMs summarize news articles and send a daily report to my phone from a curated list of sources.
- GitHub
☁️ Business Intelligence Presentation
- This is a presentation for a team project to analyze data from a 2006 AOL USA search query dataset which we hosted as an Exasol database on a Virtual Machine.
- We were required to augment the database with another dataset, so we chose weather events in the USA during 2006.
- The skills developed were: SQL, Python, OLAP, ETL, Data Visualization, Tableau, and EDA.
- Presentation
⚡ Berlin Electric Vehicle Charger Geo-Visualizer
- In this project, I loaded data from various German government sources about the population and charging stations in various postal codes in Berlin.
- A heatmap overlay of a map of Berlin, subdivided by postal code is presented with options for viewing population, number of charging stations, and estimated new station demand.
- Finally, a suggestion box functionality allows users to input suggestions attached to a given postal code.
- Streamlit Page
🪙 Income Tax Calculator
- In this application I loaded 2020 IRS data and used linear interpolation of tax brackets to compute the estimated median income of various zip codes in the United States.
- Streamlit Page
- Data
🤖 NumPy Feed Forward Neural Network
- In this project, I implemented (alone, in NumPy, without the use of LLMs) a customizable feed forward neural network that can have an arbitrary number of layers and hidden units.
- This was done from the absolute basics, just matrix & vector operations to implement forwardpropagation, backpropagation, and minibatched stochastic gradient descent.
- GitHub
🏠 Housing Price Prediction Report
- In this project, I worked with a team of 3 other students to apply and compare several different statistical learning methodolgies in R on a real estate dataset from Kaggle.
- Report
- GitHub
🗣️ Statistics Research Presentation
- In this project, I present an example application of a novel statistical test to reaction times data.
- This was worked on under the supervision of Professor Kimihiro Noguchi of the WWU Mathematics department.
- We obtained funding from the WWU Mathematics department to travel to the 2024 Pacific inland Undergraduate Mathematics Conference (PiMUC) to present.
- Presentation
🎓 Computer Science BS Senior Project Writeup
- This project was the culminating work of my senior research project in my bachelor’s of computer science.
- I worked with a team of 3 people to build an applied deep learning model in Python using PyTorch, NumPy, Pandas, and Astropy to predict parameters of binary star systems.
- Report
- GitHub
🧮 Joint Mathematics Meetings Poster
- In this poster, I presented research regarding the simulation study aspect of the development of a novel class of statistical tests at the 2024 Joint Mathematics Meetings in San Francisco.
- This was done under supervision of Professor Kimihiro Noguchi of the WWU Mathematics department.
- We obtained funding from the WWU Mathematics department to travel to and stay in San Francisco for the 2024 Joint Mathematics Meetings.
- Poster
📈 Data Mining Analysis of Energy Economics Time Series
- In this project, I worked with one other student to analyze a set of time series concerning energy economics data.
- During this project, we worked with faculty in the department of business and economics as well as the department of computer science.
- The main techniques employed were clustering of time series and sequence mining. Traditional techniques for forecasting were not explored.
- Report
- GitHub
⚙️ Toy Operating System
- In this project, as we worked through the textbook: Operating Systems Concepts 10th Edition, we had to individually implement (in KPL & Assembly) a toy operating system, the Blitz OS.
- Due to academic integrity constraints I cannot publish my solution for this but I found the course instrumental in developing a deepened understanding of how operating systems work.
- Book
- Blitz OS Description
- Blitz OS GitHub
⚙️ lsdir Implementation in C
- This was a midterm live coding question which I did before the advent of ChatGPT.
- We were given only 50 minutes to complete the program but I managed to get full points including extra credit.
- GitHub
📡 TCP Client Server
- In this class project, I implemented a client server architecture that allowed for the communication of messages using TCP/IP and UNIX sockets in C.
- Assignment Description
- GitHub
📡 UDP Load Balancing Server
- In this class project, I implemented a UDP load balancing server using UNIX sockets in C.
- The assignment description is on the GitHub but in MS Word…
- GitHub
📝 Text Editor
- In this assignment, I followed a lab to implement a text editor in Java. While not the most involved project, it is still a fun one.
- GitHub