Projects
Here I list noteworthy projects that I have built (or added to) either alone or with a team.
Spotlight Projects
🗄️ RAG/Vector DB/BERT Fine Tune
- In this project, I implemented a RAG system. This included fine-tuning BERT on a Kubernetes cluster, comparing vector search libraries such as Annoy and FAISS, and using LangChain to build the pipeline.
- I worked with one other student; I handled the English data while they handled the German data.
- Disclaimer: The GitHub repository is structured like research code; my other projects better demonstrate industry-style structure.
- Presentation
- Vector DB Notebook
- RAG Notebook
- GitHub
📊 tab-err on PyPI (Open Source Contribution)
- While working at the BHT’s Calgo Lab with Sebastian Jaeger, I contributed to tab-err, a package on PyPI for realistic errors in tabular data.
- I wrote basic tests, developed a high-level API, modified some existing error types, and fixed bugs, all in Python.
- PyPI
- GitHub
🌐 K-Means Clustering of Cities
- An implementation of the K-Means algorithm that clusters high-population cities by their geographic coordinates.
- I wrote this in R Markdown for fun.
- Report
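A minimal Python sketch of the K-Means loop described above. It is not the R Markdown code from the report. The function name `kmeans`, the sample coordinates, and the use of plain Euclidean distance on (latitude, longitude) pairs are assumptions made for illustration; a real geographic analysis might prefer a great-circle distance.

```python
import math
import random


def kmeans(points, k, iterations=100, seed=0):
    """Cluster 2-D points with Lloyd's algorithm; returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct input points
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                mean = tuple(sum(axis) / len(members) for axis in zip(*members))
                new_centroids.append(mean)
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster in place
        if new_centroids == centroids:
            break  # centroids stopped moving, so the assignment is stable
        centroids = new_centroids
    return centroids, labels


# Approximate (latitude, longitude) pairs, used only as sample input.
cities = [
    (52.52, 13.40),   # Berlin
    (48.86, 2.35),    # Paris
    (51.51, -0.13),   # London
    (35.68, 139.69),  # Tokyo
    (37.57, 126.98),  # Seoul
    (31.23, 121.47),  # Shanghai
]
centroids, labels = kmeans(cities, k=2)
```

With `k=2`, the three European cities end up in one cluster and the three East Asian cities in the other.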
🕵️ CIA World Factbook Visualization
- In this project, I worked with a group of 4 other students to create a report on the CIA World Factbook.
- Five research questions guided our analysis; I answered the last two.
- Report
- GitHub
- CIA World Factbook
☁️ Weather Time Series Forecasting
- In this project, I worked with a team of 2 other students to implement various time series forecasting models.
- We used Kubernetes, Docker, and VS Code Remote SSH to prototype models on BHT Berlin’s V100 and A100 GPUs.
- I scraped the website of my friend’s weather station to acquire the data and implemented two models.
- Presentation
- GitHub
4️⃣ Anscombe’s Quartet
- This report demonstrates the importance of data visualization through an exploratory analysis of Anscombe’s Quartet in R.
- Written in R Markdown.
- Report
📖 Editing/Errata for Machine Learning Kurz & Gut
- I read the book Machine Learning Kurz und Gut in German and pointed out a few errors to the author.
- This shows some proficiency in written German.
- GitHub
🏦 Messenger Bot
- In this project, I set up messaging with the Telegram API, GitHub Actions, and yfinance to send information about the S&P 500 and crypto to my phone each day before the US stock exchange opens.
- A future direction I am interested in is having LLMs summarize news articles and send a daily report to my phone from a curated list of sources.
- GitHub
☁️ Business Intelligence Presentation
- This is the presentation for a team project analyzing a 2006 AOL USA search query dataset, which we hosted as an Exasol database on a virtual machine.
- We were required to augment the database with another dataset, so we chose weather events in the USA during 2006.
- Skills developed: SQL, Python, OLAP, ETL, data visualization, Tableau, and EDA.
- Presentation
⚡ Berlin Electric Vehicle Charger Geo-Visualizer
- In this project, I loaded data from various German government sources about the population and charging stations in various postal codes in Berlin.
- The app presents a heatmap overlaid on a map of Berlin, subdivided by postal code, with options for viewing population, number of charging stations, and estimated demand for new stations.
- Finally, a suggestion box lets users submit suggestions attached to a given postal code.
- Streamlit Page
🪙 Income Tax Calculator
- In this application, I loaded 2020 IRS data and used linear interpolation across tax brackets to estimate the median income of various ZIP codes in the United States.
- Streamlit Page
- Data
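One common way to estimate a median from bracketed counts is to find the bracket that contains the middle observation and interpolate linearly inside it. The Python sketch below illustrates that idea only; the function name `interpolated_median`, the `(lower, upper, count)` layout, and the sample numbers are assumptions, not the application's actual code or IRS figures.

```python
def interpolated_median(brackets):
    """Estimate a median from sorted (lower, upper, count) brackets.

    Assumes incomes are spread evenly within each bracket, so the median
    sits a proportional distance into the bracket that contains it.
    """
    total = sum(count for _, _, count in brackets)
    if total == 0:
        raise ValueError("brackets contain no observations")
    half = total / 2
    cumulative = 0
    for lower, upper, count in brackets:
        if count > 0 and cumulative + count >= half:
            # Fraction of this bracket needed to reach the middle observation.
            fraction = (half - cumulative) / count
            return lower + fraction * (upper - lower)
        cumulative += count
    raise ValueError("median bracket not found")


# Hypothetical counts of tax returns per income bracket for one ZIP code.
example = [
    (0, 25_000, 100),
    (25_000, 50_000, 200),
    (50_000, 100_000, 100),
]
print(interpolated_median(example))  # 37500.0
```

An open-ended top bracket has no upper bound, so this sketch would need an assumed cap for it; that only matters when the median falls in that bracket.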
🤖 NumPy Feed Forward Neural Network
- In this project, I implemented (alone, in NumPy, without the use of LLMs) a customizable feed forward neural network that can have an arbitrary number of layers and hidden units.
- I built it from the absolute basics, using only matrix and vector operations to implement forward propagation, backpropagation, and mini-batch stochastic gradient descent.
- GitHub
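The three pieces named above can be sketched in a few NumPy functions. This is an illustrative outline, not the code in the repository: the function names, the tanh hidden activations, the linear output layer, the squared-error loss, and the default hyperparameters are all assumptions.

```python
import numpy as np


def init_params(layer_sizes, rng):
    """One (W, b) pair per layer; [2, 8, 8, 1] gives two hidden layers of 8 units."""
    return [
        (rng.normal(0.0, np.sqrt(1.0 / n_in), size=(n_in, n_out)), np.zeros(n_out))
        for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])
    ]


def forward(params, x):
    """Forward propagation; returns the activations of every layer."""
    activations = [x]
    for i, (w, b) in enumerate(params):
        z = activations[-1] @ w + b
        is_output = i == len(params) - 1
        activations.append(z if is_output else np.tanh(z))
    return activations


def backward(params, activations, y):
    """Backpropagation for the batch-averaged squared error."""
    grads = []
    delta = 2.0 * (activations[-1] - y) / y.shape[0]  # dLoss/dz at the output
    for i in reversed(range(len(params))):
        w, _ = params[i]
        grads.append((activations[i].T @ delta, delta.sum(axis=0)))
        if i > 0:
            # Push the error through W, then through tanh (derivative 1 - a^2).
            delta = (delta @ w.T) * (1.0 - activations[i] ** 2)
    return grads[::-1]


def train(params, x, y, epochs=200, batch_size=16, lr=0.05, seed=0):
    """Mini-batch stochastic gradient descent over shuffled batches."""
    rng = np.random.default_rng(seed)
    for _ in range(epochs):
        order = rng.permutation(len(x))
        for start in range(0, len(x), batch_size):
            idx = order[start:start + batch_size]
            grads = backward(params, forward(params, x[idx]), y[idx])
            params = [
                (w - lr * gw, b - lr * gb)
                for (w, b), (gw, gb) in zip(params, grads)
            ]
    return params
```

Calling `init_params([2, 8, 8, 1], np.random.default_rng(0))` and passing the result to `train` with an input array of shape `(n, 2)` and targets of shape `(n, 1)` returns the updated parameters; changing the list of layer sizes changes the depth and width of the network.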
🏠 Housing Price Prediction Report
- In this project, I worked with a team of 3 other students to apply and compare several statistical learning methods in R on a real estate dataset from Kaggle.
- Report
- GitHub
🎓 Computer Science BS Senior Project Writeup
- This was the culminating work of my senior research project for my bachelor’s degree in computer science.
- I worked with a team of 3 people to build an applied deep learning model in Python using PyTorch, NumPy, Pandas, and Astropy to predict parameters of binary star systems.
- Report
- GitHub
📈 Data Mining Analysis of Energy Economics Time Series
- In this project, I worked with one other student to analyze a set of time series concerning energy economics data.
- During this project, we worked with faculty in the department of business and economics as well as the department of computer science.
- The main techniques employed were clustering of time series and sequence mining. Traditional techniques for forecasting were not explored.
- Report
- GitHub
⚙️ Toy Operating System
- In this project, while working through the textbook Operating System Concepts, 10th Edition, each of us had to individually implement a toy operating system, the Blitz OS, in KPL and assembly.
- Due to academic integrity constraints, I cannot publish my solution, but I found the course instrumental in deepening my understanding of how operating systems work.
- Book
- Blitz OS Description
- Blitz OS GitHub
⚙️ lsdir Implementation in C
- This was a live-coding midterm question, which I completed before the advent of ChatGPT.
- We were given only 50 minutes to complete the program, but I managed to get full points, including extra credit.
- GitHub
📡 TCP Client Server
- In this class project, I implemented a client-server architecture in C that exchanges messages over TCP/IP using UNIX sockets.
- Assignment Description
- GitHub
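The same client-server pattern can be sketched with Python's standard `socket` module. The original project is written in C, so this is only an illustration of the flow (bind, listen, accept on the server; connect, send, receive on the client). The function names and the upper-casing reply are assumptions and are not taken from the assignment.

```python
import socket
import threading


def serve_once(server_sock):
    """Accept one client, reply with its message in upper case, then close."""
    conn, _addr = server_sock.accept()
    with conn:
        data = conn.recv(1024)
        conn.sendall(data.upper())


def send_message(host, port, message):
    """Connect to the server, send one message, and return the reply."""
    with socket.create_connection((host, port)) as client:
        client.sendall(message.encode("utf-8"))
        return client.recv(1024).decode("utf-8")


def demo():
    """Run the server in a background thread and talk to it over loopback."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as server:
        server.bind(("127.0.0.1", 0))  # port 0 lets the OS pick a free port
        server.listen(1)
        port = server.getsockname()[1]
        worker = threading.Thread(target=serve_once, args=(server,))
        worker.start()
        reply = send_message("127.0.0.1", port, "hello")
        worker.join()
    return reply
```

Calling `demo()` returns the server's reply to the message `"hello"`. A single `recv(1024)` is enough for short messages like this one; longer messages would need a loop or a length prefix, because TCP delivers a byte stream rather than discrete messages.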
📡 UDP Load Balancing Server
- In this class project, I implemented a UDP load balancing server using UNIX sockets in C.
- The assignment description is in the GitHub repository, but as an MS Word file…
- GitHub
📝 Text Editor
- In this assignment, I followed a lab to implement a text editor in Java. While not the most involved project, it is still a fun one.
- GitHub