Machine Learning Projects
AI and Data Analysis
Machine Learning & AI
Projects involving artificial intelligence, machine learning models, and data analysis.
Projects
NLP Project (Dr. Seuss + Shrek Stories)
April 2025
An exploration of creativity and coherence in AI-generated writing using vastly different literary sources:
- Built to explore creativity and coherence in AI-generated writing using vastly different literary sources
- Trained an RNN to generate coherent text based on a blend of Shrek scripts and Dr. Seuss stories
- Focused on NLP model fine-tuning for narrative style and syntactic coherence
This project combined two very different writing styles to create a unique language model that could generate coherent text blending both narrative approaches.
Housing Prices Prediction
Kaggle Competition | March 2025
A machine learning project focused on improving real estate price estimation:
- Tackled the problem of price estimation inconsistencies in regional real estate data
- Built an XGBoost model to predict home prices; reduced RMSE from 200K to 80K
- Discovered bedroom count as a key predictor through feature analysis
This project achieved significant improvement in prediction accuracy, reducing error by 60%.
Model Notebook | Executive Summary
Car Price Prediction Project
Kaggle Competition | February 2025
A neural network approach to used car marketplace valuation:
- Created to simulate used car marketplace valuation using historical feature-rich datasets
- Developed a neural network model with log-transformed targets to improve accuracy
- Validated model improvements with lower prediction variance
Focused on improving prediction accuracy through advanced neural network techniques and data transformations.
Bike Rental Demand Prediction
Kaggle Competition | January 2025
A business-focused forecasting model for bike rental inventory planning:
- Targeted business needs to better forecast rental spikes for inventory and maintenance planning
- Preprocessed and optimized data to train a high-accuracy neural network
- Tuned hyperparameters and model architecture to improve demand forecasts
This project addressed real-world business needs for inventory and maintenance scheduling.
Model Notebook | Executive Summary
Bank Customer Retention ML Model
Kaggle Competition | January 2025
A customer churn prediction model to help banks retain customers:
- Designed to help banks proactively identify customers at risk of leaving based on behavioral patterns
- Created a random forest model to predict customer churn, boosting recall from 25% to 69%
- Focused on customer segmentation and actionable ML-driven targeting
Achieved a significant improvement in recall, nearly tripling the model’s ability to identify at-risk customers.
EDA Notebook | Executive Summary
AI Chatbot for Truck Dispatching
Smart ETA Technologies - Developer Intern Project
A Python-based AI chatbot designed to improve truck dispatching efficiency and compliance:
- Built with Ollama for AI model integration
- Python implementation for natural language processing
- Enhanced truck dispatching workflow efficiency
- Improved compliance tracking through intelligent automation
- Data Integration: Connected with Google Sheets using Python, Pandas, and Seaborn for analysis and visualization
- Performance Optimization: Structured JSON and optimized query logic to enhance AI response accuracy and performance
This project was developed during my internship at Smart ETA Technologies, where I focused on creating practical AI solutions for logistics and transportation management.
ML Model for Phone Dispatchers
BYU-I Support Center - Software Engineer Project
A proof-of-concept machine learning model built to increase efficiency for 100+ phone dispatchers:
- Python-based ML model
- Designed to optimize dispatcher workflows
- Part of my current role at BYU-I Support Center
- Integrated into existing support center infrastructure
Technologies & Tools
- Python: Primary language for ML/AI development
- Ollama: For AI model integration and deployment
- Pandas & NumPy: Data manipulation and analysis
- Seaborn: Data visualization
- Scikit-learn: Machine learning algorithms
- XGBoost: Gradient boosting for structured data
- Neural Networks: Deep learning models for complex predictions
- RNN: Recurrent neural networks for NLP tasks
- Random Forest: Ensemble methods for classification
- Data Analysis: Working with structured and unstructured data for insights