Welcome to my data science portfolio 👋! Here you’ll find selected projects that showcase my developing skills in data cleaning, exploratory data analysis, visualization, and predictive modeling. As a student learning data science, I focus on building end-to-end analytical workflows in Python and Java, and I am also learning other programming languages to broaden my programming skills. I apply statistical methods and machine learning techniques while practicing clear, reproducible analysis.
A data analysis and visualization project focused on exploring California’s residential real estate market trends and pricing patterns, emphasizing clear data storytelling.
The California housing market has long been one of the most dynamic and complex in the U.S., with prices influenced by supply shortages, demand growth, and regional disparities. This analysis aims to uncover trends in pricing, inventory, and key property features.
Tools & Skills: Python, Pandas, NumPy, Matplotlib, Seaborn, Scikit-Learn, Data Visualization
🔗 View on GitHub
A health data analysis project examining how biological sex relates to BMI, calorie expenditure, and other physiological indicators through statistical analysis and visualization.
Obesity is a major public health concern worldwide, but risk factors and body composition patterns often differ between males and females. This project investigates these differences to better understand how biological and behavioral factors contribute to obesity-related trends.
Tools & Skills: Python, Pandas, Seaborn, Plotly, Data Visualization, Exploratory Data Analysis
🔗 View on GitHub
A text and sentiment analysis project exploring emotional trends, narrative structure, and character representation in the TV series Friends.
Television narratives reflect social dynamics and character development over time. This project analyzes dialogue data from Friends to understand emotional arcs, screen presence, and gender representation using data-driven storytelling.
Tools & Skills: Python, Pandas, Seaborn, NLP, Sentiment Analysis, Data Visualization
🔗 View on GitHub