top of page

Create Your First Project

Start adding your projects to your portfolio. Click on "Manage Projects" to get started

Dimension Reduction in Economic Forecasting: Discussing and Applying Principal Component Analysis

Project type

Machine Learning

Date

March 2025 - June 2025

Location

Los Angeles, California

Role

Data Engineer

One of my favorite project from one of the most important concepts in data science: Machine Learning.

This project explores how dimensionality affects predictive accuracy in economic forecasting, specifically for housing prices. Motivated by the challenges posed by the "curse of dimensionality," my group and I analyzed how multiple macroeconomic indicators, including interest rates, inflation, unemployment, GDP, and time, influence a model's performance in predicting the Housing Price Index (HPI).

To address redundancy and overfitting caused by high-dimensional data, I applied Principal Component Analysis (PCA), effectively transforming correlated economic indicators into fewer, more meaningful features. Results demonstrated that utilizing only the first two principal components significantly simplified the model while preserving or even enhancing predictive accuracy. In this project, I cleaned time-series data from the Federal Reserve Economic Database (FRED) via API, engineered features from interest rate, inflation, unemployment rate, GDP, and time index. I also implemented min-max normalization and 5-fold cross-validation for model evaluation, then compared linear regression performance with raw features vs. principal components using PCA

The project combined rigorous empirical testing, theoretical analysis, and modern statistical techniques to provide actionable insights for improving forecasting stability and performance in economic models.

bottom of page