Skip to content

This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.

Notifications You must be signed in to change notification settings

abdul-rafay19/California-Housing-Price-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🏠 California Housing Price Prediction

A complete end-to-end machine learning regression project using the California Housing Dataset, focusing on predicting median house values based on multiple features. This project was developed as a task for the Machine Learning Internship at Arch Technologies.


πŸ“Œ Project Summary

This project demonstrates:

  • Data preprocessing & visualization
  • Building regression models:
    • Linear Regression
    • Decision Tree Regressor
    • Random Forest Regressor
  • Cross-validation evaluation
  • Hyperparameter tuning using:
    • GridSearchCV
    • RandomizedSearchCV
  • Final model testing and RMSE evaluation

πŸ“ Dataset

  • Source: Built-in California housing dataset from sklearn.datasets
  • Features:
    • MedInc: Median income in block group
    • HouseAge: Median house age
    • AveRooms: Average number of rooms
    • AveBedrms: Average number of bedrooms
    • Population: Block group population
    • AveOccup: Average number of household members
    • Latitude & Longitude
  • Target:
    • MedHouseVal: Median house value

πŸ› οΈ Technologies & Libraries

  • Python 3.x
  • NumPy
  • Pandas
  • Matplotlib
  • Seaborn
  • Scikit-learn
  • SciPy

πŸ“Š Exploratory Data Analysis

  • Visualized distributions of features using histograms
  • Heatmap correlation matrix to identify feature relationships
  • Scatter plot between MedInc and MedHouseVal

πŸ§ͺ Model Training & Evaluation

πŸ“Œ Models Implemented:

  1. Linear Regression
  2. Decision Tree Regressor
  3. Random Forest Regressor

πŸ“Œ Evaluation Metric:

  • Root Mean Squared Error (RMSE) using 10-fold Cross-Validation

πŸ‘¨β€πŸ’» Developed By

Abdul Rafay
πŸ“š BS Software Engineering | 🎯 AI & ML Enthusiast
πŸ”— LinkedIn


πŸ“œ License

This repository is licensed under the MIT License.


🌟 Support & Contribution

If you found this helpful:

  • ⭐ Star the repo
  • 🍴 Fork it and contribute
  • πŸ“’ Share on LinkedIn and tag me!

πŸ” Accurate detection. 🎯 Precise segmentation. πŸš€ Built with passion.

About

This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages