data/input/tehranhouses.csv: Main dataset of Tehran housesdata/output/best_stacking_model.pkl: Trained and saved modelnotebooks/tehran.ipynb: Main notebook containing steps for analysis, data cleaning, feature engineering, modeling, and evaluationimages/: Output images and visualizations
- Business Understanding and Objective Definition
- Data Collection and Exploration
- Data Cleaning and Preprocessing
- Exploratory Data Analysis (EDA) with Plotly
- Feature Engineering and Dimensionality Reduction
- Modeling with Various Regression Algorithms
- Model Evaluation and Selection of the Best Model
- Saving the Final Model
- Install the required packages:
pip install -r requirements.txt
- Run the notebook notebooks/tehran.ipynb and follow the steps.
- The final model is saved at
data/output/best_stacking_model.pkl.
The final model is a Stacking Ensemble Regressor, combining XGBoost, Gradient Boosting, and Random Forest with a Ridge meta-learner, achieving high predictive accuracy.
This project is licensed under the LICENSE.