This project trains machine learning models to predict customer subscription to bank term deposits and explains their predictions with LIME and SHAP. It provides an interactive Streamlit dashboard with built-in model interpretability to help understand which features drive predictions.
SHAP (SHapley Additive exPlanations) values quantify the contribution of each feature to a prediction, offering both global and local insight into model behavior.
LIME (Local Interpretable Model-Agnostic Explanations) approximates the model locally to highlight the features that influence individual predictions.
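As a minimal sketch of how these two explainers are typically wired up (the names `model`, `X_train`, and `X_test` are placeholders for a fitted classifier and pandas DataFrames of bank-marketing features; a tree-based model is assumed for `shap.TreeExplainer`):

```python
# Sketch only: SHAP for global attributions, LIME for one local explanation.
# `model`, `X_train`, `X_test` are assumed to exist (fitted tree-based
# classifier plus train/test feature DataFrames).
import shap
from lime.lime_tabular import LimeTabularExplainer

# SHAP: per-feature contributions for every test row, plus a global summary
shap_explainer = shap.TreeExplainer(model)
shap_values = shap_explainer.shap_values(X_test)
shap.summary_plot(shap_values, X_test)  # global feature-importance view

# LIME: fit a local surrogate around a single customer record
lime_explainer = LimeTabularExplainer(
    X_train.values,
    feature_names=list(X_train.columns),
    class_names=["no", "yes"],
    mode="classification",
)
explanation = lime_explainer.explain_instance(
    X_test.iloc[0].values, model.predict_proba, num_features=10
)
print(explanation.as_list())  # [(feature condition, weight), ...]
```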
- Trained models:
- Decision Tree
- Random Forest
- Naive Bayes
- K-Nearest Neighbors (KNN)
- 📊 Model comparison, ensemble vote, and average confidence across selected models (see the soft-voting sketch after this feature list)
- 🔍 Local and global interpretability with LIME and SHAP (bar, waterfall, force, and summary plots)
- 🖥️ Interactive Streamlit dashboard for predictions and visualizations
- ✍️ Form-based input with dedicated Predict button, example profile loader, and input reset
- 🎛️ Confidence threshold slider, input presets, and light/dark/colorblind-friendly themes with custom accent color
- 🎉 Optional animations (balloons or snow) based on predictions
- 📁 Batch prediction via CSV uploads with automated schema validation and detailed error feedback
- 📥 Downloadable predictions, confidence charts, and explanation plots
- 🧪 Data exploration with summary statistics, correlation heatmaps, and feature distributions
- 🛠️ Model training & evaluation with cross-validation, confusion matrices, ROC and precision–recall curves, fairness dashboards, and class balancing presets
- 📂 Pre-trained models managed via a JSON registry that documents versions and metrics
- ✅ Built-in schema enforcement for single, batch, and training workflows using typed validation
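The ensemble vote can be sketched as soft voting over the four model families above. Hyperparameters, variable names, and the 0.5 threshold below are illustrative assumptions, not the dashboard's exact settings:

```python
# Sketch: train the four model families and average their predicted
# probabilities ("soft voting"). X_train, y_train, X_test are assumed
# to be preprocessed bank-marketing features and the subscription label.
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier

models = {
    "decision_tree": DecisionTreeClassifier(random_state=42),
    "random_forest": RandomForestClassifier(random_state=42),
    "naive_bayes": GaussianNB(),
    "knn": KNeighborsClassifier(n_neighbors=5),
}
for clf in models.values():
    clf.fit(X_train, y_train)

# Average confidence across the selected models, then threshold
# the mean probability for the ensemble vote.
avg_confidence = np.mean(
    [clf.predict_proba(X_test)[:, 1] for clf in models.values()], axis=0
)
ensemble_vote = (avg_confidence >= 0.5).astype(int)
```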
```text
Explainable-ML-Bank-Marketing-Campaigns/
├── models/              # Pre-trained ML models (.pkl)
├── sample_data.csv      # Example dataset
├── streamlit_app.py     # Streamlit dashboard entry point
├── requirements.txt     # Dependencies
├── LICENSE              # License file
└── README.md            # Project documentation
```
The sample dataset is available at: https://www.kaggle.com/datasets/yufengsui/portuguese-bank-marketing-data-set
- Clone the repository:

  ```bash
  git clone https://github.com/your-username/Explainable-ML-Bank-Marketing-Campaigns.git
  cd Explainable-ML-Bank-Marketing-Campaigns
  ```

- Create a virtual environment and install dependencies:

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
  pip install -r requirements.txt
  ```
- Run the Streamlit app:

  ```bash
  streamlit run streamlit_app.py
  ```
- Open the app in your browser at http://localhost:8501.
- Upload your dataset or use the provided `sample_data.csv` to:
  - Generate single or batch predictions
  - Compare model outputs and confidence scores
  - Visualize interpretability with LIME and SHAP
  - Explore datasets via summary statistics, correlation heatmaps, and feature distribution plots
  - Train and evaluate models with cross-validation metrics, confusion matrices, and ROC/PR curves (see the sketch after this list)
  - Inspect fairness metrics by selecting protected attributes during training analysis
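A minimal sketch of the cross-validated evaluation step, assuming stratified 5-fold CV and ROC AUC scoring (`model`, `X`, and `y` are placeholders for a fitted estimator and the full feature/label arrays):

```python
# Sketch: cross-validated scoring for one candidate model.
from sklearn.model_selection import StratifiedKFold, cross_val_score

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
scores = cross_val_score(model, X, y, cv=cv, scoring="roc_auc")
print(f"ROC AUC: {scores.mean():.3f} +/- {scores.std():.3f}")
```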
- Run automated tests before committing changes:

  ```bash
  pytest
  ```
- Input forms, batch uploads, and training data are checked against a typed schema. Any violations (missing columns, out-of-range values, non-binary indicators) surface immediately in the UI so analysts can correct their files before running inference.
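A hypothetical sketch of such a typed check, using Pydantic for illustration (the project's actual validator, field set, and bounds may differ):

```python
# Hypothetical schema check: reject missing columns, out-of-range values,
# and non-binary indicators, returning readable errors for the UI.
from pydantic import BaseModel, Field, ValidationError

class CustomerRecord(BaseModel):
    age: int = Field(ge=18, le=100)    # out-of-range ages are rejected
    balance: float
    housing: int = Field(ge=0, le=1)   # binary indicator must be 0/1
    duration: float = Field(ge=0)

def validate_row(row: dict) -> list[str]:
    """Return human-readable errors so analysts can fix their files."""
    try:
        CustomerRecord(**row)
        return []
    except ValidationError as exc:
        return [f"{err['loc'][0]}: {err['msg']}" for err in exc.errors()]
```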
- Model artifacts are described in `models/registry.json` with version numbers, source datasets, and headline metrics.
- To refresh a model:
  1. Retrain the estimator offline.
  2. Export the artifact to `models/` using the naming convention from the registry.
  3. Update the registry entry with the new version, dataset, and evaluation metrics.
  4. Re-run `pytest` to confirm schema compatibility before deploying.
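A registry entry might look like the following; the field names and metric values here are purely illustrative, not the actual contents of `models/registry.json`:

```json
{
  "random_forest": {
    "version": "1.2.0",
    "artifact": "models/random_forest_v1.2.0.pkl",
    "dataset": "sample_data.csv",
    "metrics": {"roc_auc": 0.93, "f1": 0.61}
  }
}
```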
- Streamlit Community Cloud: push the repo and connect it directly for instant hosting. Remember to configure environment variables for any sensitive credentials.
- Docker: wrap the app in a lightweight container and deploy to your preferred platform (Heroku, Azure App Service, AWS Fargate, etc.); a minimal Dockerfile sketch follows this list.
- Monitoring: schedule periodic batch prediction checks and log fairness metrics to track drift over time.
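A minimal Dockerfile sketch, assuming everything the app needs is pinned in `requirements.txt` (base image and port are assumptions; adjust for your platform):

```dockerfile
# Hypothetical container recipe for the Streamlit dashboard.
FROM python:3.11-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
EXPOSE 8501
CMD ["streamlit", "run", "streamlit_app.py", "--server.port=8501", "--server.address=0.0.0.0"]
```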
Contributions are welcome!
Feel free to open an issue or submit a pull request to enhance features, improve models, or add datasets.
This project is licensed under the MIT License.