This repository contains boilerplate code for applications that download data from a data source (CSV by default), process the data, train a model (XGBoost in this case), and push the trained model to an AWS S3 bucket. A second microservice then starts a server, downloads the model from the S3 bucket, and exposes a request API that serves predictions from the model and the payload sent by the client. The dataset used in this boilerplate is the Boston Housing Price dataset in CSV format. Everything is written in Rust, using Polars. The main functionality of the repository was built following Pau Labarta Bajo as the instructor of the "learn Rust together" cohort.
- Data download and processing
- Model training with XGBoost
- Model storage in AWS S3
- Prediction API server in EC2
- Streamlit frontend to test the API
- Rust
- Python
- Polars
- Cargo
- Docker
- AWS S3, EC2
- ngrok (for local tests)
- Streamlit
- Make
- Rust
- Rust-Analyzer (recommended)
- Cargo
- Docker
- AWS account
- Clone the repository:
git clone https://github.com/OscarArroyoVega/RUST-boilerplate-train-API.git
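Once cloned, the services can be built locally with Cargo or through Docker. The exact package and image names depend on the repository layout, so the commands below are only a sketch with placeholder tags:

```bash
cd RUST-boilerplate-train-API

# Build the Rust workspace locally (release mode for reasonable XGBoost/Polars performance)
cargo build --release

# Or build a Docker image; "train-pipeline" is a placeholder tag, not the repo's real name
docker build -t train-pipeline .
```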
The data processing service downloads the Boston Housing Price dataset, processes it, trains an XGBoost model, and uploads the model to an AWS S3 bucket. Once deployed, the prediction API server downloads the trained model from the S3 bucket and serves predictions based on the client payload. A basic Streamlit frontend has been built to complete the system, as illustrated by the example request below.
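To illustrate the request flow, a client posts a JSON payload containing Boston Housing features and receives a predicted price. The route, port, and feature names below are assumptions for illustration only, not the repository's actual schema:

```bash
# Hypothetical request to the prediction API; check the API service source for the
# real endpoint and payload schema.
curl -X POST http://localhost:8080/predict \
  -H "Content-Type: application/json" \
  -d '{"crim": 0.03, "rm": 6.5, "lstat": 4.9, "ptratio": 15.3}'
# The response is expected to contain the predicted median house price.
```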
To deploy the inference service to an EC2 instance (or any other host that supports multiple containers), it is recommended to export the local Docker image to a .tar file instead of cloning the repository and rebuilding it inside the instance; this avoids fetching and compiling the XGBoost package again, which is slow. Access to the API service is restricted so that only the frontend application can reach it, and the API service restarts automatically. The frontend also runs in Docker.
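A sketch of that image-transfer workflow, assuming the image is tagged `inference-api` and the instance already runs Docker (image name, key path, host, and port are placeholders):

```bash
# On the build machine: export the locally built image to a tarball
docker save -o inference-api.tar inference-api:latest

# Copy the tarball to the EC2 instance
scp -i ~/.ssh/my-key.pem inference-api.tar ec2-user@<ec2-public-ip>:~

# On the instance: load the image and run it with an automatic restart policy
docker load -i inference-api.tar
docker run -d --restart unless-stopped -p 8080:8080 inference-api:latest
```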
