This document outlines the workflow of the project's machine learning pipeline: data retrieval, cross-validation experiments, model training with the best parameters, and inference.
Retrieve Data and Store in Google Cloud Storage
- The first step collects the raw match data from football-data.co.uk and stores it in Google Cloud Storage, so that it is readily available to the subsequent steps (a minimal sketch follows).
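The Python sketch below illustrates one way this step could look. The CSV URL, bucket name, and object path are illustrative placeholders, not values defined by the project.

```python
# Minimal sketch: download one season's CSV from football-data.co.uk and
# upload it to a Google Cloud Storage bucket. URL, bucket, and object
# path below are illustrative placeholders.
import requests
from google.cloud import storage

DATA_URL = "https://www.football-data.co.uk/mmz4281/2324/E0.csv"  # example file
BUCKET_NAME = "my-football-data"  # hypothetical bucket
BLOB_PATH = "raw/E0_2324.csv"     # destination object path


def fetch_and_store() -> None:
    # Download the raw CSV from football-data.co.uk.
    response = requests.get(DATA_URL, timeout=30)
    response.raise_for_status()

    # Upload the CSV bytes to Cloud Storage.
    client = storage.Client()
    bucket = client.bucket(BUCKET_NAME)
    bucket.blob(BLOB_PATH).upload_from_string(
        response.content, content_type="text/csv"
    )


if __name__ == "__main__":
    fetch_and_store()
```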
Cross-Validation Experiment
- The data is retrieved from Google Cloud Storage.
- Cross-validation experiments are run on Vertex AI to evaluate model performance across different hyperparameter configurations.
- The results of these experiments, including metrics and model parameters, are logged to Neptune.ai (see the sketch after this list).
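To make the experiment script concrete, here is a minimal Python sketch: it reads the dataset from Cloud Storage, scores a couple of hyperparameter configurations with scikit-learn cross-validation, and logs parameters and metrics to Neptune.ai. The GCS path, Neptune project name, target column, model class, and parameter grid are illustrative assumptions; in this workflow the script would be packaged into a container and submitted to Vertex AI.

```python
# Minimal sketch of a cross-validation experiment that logs to Neptune.ai.
# Paths, project names, columns, and the parameter grid are placeholders.
import neptune
import pandas as pd
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

DATA_PATH = "gs://my-football-data/raw/E0_2324.csv"  # hypothetical bucket/object
TARGET = "FTR"  # assumed target column (full-time result)


def run_cv_experiments() -> None:
    df = pd.read_csv(DATA_PATH)  # reading gs:// paths requires the gcsfs package
    X = df.drop(columns=[TARGET]).select_dtypes("number").fillna(0.0)  # naive imputation for the sketch
    y = df[TARGET]

    # A small, illustrative hyperparameter grid.
    for params in ({"n_estimators": 100, "max_depth": 2},
                   {"n_estimators": 300, "max_depth": 3}):
        # Reads NEPTUNE_API_TOKEN from the environment; project name is hypothetical.
        run = neptune.init_run(project="my-workspace/football")
        run["parameters"] = params

        scores = cross_val_score(
            GradientBoostingClassifier(**params), X, y, cv=5, scoring="accuracy"
        )
        run["cv/accuracy_mean"] = float(scores.mean())
        run["cv/accuracy_std"] = float(scores.std())
        run.stop()


if __name__ == "__main__":
    run_cv_experiments()
```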
Model Training with Best Parameters
- The best-performing parameters are retrieved from Neptune.ai.
- Using these parameters, the final model is trained on Vertex AI.
- The trained model is stored back in Google Cloud Storage for use at inference time (see the sketch after this list).
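A minimal Python sketch of this step is shown below, assuming the winning run logged its configuration under a `parameters` namespace. The Neptune run ID, project name, GCS paths, and model class are placeholders; as above, this script would run as a Vertex AI training job.

```python
# Minimal sketch of final training with the best logged parameters.
# Run ID, project name, paths, and columns are placeholders.
import joblib
import neptune
import pandas as pd
from google.cloud import storage
from sklearn.ensemble import GradientBoostingClassifier

DATA_PATH = "gs://my-football-data/raw/E0_2324.csv"  # hypothetical
BEST_RUN_ID = "FOOT-42"                              # hypothetical Neptune run ID
BUCKET_NAME = "my-football-data"
MODEL_BLOB = "models/model.joblib"
TARGET = "FTR"


def train_and_store() -> None:
    # Fetch the parameters logged by the best cross-validation run.
    best_run = neptune.init_run(
        project="my-workspace/football", with_id=BEST_RUN_ID, mode="read-only"
    )
    params = best_run["parameters"].fetch()
    best_run.stop()

    # Train the final model on the full dataset.
    df = pd.read_csv(DATA_PATH)
    X = df.drop(columns=[TARGET]).select_dtypes("number").fillna(0.0)
    y = df[TARGET]
    model = GradientBoostingClassifier(**params).fit(X, y)

    # Serialize the model and upload it to Cloud Storage.
    joblib.dump(model, "model.joblib")
    storage.Client().bucket(BUCKET_NAME).blob(MODEL_BLOB).upload_from_filename(
        "model.joblib"
    )


if __name__ == "__main__":
    train_and_store()
```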
Inference
- A Flask server app is deployed to Cloud Run.
- On startup it loads the trained model from Google Cloud Storage.
- It serves predictions over HTTP (see the sketch after this list).
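A minimal sketch of such a Flask app is below. The bucket and blob names, the request format, and the route are assumptions; Cloud Run injects a PORT environment variable and defaults to 8080.

```python
# Minimal sketch of the Flask inference server. Bucket/blob names and the
# expected request format are placeholders.
import joblib
from flask import Flask, jsonify, request
from google.cloud import storage

BUCKET_NAME = "my-football-data"       # hypothetical
MODEL_BLOB = "models/model.joblib"
LOCAL_MODEL_PATH = "/tmp/model.joblib"

app = Flask(__name__)

# Download the trained model from Cloud Storage once at startup.
storage.Client().bucket(BUCKET_NAME).blob(MODEL_BLOB).download_to_filename(
    LOCAL_MODEL_PATH
)
model = joblib.load(LOCAL_MODEL_PATH)


@app.route("/predict", methods=["POST"])
def predict():
    # Expects JSON like {"features": [[...], [...]]} with rows in the same
    # column order used at training time.
    payload = request.get_json(force=True)
    predictions = model.predict(payload["features"])
    return jsonify({"predictions": predictions.tolist()})


if __name__ == "__main__":
    # Cloud Run sets PORT; 8080 is its default.
    app.run(host="0.0.0.0", port=8080)
```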
Technologies Used
- Google Cloud Storage: Used for storing data and models.
- Artifact Registry: Used for storing custom Docker images.
- Vertex AI: Used for running experiments and training models.
- Google Cloud Run: Used for running the Flask server app.
- Neptune.ai: Used for tracking experiments and storing results.
- Python/Scikit-learn: Used for custom model training and cross-validation.
Workflow Summary
- Data Storage: Upload your dataset to Google Cloud Storage.
- Experimentation: Run cross-validation and log results to Neptune.ai.
- Model Training: Train the model using the best parameters from Neptune.ai and store the model in Google Cloud Storage.
- Inference: Serve predictions through Cloud Run.