This is the repository for the first challenge hosted by Politecnico di Milano for the Artificial neural networks and deep learning course in 2022.
In this task, we are required to classify species of plants, which are
divided into 8 categories according to the species of the plant to which they belong. Being a classification
problem, given an image, the goal is to predict the correct class label.
You can find the dataset here.
The dataset provided is structured in a single folder containing the following classes:
- Species1 : 186 images
- Species2 : 532 images
- Species3 : 515 images
- Species4 : 511 images
- Species5 : 531 images
- Species6 : 222 images
- Species7 : 537 images
- Species8 : 508 images
The dataset contains in total 3542 images of size 96x96.
The repository is structured in the following way:
- utilities: folder containing all the scripts used to manipulate the dataset before uploading it to kaggle.
- best_model.ipynb: notebook that generates the best model we were able to build.
- report.pdf: pdf file containing a brief recap of all our experiments before reaching the best model.
Our best model reached an accuracy of 91% on the validation set and 88.3% accuracy in the hidden test provided by the challenge host.
F1-score table on the test set:
Species1 | Species2 | Species3 | Species4 | Species5 | Species6 | Species7 | Species8 | |
---|---|---|---|---|---|---|---|---|
F1-score | 0.7704 | 0.8871 | 0.9292 | 0.8629 | 0.8980 | 0.9222 | 0.9632 | 0.8257 |
Final position: 78 out of 300.