MLP Network for Digit Recognition using MNIST 784 dataset

Overview

This project implements a Multi-Layer Perceptron (MLP) from scratch in Python, using the MNIST dataset for classifying handwritten digits (0-9). The model is built with basic operations using NumPy and is modularized into separate components such as activation functions, forward and backward propagation, loss computation, and training. The model is evaluated based on accuracy on the MNIST dataset.

The project does not rely on machine learning frameworks like TensorFlow or PyTorch but implements the MLP with manual backpropagation.

Features

MLP neural network with a single hidden layer.
Modular design with separate files for activations, loss functions, and training logic.
Training and evaluation functionality implemented from scratch.
Basic matrix operations using NumPy.
Utilizes scikit-learn for data preprocessing and train-test splitting.
Achieves competitive accuracy on the MNIST dataset.

Installation

You can set up the project by installing the required dependencies and running the training script.

Clone the repository:

git clone https://github.com/yourusername/digit-classifier.git
cd digit-classifier

Install the dependencies via pip:

pip install -r requirements.txt

Or, alternatively, using setup.py:

pip install .

Usage

Training the model:

After installing the dependencies, you can train the model by running the main script:

python main.py

This will preprocess the MNIST data, initialize the MLP, and train it over multiple epochs.

Modifying the Model:

You can adjust the number of neurons, learning rate, and number of epochs by editing the hyperparameters in the main.py script.
The model architecture (layers, activations) can be modified in the module/core/core.py file and main.py.

Code Structure

The project is divided into several modules:

module: Contains the core network/model definition, including forward propagation, backpropagation, and weight updates.
activations: Defines activation functions (ReLU, softmax) and their derivatives.
loss: Contains the cross-entropy loss function for classification tasks.
trainer: Handles training and evaluation logic for the MLP model.
utils: Preprocessing functions like one-hot encoding and train-test splitting.

Testing

Unit tests are available to verify the correctness of the MLP implementation, activations, and loss function. To run the tests:

python -m unittest discover

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Author

Created by Sebastian Mandal.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
docs		docs
module		module
tests		tests
.gitignore		.gitignore
.style.yapf		.style.yapf
LICENSE		LICENSE
README.rst		README.rst
main.py		main.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MLP Network for Digit Recognition using MNIST 784 dataset

Overview

Features

Installation

Usage

Code Structure

Testing

License

Author

About

Releases

Packages

Languages

License

sebmandal/classifier

Folders and files

Latest commit

History

Repository files navigation

MLP Network for Digit Recognition using MNIST 784 dataset

Overview

Features

Installation

Usage

Code Structure

Testing

License

Author

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages