Skip to content

Multi-Agent RL approach to solving MNIST problem where agents have partial view of the handwritten digit image

License

Notifications You must be signed in to change notification settings

varun646/MARL_MNIST

Repository files navigation

Multi-Agent RL for MNIST Image Identification

This repository implements a multi-agent reinforcement learning (RL) system to identify MNIST images using the custom Gymnasium environment Multi-Agent MNIST Reconstruction. The RL agents work together to reconstruct and identify an MNIST image in as few steps as possible. A computer vision attention model is used to evaluate the reconstructed image and guide the training process.

Overview

Project Description

The project aims to train multiple RL agents to collaboratively identify an MNIST image. The agents are placed on random parts of the image and have limited views. They navigate the image using actions (up, down, left, right), and their observations are combined to form a reconstructed image. A computer vision attention model is then used to evaluate how well the reconstructed image matches the original MNIST image. The RL system is trained to minimize the number of steps needed to accurately identify the image.

Features

  • Multi-Agent RL: Each agent learns to navigate the image optimally, contributing to the overall image reconstruction.
  • Attention Model: A computer vision attention model evaluates the reconstructed image, providing feedback for the RL agents.
  • Efficient Training: The system is designed to minimize the number of steps needed for accurate image identification.

Installation

Prerequisites

Installation Steps

  1. Clone the repository:

    git clone https://github.com/varun646/MARL_MNIST.git
  2. Install the required packages:

    pip install -r requirements.txt
  3. Install the custom environment:

    pip install -e .

About

Multi-Agent RL approach to solving MNIST problem where agents have partial view of the handwritten digit image

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages