This project was built as part of the Insight Data Engineering program. It builds a real-time streaming pipeline for sensor data from IoT sensors onboard Unmanned Autonomous Vehicles (UAVs). For the use case presented here, the pipeline collects data from 3 sensors onboard delivery drones to detect and monitor malfunctioning drones.
It is estimated that by the year 2026 there will be ~800,000 delivery drones making deliveries across the US. Manually monitoring such a large number of drones would be practically impossible. The goals of this project are 1) to provide real-time access to the locations of malfunctioning drones, and 2) to store the sensor data surrounding each malfunction event for later analysis. The locations allow flight operators at drone delivery companies to take appropriate action, while the stored sensor data can later be analyzed to prevent future malfunctions.
The web app provides easy access to the locations of malfunctioning and crashed drones.
- Apache Kafka
- Apache Spark Streaming
- AWS S3 bucket
- PostgreSQL
- Flask
The sensor data is streamed through Kafka into Spark Streaming. Spark Streaming computes the root mean square error (RMSE) between incoming sensor data and characterized malfunctioning-drone data to detect drones with unexpected flight paths. The locations of malfunctioning drones are then displayed in the Flask app so that flight operators at drone delivery companies can take appropriate action, and the sensor data surrounding each malfunction event is stored in an S3 bucket for later analysis.
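As an illustration of the detection step, here is a minimal sketch of the RMSE comparison. The window values, signature values, and threshold below are assumptions for demonstration, not values from the actual pipeline:

```python
import math

def rmse(readings, signature):
    """RMSE between a window of sensor readings and a characterized
    malfunction signature of the same length."""
    return math.sqrt(
        sum((r - s) ** 2 for r, s in zip(readings, signature)) / len(readings)
    )

# A drone is flagged when its recent readings closely match the signature.
THRESHOLD = 0.5               # assumed value; would be tuned on historical data
window = [0.9, 1.1, 1.4]      # assumed recent readings from one sensor
signature = [1.0, 1.2, 1.5]   # assumed characterized malfunction pattern
if rmse(window, signature) < THRESHOLD:
    print("possible malfunction")
```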
- 3 node Kafka cluster
- 3 node Spark cluster
- 1 node for PostgreSQL
- 1 node for flask
- S3 bucket
To set up the Kafka cluster I use Pegasus, a tool developed by Insight to make cluster setup easier. Alternatively, you can follow the post here.
Run

```bash
bash setup/kafka_environment.sh
```

to set up Kafka and complete the environment setup.
ssh into the Kafka master and run the following to produce data:

```bash
bash src/scripts/kafka_submit.sh <Number of drones>
```
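For reference, the producer that `kafka_submit.sh` drives might look like the minimal sketch below. The broker address, topic name, and message schema are assumptions, not the repo's actual values; the sketch uses the kafka-python client:

```python
import json
import random
import time

from kafka import KafkaProducer  # pip install kafka-python

producer = KafkaProducer(
    bootstrap_servers="localhost:9092",  # assumed broker address
    value_serializer=lambda v: json.dumps(v).encode("utf-8"),
)

# Publish one simulated reading per second for a single drone.
while True:
    reading = {
        "drone_id": 1,                       # assumed message schema
        "timestamp": time.time(),
        "accel": random.gauss(0, 1),
        "gyro": random.gauss(0, 1),
        "altitude": 100 + random.gauss(0, 5),
    }
    producer.send("drone-sensors", reading)  # assumed topic name
    time.sleep(1)
```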
To set up the Spark cluster, run

```bash
bash setup/spark_environment.sh
```
To start running Spark, log into the Spark master and run

```bash
bash src/scripts/sparksubmit.sh
```
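The actual streaming job submitted by `sparksubmit.sh` lives in the repo; the sketch below only illustrates how such a job might subscribe to the sensor topic. It uses Structured Streaming with assumed broker and topic names, while the project itself may use the older DStream API:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("drone-monitor").getOrCreate()

# Subscribe to the sensor topic (broker and topic names are assumptions).
raw = (
    spark.readStream
    .format("kafka")
    .option("kafka.bootstrap.servers", "kafka-master:9092")
    .option("subscribe", "drone-sensors")
    .load()
)

# Each Kafka record's value carries one JSON sensor reading.
readings = raw.selectExpr("CAST(value AS STRING) AS json")

# Console sink as a stand-in for the real RMSE scoring and
# PostgreSQL/S3 sinks.
query = readings.writeStream.format("console").start()
query.awaitTermination()
```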
Create an EC2 instance and follow setup/postgres.txt to install PostgreSQL and create the database.
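Once the database is up, the pipeline writes flagged drone locations to it. The table name, columns, and connection details below are assumptions for illustration, not the repo's actual schema; the sketch uses psycopg2:

```python
import psycopg2  # pip install psycopg2-binary

conn = psycopg2.connect(
    host="postgres-host", dbname="drones",   # assumed host and credentials
    user="postgres", password="postgres",
)
with conn, conn.cursor() as cur:
    # Assumed schema for flagged drone locations.
    cur.execute("""
        CREATE TABLE IF NOT EXISTS malfunctions (
            drone_id    INTEGER,
            lat         DOUBLE PRECISION,
            lon         DOUBLE PRECISION,
            detected_at TIMESTAMPTZ DEFAULT now()
        )
    """)
    cur.execute(
        "INSERT INTO malfunctions (drone_id, lat, lon) VALUES (%s, %s, %s)",
        (1, 40.7128, -74.0060),
    )
conn.close()
```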
Run

```bash
bash setup/flask_app.sh
```

to create a Flask node and install dependencies.
Run

```bash
sudo ./src/flask/run.py
```

to start the web server.
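As a rough illustration of what run.py serves, here is a minimal Flask endpoint that returns malfunctioning-drone locations from PostgreSQL. The route, table, and connection details are assumptions matching the sketch above, not the repo's actual code:

```python
from flask import Flask, jsonify
import psycopg2

app = Flask(__name__)

@app.route("/malfunctions")
def malfunctions():
    # Assumed connection details and table, matching the earlier sketch.
    conn = psycopg2.connect(
        host="postgres-host", dbname="drones",
        user="postgres", password="postgres",
    )
    with conn, conn.cursor() as cur:
        cur.execute("SELECT drone_id, lat, lon FROM malfunctions")
        rows = cur.fetchall()
    conn.close()
    return jsonify(
        [{"drone_id": d, "lat": la, "lon": lo} for d, la, lo in rows]
    )

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=80)  # port 80 is why run.py needs sudo
```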
Use historical data to learn malfunction events. These learned events could then be used to detect future malfunctions.
Presentation slides are available here