This project was done from the SupRTwin: Sensing, Understanding, and Provisioning of Robotic Digital Twins at Universität des Saarlandes.
To construct a Digital Twin of the environment as captured by the Drone during its flight.
Drone takes an inspection flight in a confined environment. Deep learning model YOLOv5 is deployed to detect the objects in the input stream from the drone. From the detected object, location of the objects are found based on the location of the drone and the bounding box pixels.
Overall, implementation-wise we can split the process into 3 phases.
Drone API which are implemented in MQTT APIs are subscribed and published with appropriate params to successfully send/receive data from and to the drone.
We created a custom dataset of objects with training set of 4.1K images, validation set of 628 images, testing set of 212 images.
We then used YOLOv5 model to train the dataset. And obtained the following performance. We see that box_loss is steadily lowered both train and validation set loss.
We used Homography matrix and change the center pixels of the image to Mapping matrix using Homography matrix obtained using the intrinsic and extrinsic camera parameters.
We get the object's bounding box and center.
Then we use rotation matrix in z-axis to reverse rotate the object's co-ordinate along the z-axis with respect to the drone's rotation.
With the adjusted object's location and class of object in hand, we reconstruct the whole scene in Unity 3D. We are already provided with the scene's environment in Unity 3D in which we place the detected objects dynamically on the fly.
Finally the end-end flow looks like this.
This project was done in collaboration with the German Research Center for Artificial Intelligence and ZeMA - Zentrum für Mechatronik und Automatisierungstechnik gemeinnützige GmbH. Implementation's Repo: https://mrk40.dfki.de/drone-inspection-seminar/droneinspection