Convert Cuboid2D to/from KITTI 3D data #1639

itrushkin · 2024-10-14T12:03:33Z

Summary

CVS-151427

New features

New Cuboid2D methods:
- Cuboid2D.from_3d(dimensions, location, rotation_y, P, Tr_velo_to_cam): Creates a Cuboid2D object from KITTI 3D bbox annotation data. Matrix P (P2 in Kitti format context) is a 3x4 projection matrix in the left color camera coordinate system. Matrix Tr_velo_to_cam is a 3x4 projection matrix between LiDAR and camera coordinate systems.
- cuboid_2d.to_3d(P_inv): Reconstructs approximate KITTI 3D bbox annotation data (dimensions, location and rotation_y) from 2D projection coordinates. P_inv matrix is a pseudo-inverse of camera-to-image projection matrix.

How to test

See unit test changes

Checklist

I have added unit tests to cover my changes.
I have added integration tests to cover my changes.
I have added the description of my changes into CHANGELOG.
I have updated the documentation accordingly

License

I submit my code changes under the same MIT License that covers the project.
Feel free to contact the maintainers if that's a concern.
I have updated the license header for each file (see an example below).

# Copyright (C) 2024 Intel Corporation
#
# SPDX-License-Identifier: MIT

Signed-off-by: Ilya Trushkin <[email protected]>

codecov · 2024-10-14T12:15:58Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 81.23%. Comparing base (ff5fd94) to head (067bf4a).
Report is 20 commits behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #1639      +/-   ##
===========================================
+ Coverage    81.06%   81.23%   +0.16%     
===========================================
  Files          278      281       +3     
  Lines        32517    32881     +364     
  Branches      6607     5289    -1318     
===========================================
+ Hits         26360    26710     +350     
- Misses        4701     4721      +20     
+ Partials      1456     1450       -6

Flag	Coverage Δ
ubuntu-20.04_Python-3.10	`81.21% <100.00%> (+0.16%)`	⬆️
windows-2022_Python-3.10	`81.21% <100.00%> (+0.16%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

sooahleex · 2024-10-15T06:33:59Z

src/datumaro/components/annotation.py

+        return np.array([a, b, c, d])
+
+    @staticmethod
+    def _get_denorm(Tr_velo_to_cam_homo):


Just a question, what is the meaning of Tr_velo_to_cam_homo?

Calibration matrix Tr_velo_to_cam has a shape of 3 x 4. To project 3D points to the 2D plane, we need to have homogeneous coordinates where each point is represented as a vector with 1 additional dimension.

This is the projection matrix between Velodyne LiDAR to Camera, where LiDAR contains 4 dimensions ([X, Y, Z, 1]) and Camera contains 3 dimensions ([u, v, 1]). velo stands for Velodyne :)

sooahleex · 2024-10-15T06:36:22Z

src/datumaro/components/annotation.py

+      2---3
     /|  /|
-    5-+-8 |
-    | 2 + 3
+    1-+-4 |
+    | 5 + 6
    |/  |/
-    1---4
+    8---7


I understood the overall structure, is there any reason to change the bottom and top face?

I aligned the order of points with Kitti format which describes top face first.

sooahleex

Could you update the documents and explanation of this feature in this pr too?

Signed-off-by: Ilya Trushkin <[email protected]>

itrushkin · 2024-10-15T12:30:51Z

Could you update the documents and explanation of this feature in this pr too?

@sooahleex, documentation is added in 657bf1b. PR description is extended as well.

Convert to/from 3D

7f1a8ab

Signed-off-by: Ilya Trushkin <[email protected]>

itrushkin requested review from a team as code owners October 14, 2024 12:03

itrushkin requested review from sooahleex and removed request for a team October 14, 2024 12:03

Update changelog

54cb8af

Signed-off-by: Ilya Trushkin <[email protected]>

sooahleex reviewed Oct 15, 2024

View reviewed changes

itrushkin added 2 commits October 15, 2024 15:28

Document methods

657bf1b

Signed-off-by: Ilya Trushkin <[email protected]>

Merge branch 'develop' into cuboid_2d_from_3d

067bf4a

wonjuleee approved these changes Oct 15, 2024

View reviewed changes

sooahleex approved these changes Oct 16, 2024

View reviewed changes

wonjuleee merged commit 3d533b9 into openvinotoolkit:develop Oct 16, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convert Cuboid2D to/from KITTI 3D data #1639

Convert Cuboid2D to/from KITTI 3D data #1639

itrushkin commented Oct 14, 2024 •

edited

Loading

codecov bot commented Oct 14, 2024 •

edited

Loading

sooahleex Oct 15, 2024

itrushkin Oct 15, 2024

wonjuleee Oct 15, 2024

sooahleex Oct 15, 2024

itrushkin Oct 15, 2024

sooahleex left a comment

itrushkin commented Oct 15, 2024

+---3
                    /|  /|
--+-8 |
-                  | 2 + 3
+-+-4 |
+                  | 5 + 6
                   |/  |/
----4
+---7

Convert Cuboid2D to/from KITTI 3D data #1639

Convert Cuboid2D to/from KITTI 3D data #1639

Conversation

itrushkin commented Oct 14, 2024 • edited Loading

Summary

New features

How to test

Checklist

License

codecov bot commented Oct 14, 2024 • edited Loading

Codecov Report

sooahleex Oct 15, 2024

Choose a reason for hiding this comment

itrushkin Oct 15, 2024

Choose a reason for hiding this comment

wonjuleee Oct 15, 2024

Choose a reason for hiding this comment

sooahleex Oct 15, 2024

Choose a reason for hiding this comment

itrushkin Oct 15, 2024

Choose a reason for hiding this comment

sooahleex left a comment

Choose a reason for hiding this comment

itrushkin commented Oct 15, 2024

itrushkin commented Oct 14, 2024 •

edited

Loading

codecov bot commented Oct 14, 2024 •

edited

Loading