Reading MOT dataset with seqinfo produces 0-based indexing in frames #560

maartenvds · 2021-11-23T10:21:12Z

According to the MOT data format spec, "All frame numbers, target IDs and bounding boxes are 1-based" (quote from https://github.com/dendorferpatrick/MOTChallengeEvalKit/tree/master/MOT#data-format). If this claim is correct, there is a bug in mot_format.py.

When testing the CVAT tool, which uses datumaro, I failed to import a MOT dataset with an error "ValueError: Unknown internal frame id -1\n". To debug, I installed CVAT in developer mode (with VS code) and found two issues that lead to the cause of this error, where one affects this repository. In mot_format.py on line 147 (latest develop branch) there are the following lines:

for row in csv.DictReader(csv_file, fieldnames=MotPath.FIELDS):
    frame_id = int(row['frame_id'])
    item = items.get(frame_id)

Since items contains frame ids that start with zero, the item with id zero never gets accessed in this loop because frame_id starts with 1 when mot annotation files is loaded. Also, a possible overrun can occur when frame_id equals the length of the sequence (which is possible since its base-1).

My suggested fix:

for row in csv.DictReader(csv_file, fieldnames=MotPath.FIELDS):
    frame_id = int(row['frame_id']) - 1 # one based frame ids
    item = items.get(frame_id)

The text was updated successfully, but these errors were encountered:

zhiltsov-max · 2021-11-23T10:53:20Z

Hi, thank you for reporting the problem! Probably, we need to review the indexing logic. As I see, we already subtract 1 in the CVAT format handler. Do you use a seqinfo file?

maartenvds · 2021-11-23T14:53:18Z

Yes I use a seqinfo file. I also reported a bug on the CVAT repo that is related this this one cvat-ai/cvat#3940 (I included a description of the .zip file I used over there). Since datumaro returned zero indexed item ids, regardless of my suggested fix, the -1 in CVAT format handler is wrong and also contributed to this problem. But indeed its a good thing to review the indexing logic. However, it would be nice if this issue got fixed as soon as possible.

zhiltsov-max · 2021-11-23T15:46:06Z

Yes I use a seqinfo file.

Then, probably, the fix should be done here: https://github.com/openvinotoolkit/datumaro/blob/develop/datumaro/plugins/mot_format.py#L125-L126

We should just start from 1 instead of 0.

maartenvds · 2021-11-24T09:03:35Z

We should just start from 1 instead of 0.

That also works and does not require modifications to the CVAT tool.
I implemented the following to test your suggestion:

if self._seq_info:
            for frame_id in range(1, self._seq_info['seqlength'] + 1):  # base-1 frame ids
                items[frame_id] = DatasetItem(
                    id=frame_id,

And it works!

Shall I make a PR for this?

zhiltsov-max · 2021-11-24T09:24:20Z

Glad to hear this.

Shall I make a PR for this?

Yes, it would be great!

* Suggested fix for upstream issue #560 * Added unit test for mot_format.py that covers a dataset with seqinfo.ini * Updated changelog with bugfix info

zhiltsov-max assigned sizov-kirill Nov 23, 2021

zhiltsov-max added BUG Something isn't working data formats PR is related to dataset formats labels Nov 24, 2021

zhiltsov-max changed the title ~~Bug in mot_format.py~~ Reading MOT dataset with seqinfo produces 0-based indexing in frames Nov 24, 2021

maartenvds added a commit to maartenvds/datumaro that referenced this issue Nov 24, 2021

Suggested fix for upstream issue openvinotoolkit#560

b8373f4

maartenvds mentioned this issue Nov 24, 2021

Suggested fix for issue #560 #564

Merged

5 tasks

zhiltsov-max pushed a commit that referenced this issue Nov 24, 2021

Suggested fix for issue #560 (#564)

b700444

* Suggested fix for upstream issue #560 * Added unit test for mot_format.py that covers a dataset with seqinfo.ini * Updated changelog with bugfix info

zhiltsov-max closed this as completed Dec 21, 2021

zhiltsov-max linked a pull request Dec 21, 2021 that will close this issue

Suggested fix for issue #560 #564

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reading MOT dataset with seqinfo produces 0-based indexing in frames #560

Reading MOT dataset with seqinfo produces 0-based indexing in frames #560

maartenvds commented Nov 23, 2021

zhiltsov-max commented Nov 23, 2021 •

edited

Loading

maartenvds commented Nov 23, 2021 •

edited

Loading

zhiltsov-max commented Nov 23, 2021

maartenvds commented Nov 24, 2021

zhiltsov-max commented Nov 24, 2021

Reading MOT dataset with seqinfo produces 0-based indexing in frames #560

Reading MOT dataset with seqinfo produces 0-based indexing in frames #560

Comments

maartenvds commented Nov 23, 2021

zhiltsov-max commented Nov 23, 2021 • edited Loading

maartenvds commented Nov 23, 2021 • edited Loading

zhiltsov-max commented Nov 23, 2021

maartenvds commented Nov 24, 2021

zhiltsov-max commented Nov 24, 2021

zhiltsov-max commented Nov 23, 2021 •

edited

Loading

maartenvds commented Nov 23, 2021 •

edited

Loading