Do you represent each label as separate channels in the dataset loader?
I ask because there is a lot of class overlap in COCO and the z-order isn't always correct. For example, if you put everything into a single categorical channel, the table category often blocks out all the objects on top of the table. A one-hot (or rather multi-hot) encoding with one channel per label avoids that.
You're right about the issue, thanks. I've tried a couple of ways to represent the ground truth. One of them is an NxHxWxC tensor with each label in a separate channel, like you describe; in that case C is 81: the 80 classes plus one extra channel for background.
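To make the difference concrete, here is a minimal NumPy sketch of the two encodings discussed above. It is not the loader from this repository: the function names, the `(class_idx, mask)` input format, the choice of channel 0 for background, and the class indices `CUP` and `TABLE` are all assumptions made for illustration.

```python
import numpy as np

NUM_CLASSES = 80  # object classes, as stated in the reply above
BACKGROUND = 0    # assumption: channel 0 is background, classes use 1..80


def to_categorical(instances, height, width):
    """Single-channel label map: later masks overwrite earlier ones."""
    target = np.full((height, width), BACKGROUND, dtype=np.int64)
    for class_idx, mask in instances:
        target[mask.astype(bool)] = class_idx
    return target


def to_multi_hot(instances, height, width, num_classes=NUM_CLASSES):
    """HxWxC target with one channel per label; overlapping classes coexist."""
    target = np.zeros((height, width, num_classes + 1), dtype=np.uint8)
    for class_idx, mask in instances:
        target[..., class_idx] |= mask.astype(np.uint8)
    # A pixel is background only if no class channel is set there.
    target[..., BACKGROUND] = target[..., 1:].sum(axis=-1) == 0
    return target


# Hypothetical class indices, for illustration only.
CUP, TABLE = 1, 2

cup = np.zeros((4, 4), dtype=bool)
cup[1:3, 1:3] = True                 # a 2x2 cup
table = np.ones((4, 4), dtype=bool)  # the table covers the whole image

# The table is painted last, i.e. the z-order is wrong.
instances = [(CUP, cup), (TABLE, table)]

categorical = to_categorical(instances, 4, 4)
multi_hot = to_multi_hot(instances, 4, 4)

print((categorical == CUP).sum())  # cup pixels left in the categorical map
print(multi_hot[..., CUP].sum())   # cup pixels kept in the multi-hot target
```

In the categorical map the cup is lost because the table is written over it, while the multi-hot target keeps both labels at the overlapping pixels. Stacking the per-image HxWxC targets along a new first axis (for example with `np.stack`) gives the NxHxWxC batch tensor.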