-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Biwi Kinect Head Pose dataset. #3903
Add Biwi Kinect Head Pose dataset. #3903
Conversation
The documentation is not available anymore as the PR was closed or merged. |
Thanks for the detailed explanation of the structure!
features = Features({
"rgb": Sequence(Image()), # for the png frames
"rgb_cal": {"intrisic_mat": Array2D(shape=(3, 3), dtype="float32"), "extrinsic_mat": {"rotation": Array2D(shape=(3, 3), dtype="float32"), "translation": Sequence(Value("float32", length=3)}},
"depth": Sequence(Value("string")), # for the depth frames
"depth_cal": the same as "rgb_cal",
"head_pose_gt": Sequence({"center": Sequence(Value("float32", length=3), "rotation": Array2D(shape=(3, 3), dtype="float32")}),
"head_template": Value("string"), # for the person's obj file
}) We can add a "Data Processing" section to the card to explain how to parse the files.
|
Thanks for the suggestions @mariosasko, yielding one example for each person would make things much easier. |
Added the following :
To-Do :
Any inputs on what to include in the "Data Processing" section ? |
@mariosasko Please could you review this when you get time. Thank you. |
In the Data Processing section, I've added example code for a compressed binary depth image file. Updated the Readme as well. |
@mariosasko / @lhoestq , Please could you review this when you get time. Thank you. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Cool thanks ! I left a few comments :)
It looks like the CI on pyarrow 5 is failing because of an issue with Array2D that is unrelated to this dataset:
AttributeError: 'Array2DExtensionType' object has no attribute 'wrap_array'
Before merging this dataset we'll have to fix this issue. Let me create an issue on github
Created an issue here: #4152 |
Co-authored-by: Quentin Lhoest <[email protected]>
Use os.path.join for consistency. Co-authored-by: Quentin Lhoest <[email protected]>
Use os.path.join Co-authored-by: Quentin Lhoest <[email protected]>
Add encoding when we open files. Co-authored-by: Quentin Lhoest <[email protected]>
Add encoding when we open files. Co-authored-by: Quentin Lhoest <[email protected]>
Got it. Thanks for the comments. I've collapsed the C++ code in the readme and added the suggestions. |
Hi ! The |
I haven't been able to figure out why CI is failing, the error shown is :
Any inputs would be helpful. |
I think it's because there are tabulations in the c++ code, can you replace them with regular spaces please ? (then in another PR we can maybe fix the Readme parser to support text indented with tabulations) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks ! I played with the dataset a bit and I find it not very convenient to have the images grouped together. Don't you think it would be more practical to have one example = one image in this dataset ?
Add contributions info. Co-authored-by: Quentin Lhoest <[email protected]>
@lhoestq , initially the idea was to have one example = one image with an additional field mentioning the frame_number. But each subject, we had a head template, calibration information for the depth and the color camera which was common to all the examples for that subject. Also, the images were continuous frames. |
Having one example = one image would be good but since we have a head template, calibration information for the depth and the color camera which is common to all the images for that subject and the images being continuous frames, I think it makes sense to group the images together for each subject. This will make the feature representation easier. |
Ok I see, sounds good then. Users can still separate the images if they want to |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's all good then, thanks for adding this dataset ! :)
The CI fails are unrelated to this PR and fixed on master, merging ! |
This PR adds the Biwi Kinect Head Pose dataset.
Dataset Request : Add Biwi Kinect Head Pose Database #3822
The Biwi Kinect Head Pose Database is acquired with the Microsoft Kinect sensor, a structured IR light device.It contains 15K images of 20 people with 6 females and 14 males where 4 people were recorded twice.
For each frame, there is :
The ground truth is the 3D location of the head and its rotation.
The dataset structure is as follows :
Preview of frame_00003_pose.txt :
I have used the following dataset features :
I am giving the path to the depth_image here.
I need some inputs for the following :
Wanted to know how we can represent these features ?