Add video track and attachment extraction #45

chuong · 2020-02-11T23:52:40Z

This pull request adds a feature that people like me need: extracting the color, depth, and IR video tracks and the calibration attachment from a stream recorded with Microsoft Azure Kinect. Example code using the new feature:

import pymkv
import json

# load mkv file
mkvFile = pymkv.MKVFile('../data/AzureKinect.mkv')

# show track and attachment info
print('track:')
for track in mkvFile.tracks:
    print(json.dumps(track.__dict__, indent=1))
print('attachments:')
for attachment in mkvFile.attachments:
    print(json.dumps(attachment.__dict__, indent=1))

# extract videos and attachments to folder ../data
track_names = mkvFile.extract_video_tracks('../data/')  # this may take a while for long videos
print("Extracted tracks to:\n", '\n'.join(track_names))
attachment_names = mkvFile.extract_attachments('../data/')
print("Extracted attachments to:\n", '\n'.join(attachment_names))

My AzureKinect.mkv video file can be obtained from https://drive.google.com/open?id=1X62D8MX9jPfByfYp2l627WrvVXAk--AK

sheldonkwoodward

Hi @chuong, thanks for submitting this PR! The functionality you wrote will be very useful once merged. I spent some time going over your changes and left comments on the issues that concern me. Beyond those, I would suggest adding a few comments in the extraction functions to make it a bit easier to follow what is happening. Let me know if you have any questions or need help making the changes I have requested.

sheldonkwoodward · 2020-03-08T01:15:46Z

pymkv/MKVFile.py

+                new_attachment = MKVAttachment(file_path,
+                                               name=attachment['file_name'],
+                                               description=attachment['description'])
+                if 'id' in attachment:
+                    new_attachment.id = attachment['id']
+                if 'size' in attachment:
+                    new_attachment.size = attachment['size']
+                if 'properties' in attachment and 'uid' in attachment['properties']:
+                    new_attachment.uid = attachment['properties']['uid']


Looking at the mkvmerge output schema, it seems that "id", "size", and "properties" will always be present in an attachment's section of the info_json. Therefore, these should be added in the MKVAttachment constructor so they are documented and not just dynamically assigned.

Note that "uid" is not required within the "properties" section of the mkvmerge output schema. I recommend including "properties" as a parameter to the constructor, then within the constructor, check if "uid" exists and assign it to an attribute within the MKVAttachment object.

Perhaps an implementation that allows something like this:

new_attachment = MKVAttachment(
    file_path,
    name=attachment['file_name'],
    description=attachment['description'],
    id=attachment['id'],
    size=attachment['size'],
    properties=attachment['properties']
)
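
To make the shape of that constructor concrete, here is a minimal sketch. AttachmentSketch is a hypothetical stand-in, not pymkv's actual MKVAttachment, and the sample values are made up; only the parameter names come from the call suggested above.

```python
class AttachmentSketch:
    """Hypothetical stand-in for MKVAttachment; not pymkv's actual class."""

    def __init__(self, file_path, name=None, description=None,
                 id=None, size=None, properties=None):
        self.file_path = file_path
        self.name = name
        self.description = description
        # "id" shadows the Python builtin; kept to match the call suggested above.
        self.id = id
        self.size = size
        self.properties = properties if properties is not None else {}
        # "uid" is optional inside "properties", so default to None when absent.
        self.uid = self.properties.get('uid')


# Made-up entries shaped like an attachment section of the mkvmerge JSON output.
with_uid = AttachmentSketch('a.mkv', name='calibration.json', description='',
                            id=1, size=1024, properties={'uid': 42})
without_uid = AttachmentSketch('a.mkv', name='notes.txt', id=2, size=10,
                               properties={})
print(with_uid.uid, without_uid.uid)
```

Documenting the attributes in the constructor this way means callers never need hasattr checks, since every attribute exists even when mkvmerge omits "uid".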

sheldonkwoodward · 2020-03-08T01:36:03Z

pymkv/MKVFile.py

+        for track in self.tracks:
+            if track._track_type == 'video':
+                bname = splitext(basename(track.file_path))[0]
+                name = '{}.mp4'.format(join(out_folder, bname + "_" + track.track_name))


Three comments for this line:

1. Before using the out_folder parameter, you should run it through os.path.expanduser so that output directories like ~/data/ work.

2. track_name is an optional attribute in MKVTrack and defaults to None. According to the mkvmerge output schema, "codec", "id", and "type" are the only required properties in a track. I would suggest combining these properties with the filename to produce a unique name for each video track.

3. Video tracks are not guaranteed to be compatible with mp4 containers. The mkvextract docs list the different types of video tracks (the video track types are prefixed with "V_"). I think the best solution for the time being is to create a dictionary that maps each video track type to a compatible container type and use this to decide on an extension.
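
The three points above could be combined roughly as follows. This is only a sketch under assumptions: video_track_path and CODEC_EXTENSIONS are hypothetical names, the mapping is partial and its extensions are illustrative, and it assumes the "V_" codec ID of each track is available as a string. The track id and type go into the filename, while the codec only selects the extension, because codec names can contain slashes.

```python
import os

# Assumed, partial mapping from "V_" codec IDs to output extensions.
# Verify each entry against the mkvextract documentation before relying on it.
CODEC_EXTENSIONS = {
    'V_MPEG4/ISO/AVC': '.h264',
    'V_MPEGH/ISO/HEVC': '.h265',
    'V_VP8': '.ivf',
    'V_VP9': '.ivf',
    'V_THEORA': '.ogg',
}


def video_track_path(out_folder, source_path, track_id, codec_id,
                     default_ext='.bin'):
    """Build a unique output path for one video track (hypothetical helper)."""
    # Expand "~" so folders such as ~/data/ resolve to the home directory.
    out_folder = os.path.expanduser(out_folder)
    base = os.path.splitext(os.path.basename(source_path))[0]
    # Unknown codecs fall back to a generic extension instead of assuming mp4.
    ext = CODEC_EXTENSIONS.get(codec_id, default_ext)
    return os.path.join(out_folder,
                        '{}_track{}_video{}'.format(base, track_id, ext))


print(video_track_path('~/data/', '../data/AzureKinect.mkv', 0,
                       'V_MPEG4/ISO/AVC'))
```

Because the track id is unique within a file, two tracks with the same codec and no track_name still get distinct paths.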

sheldonkwoodward · 2020-03-08T01:54:22Z

pymkv/MKVFile.py

+        name_args = []
+        for attachment in self.attachments:
+            bname = splitext(basename(attachment.file_path))[0]
+            name = join(out_folder, bname + "_" + attachment.name)


Same concerns as part 1 of the comment on line 688. As for the naming of attachment files, I think it would be more appropriate to keep the attachment name specified and not prepend the basename.
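
A minimal sketch of that naming, assuming a hypothetical helper called attachment_path. The basename call is an extra precaution of this sketch, in case an attachment name contains directory components; it is not something the PR currently does.

```python
import os


def attachment_path(out_folder, attachment_name):
    """Build the output path for one attachment (hypothetical helper)."""
    # Expand "~" so folders such as ~/data/ resolve to the home directory.
    out_folder = os.path.expanduser(out_folder)
    # Keep the attachment's own name; strip any directory part defensively.
    return os.path.join(out_folder, os.path.basename(attachment_name))


# The attachment name here is illustrative.
print(attachment_path('~/data/', 'calibration.json'))
```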

Nguyen, Chuong (Data61, Black Mountain) added 2 commits February 12, 2020 10:25

Add ability to extract video tracks and attachments

e30e32b

Fix document style

f7c64c5

sheldonkwoodward requested changes Mar 8, 2020

sheldonkwoodward self-assigned this Mar 8, 2020

sheldonkwoodward changed the base branch from develop to release/1.1 March 10, 2020 03:52

sheldonkwoodward added the feature New feature label Mar 21, 2020
