You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is the line in 'lfvila8m_clipid.jsonl' a video clips-sentence pair? And I see an variational number of video-clips per row. So how the video-clips of 'lfvila8m_clipid.jsonl' is divided from the original ‘hdvila_clip_text_100m.jsonl’? In addition to the selection of videos with more than 4 clips mentioned in the paper, are there any details?
The text was updated successfully, but these errors were encountered:
sunwhw
changed the title
Hi, how to define the LF-hdvila-8m?
Hi, how to understand the LF-hdvila-8m?
Apr 6, 2024
Is the line in 'lfvila8m_clipid.jsonl' a video clips-sentence pair? And I see an variational number of video-clips per row. So how the video-clips of 'lfvila8m_clipid.jsonl' is divided from the original ‘hdvila_clip_text_100m.jsonl’? In addition to the selection of videos with more than 4 clips mentioned in the paper, are there any details?
Where can I find annotation files containing video captions, "hdvila_clip_text_100m.jsonl" ? Thanks
Is the line in 'lfvila8m_clipid.jsonl' a video clips-sentence pair? And I see an variational number of video-clips per row. So how the video-clips of 'lfvila8m_clipid.jsonl' is divided from the original ‘hdvila_clip_text_100m.jsonl’? In addition to the selection of videos with more than 4 clips mentioned in the paper, are there any details?
The text was updated successfully, but these errors were encountered: