Transform from v2d format into video_rgb format and save in `video_rgb/` directory #10

kdu4108 · 2024-07-04T15:04:02Z

Goal: given v2d format of

 ├── 00000.tar
 |     ├── 00000.mp4
 |     ├── 00000.txt
 |     ├── 00000.json
 |     ├── 00001.mp4
 |     ├── 00001.txt
 |     ├── 00001.json
 |     └── ...
 |     ├── 10000.mp4
 |     ├── 10000.txt
 |     ├── 10000.json
 ├── 00001.tar
 |     ├── 10001.mp4
 |     ├── 10001.txt
 |     ├── 10001.json
 │     ...
 ...

produce a video_rgb/ modality data folder of the following format:

root/video_rgb/shard-00000.tar
 |     ├── 00000.mp4 # this corresponds to one video.
 |     ├── 00001.mp4
 |     └── ...

Option 1: This should mostly just involve extracting the mp4/video files from the video2dataset format and moving it into the right directory paths.

Option 2: We can use v2d now to normalize the videos by making them same number of frames, etc.

We choose option #2 because by the time we get something in a modality folder, it should already be the last preprocessing step before pseudolabeling for aligned data.

Child issue of #3.

The text was updated successfully, but these errors were encountered:

kdu4108 · 2024-07-22T16:39:22Z

Finished by #17

kdu4108 · 2024-07-22T16:41:27Z

One thing we overlooked is we actually want to have a directory of the format

root/video_rgb/train/*.tar
root/video_rgb/val/*.tar
root/video_rgb/test/*.tar

So we should modify the script that goes from raw to video_rgb to do this train/val/test split as well.

kdu4108 self-assigned this Jul 4, 2024

kdu4108 mentioned this issue Jul 4, 2024

[PARENT ISSUE] Data preprocessing and pseudolabeling #3

Open

kdu4108 assigned yahya010 and kdu4108 and unassigned kdu4108 Jul 5, 2024

kdu4108 added the in progress label Jul 10, 2024

markus583 mentioned this issue Jul 11, 2024

add save_vq_tokens_vid.py #14

Open

kdu4108 added the under review label Jul 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transform from v2d format into video_rgb format and save in `video_rgb/` directory #10

Transform from v2d format into video_rgb format and save in `video_rgb/` directory #10

kdu4108 commented Jul 4, 2024 •

edited

Loading

kdu4108 commented Jul 22, 2024

kdu4108 commented Jul 22, 2024

Transform from v2d format into video_rgb format and save in video_rgb/ directory #10

Transform from v2d format into video_rgb format and save in video_rgb/ directory #10

Comments

kdu4108 commented Jul 4, 2024 • edited Loading

kdu4108 commented Jul 22, 2024

kdu4108 commented Jul 22, 2024

Transform from v2d format into video_rgb format and save in `video_rgb/` directory #10

Transform from v2d format into video_rgb format and save in `video_rgb/` directory #10

kdu4108 commented Jul 4, 2024 •

edited

Loading