Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transform from v2d format into video_rgb format and save in video_rgb/ directory #10

Open
kdu4108 opened this issue Jul 4, 2024 · 2 comments

Comments

@kdu4108
Copy link
Collaborator

kdu4108 commented Jul 4, 2024

Goal: given v2d format of

 ├── 00000.tar
 |     ├── 00000.mp4
 |     ├── 00000.txt
 |     ├── 00000.json
 |     ├── 00001.mp4
 |     ├── 00001.txt
 |     ├── 00001.json
 |     └── ...
 |     ├── 10000.mp4
 |     ├── 10000.txt
 |     ├── 10000.json
 ├── 00001.tar
 |     ├── 10001.mp4
 |     ├── 10001.txt
 |     ├── 10001.json
 │     ...
 ...

produce a video_rgb/ modality data folder of the following format:

root/video_rgb/shard-00000.tar
 |     ├── 00000.mp4 # this corresponds to one video.
 |     ├── 00001.mp4
 |     └── ...

Option 1: This should mostly just involve extracting the mp4/video files from the video2dataset format and moving it into the right directory paths.

Option 2: We can use v2d now to normalize the videos by making them same number of frames, etc.

We choose option #2 because by the time we get something in a modality folder, it should already be the last preprocessing step before pseudolabeling for aligned data.

Child issue of #3.

@kdu4108
Copy link
Collaborator Author

kdu4108 commented Jul 22, 2024

Finished by #17

@kdu4108
Copy link
Collaborator Author

kdu4108 commented Jul 22, 2024

One thing we overlooked is we actually want to have a directory of the format

root/video_rgb/train/*.tar
root/video_rgb/val/*.tar
root/video_rgb/test/*.tar

So we should modify the script that goes from raw to video_rgb to do this train/val/test split as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants