WebVid-10M is a large-scale dataset of short videos with textual descriptions, sourced from stock footage sites. The videos are diverse and rich in content.
- 10.7M video-caption pairs.
- 52K total video hours.
Project page:
https://maxbain.com/webvid-dataset
Train split:
http://www.robots.ox.ac.uk/~maxbain/webvid/results_10M_train.csv
Validation split:
http://www.robots.ox.ac.uk/~maxbain/webvid/results_10M_val.csv
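The split files listed above are CSVs, so a quick way to inspect one is to stream it and look at the first few rows. The following is a minimal sketch using only the Python standard library. It assumes the URLs are still reachable and that the files are UTF-8 with a header row. The helper names `open_split` and `peek` are hypothetical, and no column names are assumed: rows are keyed by whatever header the file actually contains.

```python
import csv
import io
import itertools
import urllib.request

# URLs copied from the split listing above.
TRAIN_CSV = "http://www.robots.ox.ac.uk/~maxbain/webvid/results_10M_train.csv"
VAL_CSV = "http://www.robots.ox.ac.uk/~maxbain/webvid/results_10M_val.csv"


def open_split(url):
    """Open a split URL as a text stream (assumption: UTF-8 encoded)."""
    response = urllib.request.urlopen(url)
    return io.TextIOWrapper(response, encoding="utf-8", newline="")


def peek(text_stream, n=3):
    """Return the first n rows as dicts keyed by the file's own header."""
    reader = csv.DictReader(text_stream)
    return list(itertools.islice(reader, n))


if __name__ == "__main__":
    # Stream the (smaller) validation split and print a few rows.
    with open_split(VAL_CSV) as stream:
        for row in peek(stream):
            print(row)
```

Streaming with `itertools.islice` avoids downloading the full file just to check its header and a few sample rows.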
Video data coming soon.
