You were saying? – Spoken Language in the V3C Dataset

12/15/2022
by Luca Rossetto, et al.

This paper presents an analysis of the distribution of spoken language in the V3C video retrieval benchmark dataset, based on automatically generated transcripts. It finds that a large portion of the dataset is covered by spoken language. Since transcripts of spoken language can be generated quickly and accurately, this has implications for retrieval tasks such as known-item search.


research
06/10/2021

VT-SSum: A Benchmark Dataset for Video Transcript Segmentation and Summarization

Video transcript summarization is a fundamental task for video understan...
research
09/23/2020

The importance of fillers for text representations of speech transcripts

While being an essential component of spoken language, fillers (e.g."um"...
research
10/14/2021

Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

Visually-grounded spoken language datasets can enable models to learn cr...
research
11/02/2020

IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP

Although the Indonesian language is spoken by almost 200 million people ...
research
09/19/2023

Multimodal Modeling For Spoken Language Identification

Spoken language identification refers to the task of automatically predi...
research
06/20/2023

MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian

Multimodal learning on video and text data has been receiving growing at...
research
09/23/2022

Cem Mil Podcasts: A Spoken Portuguese Document Corpus

This document describes the Portuguese language podcast dataset released...
