ViSMaP: Unsupervised Summarization of Hour-Long Videos Using Meta-Prompting and Short-Form Datasets

By The News | April 28, 2025

Video captioning models are typically trained on datasets consisting of short videos, usually under three minutes in length, paired with…