ViSMaP: Unsupervised Summarization of Hour-Long Videos Using Meta-Prompting and Short-Form Datasets

By The News
April 28, 2025

Video captioning models are typically trained on datasets consisting of short videos, usually under three minutes in length, paired with…