Diffusing Surrogate Dreams of Video Scenes to Predict Video Memorability


As part of the MediaEval 2022 Predicting Video Memorability task, we explore the relationship between visual memorability, the visual representation that characterises it, and the underlying concept portrayed by that visual representation. We achieve state-of-the-art memorability prediction performance with a model trained and tested exclusively on surrogate dream images, elevating concepts to the status of a cornerstone memorability feature, and find strong evidence to suggest that the intrinsic memorability of visual content can be distilled to its underlying concept or meaning, irrespective of its specific visual representation.


Authors: Lorin Sweeney, Graham Healy, Alan F. Smeaton
Published: 2022-12-19 09:10:23+00:00

Categories: cs.AI, cs.CV