Awesome papers & datasets specifically focused on long-term videos.
Updated Oct 9, 2025
[2025 TPAMI] Mettle: Meta-Token Learning for Memory-Efficient Audio-Visual Adaptation
Papers related to Weakly-supervised Audio-Visual Video Parsing (AVVP) and Audio-Visual Event Localization (AVE).