- MediaEval Media Memorability 2019 Task | github
- MediaEval Media Memorability 2020 Task | github
- MediaEval Media Memorability 2021 Task | github
Please cite the following if you use this code.
@inproceedings{reboud2019combining,
title={Combining Textual and Visual Modeling for Predicting Media Memorability},
author={Reboud, Alison and Harrando, Ismail and Laaksonen, Jorma and Francis, Danny and Troncy, Rapha{\"e}l and Mantec{\'o}n, H{\'e}ctor Laria},
booktitle = {MediaEval 2019: Multimedia Benchmark Workshop},
year={2019},
address = {Sophia Antipolis, France}
}
@inproceedings{reboud2020predicting,
title={Predicting Media Memorability with Audio, Video, and Text representation},
author={Reboud, Alison and Harrando, Ismail and Laaksonen, Jorma and Troncy, Rapha{\"e}l and others},
booktitle={MediaEval 2020: Multimedia Benchmark Workshop},
year={2020}
}
@inproceedings{reboud2021exploring,
title={Exploring Multimodality, Perplexity and Explainability for Memorability Prediction},
author={Reboud, Alison and Harrando, Ismail and Laaksonen, Jorma and Troncy, Rapha{\"e}l and others},
booktitle={MediaEval 2021: Multimedia Benchmark Workshop},
year={2021}
}
For the 2021 edition we submitted several approaches. The first is a multimodal approach (vision, audio, text) with early fusion: the features of all modalities are concatenated to produce a single prediction, as in the sketch below.
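The following minimal sketch illustrates the early-fusion idea only; the feature dimensions are made up, and the ridge regressor stands in for whatever model is used in the actual runs.

```python
# Early fusion sketch: concatenate per-modality features, then fit a single
# regressor on the fused vector. Dimensions and model are illustrative.
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
n_videos = 100

# Hypothetical pre-extracted features for each modality.
vision = rng.normal(size=(n_videos, 512))  # e.g. video-level visual features
audio = rng.normal(size=(n_videos, 128))   # e.g. audio embeddings
text = rng.normal(size=(n_videos, 300))    # e.g. caption embeddings

# Early fusion: one feature vector per video.
X = np.concatenate([vision, audio, text], axis=1)
y = rng.uniform(size=n_videos)             # memorability scores in [0, 1]

# A single regressor produces one prediction from the fused features.
model = Ridge(alpha=1.0).fit(X, y)
predictions = model.predict(X)
```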
We also proposed an explainable text-only approach, as well as an approach that relies on measuring text perplexity (see the sketch below).
Presentation slides
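As a hedged illustration of the perplexity idea, the snippet below scores a video caption with GPT-2 from the transformers library; the language model used in our actual runs may differ.

```python
# Sketch: perplexity of a caption under a pretrained language model (GPT-2
# here purely for illustration).
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

def perplexity(caption: str) -> float:
    """Perplexity of a caption: exp of the mean token cross-entropy."""
    inputs = tokenizer(caption, return_tensors="pt")
    with torch.no_grad():
        # With labels == input_ids, the model returns the mean
        # cross-entropy loss over the caption's tokens.
        loss = model(**inputs, labels=inputs["input_ids"]).loss
    return torch.exp(loss).item()

print(perplexity("a man is playing a guitar on stage"))
```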
Our approach for the 2020 edition is a weighted average method combining predictions made separately from visual, audio, textual, and visiolinguistic representations of videos. Two improvements over the 2019 approach are that we now use the audio modality and focus on video features (as opposed to image features), allowing us to better model action-rich videos.
Our approach for the 2019 edition is a weighted average method combining predictions made separately from visual, visual-embedding, and textual representations of videos.
The approach consists of computing the three scores independently and later averaging them, as in the sketch below.
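A minimal sketch of the weighted averaging step; the scores and weights here are made up, not the ones used in the submitted runs.

```python
# Combine per-modality memorability predictions with a weighted average.
import numpy as np

# Hypothetical per-video predictions from each of the three pipelines.
visual_scores = np.array([0.71, 0.64, 0.88])
embedding_scores = np.array([0.69, 0.60, 0.91])
textual_scores = np.array([0.75, 0.58, 0.85])

weights = np.array([0.4, 0.3, 0.3])  # illustrative weights summing to 1

final_scores = (weights[0] * visual_scores
                + weights[1] * embedding_scores
                + weights[2] * textual_scores)
```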
Read and follow textual_scores prediction's README.md
Extract ViLBERT features from the frozen task-agnostic ViLBERT model, following the instructions in the README.md under
vilbert/vilbert-multi-task
Obtain the memorability scores by running
python vilbert/mediaeval2020_pred.py
Read and follow PicSOM_prediction's README.md
Obtain the final score by running combine_scores_2020.py, a code snippet that evaluates all linear combinations of the per-modality scores in order to combine the different modalities (a conceptual sketch follows):
python combine_scores_2020.py
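A hedged sketch of the idea behind the combination script: exhaustively try weight combinations on a grid and keep the one that maximizes Spearman's rank correlation against the ground truth. The function and variable names below are illustrative, not the script's actual interface.

```python
# Exhaustive search over convex combinations of per-modality predictions,
# scored by Spearman's rank correlation.
import itertools
import numpy as np
from scipy.stats import spearmanr

def best_weights(preds, y_true, step=0.1):
    """preds: list of per-modality prediction arrays; y_true: ground truth."""
    grid = np.arange(0.0, 1.0 + step, step)
    best_ws, best_rho = None, -np.inf
    for ws in itertools.product(grid, repeat=len(preds)):
        if not np.isclose(sum(ws), 1.0):
            continue  # keep only weights that sum to 1
        combined = sum(w * p for w, p in zip(ws, preds))
        rho = spearmanr(combined, y_true).correlation
        if rho > best_rho:
            best_ws, best_rho = ws, rho
    return best_ws, best_rho

# Tiny demo on synthetic data: three noisy "modality" predictions.
rng = np.random.default_rng(0)
y = rng.uniform(size=50)
preds = [y + rng.normal(0, s, 50) for s in (0.1, 0.2, 0.3)]
weights, rho = best_weights(preds, y)
```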
For the 2019 edition, obtain the final scores by running
python combine_scores_2019.py
Note: all scripts ending in _ns (not submitted) are experiments that were not included in the final runs.
Not submitted: late_fusion_2021_ns.py
SVR_ensemble_2021_ns.py is an additional experiment for late fusion (over the per-modality and perplexity scores) with scores obtained with a Support Vector Regressor; a hedged sketch of the idea follows.
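The sketch below shows one plausible reading of that experiment: the per-modality predictions and the perplexity scores are stacked as input features to a single SVR that produces the fused prediction. The data and the exact fusion structure are assumptions, not the script's actual contents.

```python
# Late fusion with an SVR over per-modality prediction scores plus a
# perplexity score per video (synthetic data for illustration).
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)
n = 200

# Columns: visual, audio, and textual predictions, plus caption perplexity.
features = np.column_stack([
    rng.uniform(size=n),          # visual prediction
    rng.uniform(size=n),          # audio prediction
    rng.uniform(size=n),          # textual prediction
    rng.normal(50, 10, size=n),   # caption perplexity
])
y = rng.uniform(size=n)           # ground-truth memorability

model = SVR(kernel="rbf", C=1.0).fit(features, y)
fused_predictions = model.predict(features)
```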