ImageCLEFfusion

Motivation

While deep neural network methods have proven their predictive power in a large number of tasks, there still are a number of particular domains where a single deep learning network is not enough for attaining high precision. Of course this phenomenon has further repercussions, as it may impede future development or even market integration and adoption for methods that target those particular tasks and domains. Late fusion (also called ensembling or decision-level fusion) represents one of the methods that researchers in machine learning employ in order to increase the performance of single-system approaches. It consists of using a series of weaker learner methods called inducers, that are trained and tested on the dataset, whose prediction outputs are combined in the final step, via a fusion method (also called ensembling method or strategy) in order to create a new and improved set of predictions. These systems have a long history and are shown to be particularly useful in scenarios where the performance of single-system approaches is not considered satisfactory.

The usefulness of ensembling methods has been proved in a large number of tasks in the current literature. To this point, these approaches have been successful even with a low number of inducers, in traditional tasks such as video action recognition [Sudhakaran2020]. However, the ImageCLEFfusion 2022 task proposes some differences to these kinds of approaches.

First of all, we propose to focus this task on the prediction of a couple of subjective concepts, where ground truth is not absolute and may be different for different annotators. Therefore, we choose the prediction of media interestingness and search result diversification. Secondly, we wish to explore the power of late fusion approaches as much as possible, and therefore propose to provide a set of prediction results extracted from a very large number of inducers.

We are interested in exploring a number of aspects of fusion approaches for this task, including but not limited to: the performance of different fusion methods, methods for selecting inducers from a larger given set of inducers, the exploitation of positive and negative correlations between inducers, etc.

News

More information will be added soon!

Preliminary Schedule

15.11.2021: registration opens for all ImageCLEF tasks
28.01.2022: development data release starts
25.03.2022: test data release starts
06.05.2022: deadline for submitting the participants runs
13.05.2022: release of the processed results by the task organizers
27.05.2022: deadline for submission of working notes papers by the participants
13.06.2022: notification of acceptance of the working notes papers
01.07.2022: camera ready working notes papers

Task description

In a general sense, ensembling systems are represented by an algorithm or function F that, given a set S composed of M samples, and a set A composed of N inducers that output a vector of N predictions for each sample, is able to create a newer and better set of predictions for the set of samples by combining the outputs for each individual sample.

Fusion scheme

As mentioned, we provide two types of data for ImageCLEFfusion, generating two tasks, related to:

media interestingness (ImageCLEFfusion-int), a regression task
result diversification (ImageCLEFfusion-div), a retrieval task

For ImageCLEFfusion, we will provide the outputs of the inducers associated with the media samples. The use of external data is prohibited for the two tasks, as well as the development and the use of any additional inducers other than the ones we provide. In this way we want to ensure a fair comparison of the fusion method and inducer selection principles without any variations in the inducer set.

Data

ImageCLEFfusion-int. The data for this task is extracted and corresponds to the Interestingness10k dataset [Constantin2021b]. We will provide output data from 33 inducers, while 1826 samples will be used for the development set, and 609 samples will be used for the testing set.

ImageCLEFfusion-div. The data for this task is extracted and corresponds to the Retrieving Diverse Social Images Task dataset [Ionescu2020]. We will provide outputs data from 117 inducers, while 104 queries will be used for the development set, and 35 samples will be used for the testing set.

Evaluation methodology

Evaluation will be performed by using the metrics specific to each dataset we use. Therefore, we will use MAP@10 for the Interestingness10k dataset (ImageCLEFfusion-int task), and F1 and Cluster Recall at 20 for the diversity dataset (ImageCLEFfusion-div task). We will provide scripts that will help you calculate these metrics and instructions with regards to file structure.

Participant registration

Please refer to the general ImageCLEF registration instructions.

Submission instructions

The submissions will be received through the AIcrowd system.
Participants will be permitted to submit up to 10 runs. External training data is not allowed.

More information will be added soon!

Results

Diversificaiton

Participant Name	Best Run	F1@20	CR@20
AIMultimediaLab	183900	0.6216	0.4916
klssncse	183156	0.5634	0.4414
shreya_sriram	182834	0.5604	0.4373

Media Interestingness

Participant Name	Best Run	MAP@10
AIMultimediaLab	183921	0.2192
ssn_it	181512	0.1106
UECORK	183804	0.1097

CEUR Working Notes

All participating teams with at least one graded submission, regardless of the score, should submit a CEUR working notes paper.
Official detailed instructions for the CLEF 2022 working notes can be found here: https://clef2022.clef-initiative.eu/index.php?page=Pages/instructions_for_authors.html.

Citations

When referring to ImageCLEF 2022, please cite the following publication:

Bogdan Ionescu, Henning Müller, Renaud Péteri, Johannes Rückert, Asma Ben Abacha, Alba García Seco de Herrera, Christoph M. Friedrich, Louise Bloch, Raphael Brüngel, Ahmad Idrissi-Yaghir, Henning Schäfer, Serge Kozlovski, Yashin Dicente Cid, Vassili Kovalev, Liviu-Daniel Ștefan, Mihai Gabriel Constantin, Mihai Dogariu, Adrian Popescu, Jérôme Deshayes-Chossart, Hugo Schindler, Jon Chamberlain, Antonio Campello, Adrian Clark, Overview of the ImageCLEF 2022: Multimedia Retrieval in Medical, Social Media and Nature Applications, in Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings of the 13th International Conference of the CLEF Association (CLEF 2022), Springer Lecture Notes in Computer Science LNCS, Bologna, Italy, September 5-8, 2022.
BibTex:
@inproceedings{ImageCLEF2022,
author = {Bogdan Ionescu and Henning M\"uller and Renaud P\’{e}teri and Johannes R\"uckert and Asma {Ben Abacha} and Alba Garc\’{\i}a Seco de Herrera and Christoph M. Friedrich and Louise Bloch and Raphael Br\"ungel and Ahmad Idrissi-Yaghir and Henning Sch\"afer and Serge Kozlovski and Yashin Dicente Cid and Vassili Kovalev and Liviu-Daniel \c{S}tefan and Mihai Gabriel Constantin and Mihai Dogariu and Adrian Popescu and J\'er\^ome Deshayes-Chossart and Hugo Schindler and Jon Chamberlain and Antonio Campello and Adrian Clark},
title = {{Overview of the ImageCLEF 2022}: {Multimedia Retrieval in Medical, Social Media and Nature Applications}},
booktitle = {Experimental IR Meets Multilinguality, Multimodality, and Interaction},
series = {Proceedings of the 13th International Conference of the CLEF Association (CLEF 2022)},
year = 2022,
volume = {},
publisher = {{LNCS} Lecture Notes in Computer Science, Springer},
pages = {},
month = {September 5-8},
address = {Bologna, Italy}
}

When referring to ImageCLEF2022Fusion general goals, general results, etc. please cite the following publication which will be published by September 2022:

Ștefan Liviu-Daniel, Constantin Mihai Gabriel, Dogariu Mihai and Ionescu Bogdan. Overview of the ImageCLEFfusion 2022 Task: Ensembling Methods for Media Interestingness Prediction and Result Diversification, in Experimental IR Meets Multilinguality, Multimodality, and Interaction. CEUR Workshop Proceedings (CEUR-WS.org), Bologna, Italy, September 5-8, 2022.
BibTex:
@inproceedings{ImageCLEF2022Fusion,
author = {Liviu-Daniel \c{S}tefan and Mihai Gabriel Constantin and Mihai Dogariu and Bogdan Ionescu},
title = {Overview of {ImageCLEFfusion} 2022 Task -- {Ensembling Methods for Media Interestingness Prediction and Result Diversification}},
booktitle = {CLEF2022 Working Notes},
series = {{CEUR} Workshop Proceedings},
year = {2022},
volume = {},
publisher = {CEUR-WS.org },
pages = {},
month = {September 5-8},
address = {Bologna, Italy}
}

References

[Sudhakaran2020] Sudhakaran, S., Escalera, S., & Lanz, O. (2020). Gate-shift networks for video action recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 1102-1111).
[Sagi2018] Sagi, O., & Rokach, L. (2018). Ensemble learning: A survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 8(4), e1249.
[Gomes2017] Gomes, H. M., Barddal, J. P., Enembreck, F., & Bifet, A. (2017). A survey on ensemble learning for data stream classification. ACM Computing Surveys (CSUR), 50(2), 1-36.
[Ştefan2020] Ştefan, L. D., Constantin, M. G., & Ionescu, B. (2020, June). System Fusion with Deep Ensembles. In Proceedings of the 2020 International Conference on Multimedia Retrieval (pp. 256-260).
[Constantin2021a] Constantin, M. G., Ştefan, L. D., & Ionescu, B. (2021, June). DeepFusion: Deep Ensembles for Domain Independent System Fusion. In the International Conference on Multimedia Modeling (pp. 240-252). Springer, Cham.
[Constantin2022] Constantin, M. G., Ştefan, L. D., & Ionescu, B. (2022). Exploring Deep Fusion Ensembling for Automatic Visual Interestingness Prediction. In Human Perception of Visual Information (pp. 33-58). Springer, Cham.
[Constantin2021b] Constantin, M. G., Ştefan, L. D., Ionescu, B., Duong, N. Q., Demarty, C. H., & Sjöberg, M. (2021). Visual Interestingness Prediction: A Benchmark Framework and Literature Review. International Journal of Computer Vision, 1-25.
[Ionescu2020] Ionescu, B., Rohm, M., Boteanu, B., Gînscă, A. L., Lupu, M., & Müller, H. (2020). Benchmarking Image Retrieval Diversification Techniques for Social Media. IEEE Transactions on Multimedia, 23, 677-691.

Organizers

Liviu-Daniel Ștefan, University Politehnica of Bucharest, Romania.
Mihai Gabriel Constantin, University Politehnica of Bucharest, Romania.
Mihai Dogariu, University Politehnica of Bucharest, Romania.
Bogdan Ionescu, University Politehnica of Bucharest, Romania

Acknowledgements

This task is supported under project AI4Media, A European Excellence Centre for Media, Society and Democracy, H2020 ICT-48-2020, grant #951911.

Navigation

You are here