This collection of Wikipedia images was used in the ImageCLEF's wikipediaMM task to provide a testbed for the system-oriented evaluation of visual information retrieval. The aim is to investigate retrieval approaches in the context of a large and heterogeneous collection of images (similar to those encountered on the Web) that are searched for by users with diverse information needs.
This is an ad-hoc image retrieval task; the evaluation scenario is thereby similar to the classic TREC ad-hoc retrieval task: the system knows the set of documents to be searched, but the topics are not known to the system in advance. The goal of the simulation is: given a textual query and sample images describing a user's (multimedia) information need, find as many relevant images as possible from the Wikipedia image collection.