Improving tag transfer for image annotation using visual and semantic information

Sergio Rodriguez-Vaamonde, Lorenzo Torresani, Koldo Espinosa, Estibaliz Garrote

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper addresses the problem of image annotation using a combination of visual and semantic information. Our model involves two stages: a Nearest Neighbor computation and a tag transfer stage that collects the final annotations. For the latter stage, several algorithms have been implemented in the past using labels' information or including implicitly some visual features. In this paper we propose a novel algorithm for tag transfer that takes advantage explicitly of semantic and visual information. We also present a structured training procedure based on a concept we have called Image Networking: all the images in a training database are 'connected' visually and semantically, so it is possible to exploit these connections to learn the tag transfer parameters at annotation time. This learning is local for the test image and it exploits the information obtained in the Nearest Neighbor computation stage. We demonstrate that our approach achieves state-of-The-art performance on the ImageCLEF2011 dataset.

Original languageEnglish
Title of host publication2014 12th International Workshop on Content-Based Multimedia Indexing, CBMI 2014
PublisherIEEE Computer Society
ISBN (Print)9781479939909
DOIs
Publication statusPublished - 2014
Event12th International Workshop on Content-Based Multimedia Indexing, CBMI 2014 - Klagenfurt, Austria
Duration: 18 Jun 201420 Jun 2014

Publication series

NameProceedings - International Workshop on Content-Based Multimedia Indexing
ISSN (Print)1949-3991

Conference

Conference12th International Workshop on Content-Based Multimedia Indexing, CBMI 2014
Country/TerritoryAustria
CityKlagenfurt
Period18/06/1420/06/14

Keywords

  • Image annotation
  • Image indexing
  • multi-modal information fusion
  • tag transfer

Fingerprint

Dive into the research topics of 'Improving tag transfer for image annotation using visual and semantic information'. Together they form a unique fingerprint.

Cite this