[ieee 2009 seventh international workshop on content-based multimedia indexing (cbmi) - chania,...

2
Special Session: Multimedia Indexing for Content Based Search Marco Brambilla Politecnico di Milano P.zza L. da Vinci 32 - 20133 Milano , Italy [email protected] Francesco Saverio Nucci Engineering Ingegneria Informatica Via San Martino della Battaglia 56 - 00185 Roma [email protected] Worldwide, the volume of stored information is growing exponentially, and an increasing share is audiovisual con- tent. This content drives the demand for new services, mak- ing audiovisual search one of the major challenges for or- ganisations and businesses today. Digital data is the greatest value that many organisations possess, and the ability to use it, rather than just store it, will be one of the most important strategic aspects in the coming decade. In this scenario, several research efforts focused on studying advanced search architectures for enabling con- sumers, businesses, and organisations to unlock the val- ues found in audiovisual content through innovative access paradigms. In particular, the focus of current researches is on managing and enabling access to information sources of all types, supporting advanced audiovisual processing and content handling that will enhance control, creation, and sharing of multimedia for all users in the value chain. Several research projects financed by the European Com- mission tackle this problem from different perspectives and provide diverse visions for the future. This will impact in the mid-long term on audiovisual industry, allowing com- panies to provide more effective and efficient access to con- tents thanks to innovative annotation techniques and search paradigms. The aim of this CBMI special session, organized with the support of the PHAROS project 1 , is to offer an overview of the research initiatives at European level that address the problems related to processing, annotation, indexing, and provisioning of contents within search applications. The session includes nine peer-reviewed contributions, reported in this volume, and an invited speech. The invited speech, given by professor Stefano Ceri, from Politecnico di Milano, Italy, will deliver a vision- ary discussion on the topic of Search Computing, a novel multi-disciplinary science which will provide the abstrac- tions, foundations, methods, and tools required to answer cross-domain search queries, that cannot be addressed by 1 PHAROS (Platform for searcHing of Audiovisual Resources across Online Spaces) Integrated Project (IST-2005-2.6.3) financed by the EC IST 6th Framework. current search engines. A typical example of multi-domain query is “Where can I attend an interesting Information Re- trieval conference close to a sunny beach, with direct flight connection to Europe and having a nice and cheap hotel accomodation?”. The generality of the problem makes it extremely relevant for the information retrieval community and poses additional challenges to the field of multimedia content annotation and indexing. The Search Computing project is currently financed by the ERC under the IDEAS Advanced Grants programme. The other contributions to the session include a work by Daras and Axenopoulos, that present a novel view-based approach for 3D object retrieval, that exploits automatic generation of a set of 2D images from a 3D object for cal- culating a global shape similarity between two 3D models that can support multimodal queries. The work by Bozzon, Brambilla, and Fraternali dis- cusses the use of a model-driven approach for specifying multimedia indexing processes, verifying properties of in- terest in such processes, and generating the code that or- chestrates the components, so as to enable rapid prototyp- ing of content analysis processes in presence of evolving requirements. Zidouni, Quafafou, and Glotin focus on the role of struc- tures in named entity retrieval inside audio transcription. They exploit this information, extracted through Condi- tional Random Fields (CRFs), for deducing an optimal hier- archical structure of the space of concepts(named entities), which are represented by nodes or any sub-paths in the hi- erarchy. The paper by Schreer et al. reports on the experiences achieved by the RUSHES EU research project in the field of multimedia analysis and reuse of raw un-edited audiovisual content, providing insights on automatic semantic annota- tion, indexing and retrieval of raw content, as well as on the main features of the final RUSHES search engine. Ren and Jose present their work on exploitation of the media document representation called feature terms to gen- erate a query from multiple media examples, such as im- ages. Their approach helps feature accumulation from mul- 2009 Seventh International Workshop on Content-Based Multimedia Indexing 978-0-7695-3662-0/09 $25.00 © 2009 IEEE DOI 10.1109/CBMI.2009.54 113

Upload: francesco-saverio

Post on 24-Feb-2017

216 views

Category:

Documents


2 download

TRANSCRIPT

Special Session: Multimedia Indexing for Content Based Search

Marco BrambillaPolitecnico di Milano

P.zza L. da Vinci 32 - 20133 Milano , [email protected]

Francesco Saverio NucciEngineering Ingegneria Informatica

Via San Martino della Battaglia 56 - 00185 [email protected]

Worldwide, the volume of stored information is growing

exponentially, and an increasing share is audiovisual con-

tent. This content drives the demand for new services, mak-

ing audiovisual search one of the major challenges for or-

ganisations and businesses today. Digital data is the greatest

value that many organisations possess, and the ability to use

it, rather than just store it, will be one of the most important

strategic aspects in the coming decade.

In this scenario, several research efforts focused on

studying advanced search architectures for enabling con-

sumers, businesses, and organisations to unlock the val-

ues found in audiovisual content through innovative access

paradigms. In particular, the focus of current researches is

on managing and enabling access to information sources

of all types, supporting advanced audiovisual processing

and content handling that will enhance control, creation,

and sharing of multimedia for all users in the value chain.

Several research projects financed by the European Com-

mission tackle this problem from different perspectives and

provide diverse visions for the future. This will impact in

the mid-long term on audiovisual industry, allowing com-

panies to provide more effective and efficient access to con-

tents thanks to innovative annotation techniques and search

paradigms.

The aim of this CBMI special session, organized with the

support of the PHAROS project1, is to offer an overview of

the research initiatives at European level that address the

problems related to processing, annotation, indexing, and

provisioning of contents within search applications. The

session includes nine peer-reviewed contributions, reported

in this volume, and an invited speech.

The invited speech, given by professor Stefano Ceri,

from Politecnico di Milano, Italy, will deliver a vision-

ary discussion on the topic of Search Computing, a novel

multi-disciplinary science which will provide the abstrac-

tions, foundations, methods, and tools required to answer

cross-domain search queries, that cannot be addressed by

1PHAROS (Platform for searcHing of Audiovisual Resources across

Online Spaces) Integrated Project (IST-2005-2.6.3) financed by the EC IST

6th Framework.

current search engines. A typical example of multi-domain

query is “Where can I attend an interesting Information Re-

trieval conference close to a sunny beach, with direct flight

connection to Europe and having a nice and cheap hotel

accomodation?”. The generality of the problem makes it

extremely relevant for the information retrieval community

and poses additional challenges to the field of multimedia

content annotation and indexing. The Search Computing

project is currently financed by the ERC under the IDEAS

Advanced Grants programme.

The other contributions to the session include a work by

Daras and Axenopoulos, that present a novel view-based

approach for 3D object retrieval, that exploits automatic

generation of a set of 2D images from a 3D object for cal-

culating a global shape similarity between two 3D models

that can support multimodal queries.

The work by Bozzon, Brambilla, and Fraternali dis-

cusses the use of a model-driven approach for specifying

multimedia indexing processes, verifying properties of in-

terest in such processes, and generating the code that or-

chestrates the components, so as to enable rapid prototyp-

ing of content analysis processes in presence of evolving

requirements.

Zidouni, Quafafou, and Glotin focus on the role of struc-

tures in named entity retrieval inside audio transcription.

They exploit this information, extracted through Condi-

tional Random Fields (CRFs), for deducing an optimal hier-

archical structure of the space of concepts(named entities),

which are represented by nodes or any sub-paths in the hi-

erarchy.

The paper by Schreer et al. reports on the experiences

achieved by the RUSHES EU research project in the field of

multimedia analysis and reuse of raw un-edited audiovisual

content, providing insights on automatic semantic annota-

tion, indexing and retrieval of raw content, as well as on the

main features of the final RUSHES search engine.

Ren and Jose present their work on exploitation of the

media document representation called feature terms to gen-

erate a query from multiple media examples, such as im-

ages. Their approach helps feature accumulation from mul-

2009 Seventh International Workshop on Content-Based Multimedia Indexing

978-0-7695-3662-0/09 $25.00 © 2009 IEEE

DOI 10.1109/CBMI.2009.54

113

tiple examples and enables the exploration of text-based

retrieval models for multimedia retrieval. Their work dis-

cusses three optimisation criteria for feature term selection

and two ranking functions for relevance estimation.

The paper by Dunker et al., from Fraunhofer IDMT, de-

scribes a solution for automated slideshow generation by

extracting a set of high-level features from music and in-

telligently combining this set with image high-level fea-

tures. The technique allows users to create personal-

ized slideshows intelligently combining music and pictures,

based on their features.

Georges Quenot et al. propose an approach based on the

International Phonetic Alphabet (IPA) for content-based in-

dexing and retrieval of multilingual audiovisual documents.

The approach includes the building of an IPA-based multi-

lingual acoustic model and a dynamic programming based

method for searching document segments by IPA string

spotting. The approach works even if the languages of the

document are unknown.

Cyril Laurier and the Music Technology Group from

Universitat Pompeu Fabra present a robust and efficient

technique for automatic music mood annotation. A songs

mood is expressed by a supervised machine learning ap-

proach based on musical features extracted from the raw

audio signal, that grants robustness to different audio com-

pression schemes.

Smeaton and Rothwell explore three questions related

to automatic detection of film highlights by measuring the

physiological responses of viewers of those films: whether

emotional highlights can be detected through viewer bio-

metrics; whether individuals watching a film in a group ex-

perience similar emotional reactions; and whether the pres-

ence of music in a film correlates with the occurrence of

emotional highlights. The results of this work could impact

the way in which movie content is indexed, e.g., by paying

special significance to movie segments that are likely to be

highlights.

The diverse contributions and skills of the participants

make the session an extremely interesting workbench for

open discussion and cross-fertilization between different

disciplines and across various research project consortia.

The contributions also give a hint on the creative and pro-

ductive area of multimedia search and indexing in the con-

text of the EU-funded research.

Marco BrambillaFrancesco S. Nucci

114